You are running a Hadoop cluster with MapReduce version 2 (MRv2) on YARN. Map tasks on your cluster consistently run slowly because of excessive JVM garbage collection. How do you increase the JVM heap size to 3 GB to improve performance?
A. yarn.application.child.java.opts-Xax3072m
B. yarn.application.child.java.opts=-3072m
C. mapreduce.map.java.opts=-Xmx3072m
D. mapreduce.map.java.opts=-Xms3072m
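For context, `mapreduce.map.java.opts` is the MRv2 property that carries JVM flags for map tasks, and `-Xmx` is the standard JVM flag for maximum heap size. The following is a minimal sketch, not a definitive configuration, of how such a setting could be rendered as a `<property>` entry of the kind found in mapred-site.xml. The helper names `make_property` and `heap_opts` are hypothetical and exist only for this illustration.

```python
import xml.etree.ElementTree as ET


def make_property(name: str, value: str) -> ET.Element:
    """Build one Hadoop-style <property> element (hypothetical helper)."""
    prop = ET.Element("property")
    ET.SubElement(prop, "name").text = name
    ET.SubElement(prop, "value").text = value
    return prop


def heap_opts(max_heap_mb: int) -> str:
    """Format the JVM maximum-heap flag for the given size in megabytes."""
    return f"-Xmx{max_heap_mb}m"


# 3 GB expressed in megabytes, as in the question above.
config = ET.Element("configuration")
config.append(make_property("mapreduce.map.java.opts", heap_opts(3072)))

xml_text = ET.tostring(config, encoding="unicode")
print(xml_text)
```

In practice the container size (`mapreduce.map.memory.mb`) usually needs to be larger than the heap, since the JVM also uses non-heap memory.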
Which three basic configuration parameters must you set to migrate your cluster from MapReduce v1 (MRv1) to MapReduce v2 (MRv2)?
A. Configure the NodeManager hostname and enable services on YARN by setting the following property in yarn-site.xml:
B. Configure the number of map tasks per job on YARN by setting the following property in mapred-site.xml:
C. Configure MapReduce as a framework running on YARN by setting the following property in mapred-site.xml:
D. Configure the ResourceManager hostname and enable node services on YARN by setting the following property in yarn-site.xml:
E. Configure a default scheduler to run on YARN by setting the following property in mapred-site.xml:
F. Configure the NodeManager to enable MapReduce services on YARN by adding the following property in yarn-site.xml:
Your cluster is configured with HDFS and MapReduce version 2 (MRv2) on YARN. What is the result when you execute: hadoop jar samplejar.jar MyClass on a client machine?
A. SampleJar.jar is sent to the ApplicationMaster, which allocates a container for SampleJar.jar
B. SampleJar.jar is serialized into an XML file which is submitted to the ApplicationMaster
C. SampleJar.jar is sent directly to the ResourceManager
D. SampleJar.jar is placed in a temporary directory in HDFS
You suspect that your NameNode is incorrectly configured, and is swapping memory to disk. Which Linux commands help you to identify whether swapping is occurring? (Select 3)
A. free
B. df
C. memcat
D. top
E. vmstat
F. swapinfo
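As background for this question: on Linux, swap totals are exposed in `/proc/meminfo` through the `SwapTotal` and `SwapFree` fields. The sketch below parses that format and computes how much swap is in use. The function names and the `SAMPLE` text are hypothetical, chosen only for illustration. Note that a non-zero result only shows that pages have been swapped out at some point; ongoing swap activity is better observed through the `si` and `so` columns of `vmstat`.

```python
def parse_meminfo(text: str) -> dict:
    """Map each /proc/meminfo field name to its numeric value (usually kB)."""
    fields = {}
    for line in text.splitlines():
        name, sep, rest = line.partition(":")
        parts = rest.split()
        if sep and parts and parts[0].isdigit():
            fields[name.strip()] = int(parts[0])
    return fields


def swap_used_kb(fields: dict) -> int:
    """Swap in use = SwapTotal - SwapFree (0 if the fields are missing)."""
    return fields.get("SwapTotal", 0) - fields.get("SwapFree", 0)


# Hypothetical sample in /proc/meminfo format, used when the file is absent.
SAMPLE = """MemTotal:       16384000 kB
SwapTotal:       2097148 kB
SwapFree:        2000000 kB
"""

if __name__ == "__main__":
    try:
        with open("/proc/meminfo") as f:
            text = f.read()
    except OSError:
        text = SAMPLE
    print("swap used (kB):", swap_used_kb(parse_meminfo(text)))
```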
Your cluster's mapred-site.xml includes the following parameters
And your cluster's yarn-site.xml includes the following parameters
What is the maximum amount of virtual memory allocated for each map before YARN will kill its Container?
A. 4 GB
B. 17.2 GB
C. 24.6 GB
D. 8.2 GB
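The general rule behind this kind of question is that YARN's virtual memory limit for a container is the container's physical memory allocation multiplied by `yarn.nodemanager.vmem-pmem-ratio`, whose default is 2.1. The sketch below shows only that arithmetic. The 2048 MB container size is a hypothetical value for illustration and is not taken from the parameters of the question above.

```python
def vmem_limit_mb(container_mb: int, vmem_pmem_ratio: float = 2.1) -> float:
    """Virtual memory ceiling for a container, in MB.

    container_mb corresponds to a setting such as mapreduce.map.memory.mb;
    vmem_pmem_ratio corresponds to yarn.nodemanager.vmem-pmem-ratio.
    """
    return container_mb * vmem_pmem_ratio


# Hypothetical container size, NOT the value from the question.
example_container_mb = 2048
limit = vmem_limit_mb(example_container_mb)
print(f"{limit:.1f} MB = {limit / 1024:.2f} GB")
```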
On a cluster running CDH 5.0 or above, you use the hadoop fs -put command to write a 300 MB file into a previously empty directory using an HDFS block size of 64 MB. Just after this command has finished writing 200 MB of this file, what would another user see when they look in the directory?
A. They will see the file with its original name. If they attempt to view the file, they will get a ConcurrentFileAccessException until the entire file write is completed on the cluster.
B. They will see the file with a ._COPYING_ extension on its name. If they attempt to view the file, they will get a ConcurrentFileAccessException until the entire file write is completed on the cluster.
C. They will see the file with a ._COPYING_ extension on its name. If they view the file, they will see contents of the file up to the last completed block (as each 64 MB block is written, that block becomes available).
D. The directory will appear to be empty until the entire file write is completed on the cluster
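The block arithmetic in this scenario can be checked with a short sketch. It only counts blocks from the sizes stated in the question and makes no claim about what a reader of the file sees. The function name `block_counts` is hypothetical.

```python
import math

MB = 1024 * 1024


def block_counts(file_size: int, bytes_written: int, block_size: int):
    """Return (total blocks, fully written blocks, bytes in the partial block)."""
    total_blocks = math.ceil(file_size / block_size)
    completed_blocks = bytes_written // block_size
    partial_bytes = bytes_written % block_size
    return total_blocks, completed_blocks, partial_bytes


# Sizes from the question: 300 MB file, 200 MB written, 64 MB blocks.
total, completed, partial = block_counts(300 * MB, 200 * MB, 64 * MB)
print(total, completed, partial // MB)
```

With these inputs the file occupies 5 blocks in total; after 200 MB, 3 blocks (192 MB) are complete and 8 MB sit in the block currently being written.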
You are configuring a cluster running HDFS, MapReduce version 2 (MRv2) on YARN running Linux. How must you format the underlying filesystem of each DataNode?
A. They must not be formatted; HDFS will format the filesystem automatically
B. They may be formatted with any Linux filesystem
C. They must be formatted as HDFS
D. They must be formatted as either ext3 or ext4
Your cluster has the following characteristics:
A rack-aware topology is configured and enabled
Replication is set to 3
Cluster block size is set to 64 MB
Which describes the file read process when a client application connects to the cluster and requests a 50 MB file?
A. The client queries the NameNode which retrieves the block from the nearest DataNode to the client and then passes that block back to the client.
B. The client queries the NameNode for the locations of the block, and reads from a random location in the list it retrieves to eliminate network I/O loads by balancing which nodes it retrieves data from at any given time.
C. The client queries the NameNode for the locations of the block, and reads all three copies. The first copy to complete transfer to the client is the one the client reads as part of Hadoop's speculative execution framework.
D. The client queries the NameNode for the locations of the block, and reads from the first location in the list it receives.
Which YARN daemon or service negotiates map and reduce Containers from the Scheduler, tracking their status and monitoring for progress?
A. ResourceManager
B. ApplicationMaster
C. NodeManager
D. ApplicationManager
Which two are features of Hadoop's rack topology?
A. Configuration of rack awareness is accomplished using a configuration file. You cannot use a rack topology script.
B. Even for small clusters on a single rack, configuring rack awareness will improve performance.
C. Rack location is considered in the HDFS block placement policy
D. HDFS is rack aware but MapReduce daemons are not
E. Hadoop gives preference to intra-rack data transfer in order to conserve bandwidth