Exam Details

  • Exam Code
    :APACHE-HADOOP-DEVELOPER
  • Exam Name
    :Hadoop 2.0 Certification exam for Pig and Hive Developer
  • Certification
    :HCAHD
  • Vendor
    :Hortonworks
  • Total Questions
    :108 Q&As
  • Last Updated
    :Apr 23, 2024

Hortonworks HCAHD APACHE-HADOOP-DEVELOPER Questions & Answers

  • Question 1:

    When is the earliest point at which the reduce method of a given Reducer can be called?

    A. As soon as at least one mapper has finished processing its input split.

    B. As soon as a mapper has emitted at least one record.

    C. Not until all mappers have finished processing all records.

    D. It depends on the InputFormat used for the job.

  • Question 2:

    In Hadoop 2.0, which one of the following statements is true about a standby NameNode? The Standby NameNode:

    A. Communicates directly with the active NameNode to maintain the state of the active NameNode.

    B. Receives the same block reports as the active NameNode.

    C. Runs on the same machine and shares the memory of the active NameNode.

    D. Processes all client requests and block reports from the appropriate DataNodes.

  • Question 3:

    Given the following Hive commands:

    Which one of the following statements Is true?

    A. The file mydata.txt is copied to a subfolder of /apps/hive/warehouse

    B. The file mydata.txt is moved to a subfolder of /apps/hive/warehouse

    C. The file mydata.txt is copied into Hive's underlying relational database 0.

    D. The file mydata.txt does not move from Its current location in HDFS

  • Question 4:

    Consider the following two relations, A and B.

    A Pig JOIN statement that combined relations A by its first field and B by its second field would produce what output?

    A. 2 Jim Chris 2 3 Terry 3 4 Brian 4

    B. 2 cherry 2 cherry 3 orange 4 peach

    C. 2 cherry Jim, Chris 3 orange Terry

    4 peach Brian

    D. 2 cherry Jim 2 2 cherry Chris 2 3 orange Terry 3 4 peach Brian 4

  • Question 5:

    Review the following data and Pig code:

    What command to define B would produce the output (M,62,95l02) when invoking the DUMP operator on B?

    A. B = FILTER A BY (zip = = '95102' AND gender = = M");

    B. B= FOREACH A BY (gender = = 'M' AND zip = = '95102');

    C. B = JOIN A BY (gender = = 'M' AND zip = = '95102');

    D. B= GROUP A BY (zip = = '95102' AND gender = = 'M');

  • Question 6:

    You have written a Mapper which invokes the following five calls to the OutputColletor.collect method: output.collect (new Text ("Apple"), new Text ("Red") ) ;

    output.collect (new Text ("Banana"), new Text ("Yellow") ) ;

    output.collect (new Text ("Apple"), new Text ("Yellow") ) ;

    output.collect (new Text ("Cherry"), new Text ("Red") ) ;

    output.collect (new Text ("Apple"), new Text ("Green") ) ;

    How many times will the Reducer's reduce method be invoked?

    A. 6

    B. 3

    C. 1

    D. 0

    E. 5

  • Question 7:

    You want to run Hadoop jobs on your development workstation for testing before you submit them to your production cluster. Which mode of operation in Hadoop allows you to most closely simulate a production cluster while using a single machine?

    A. Run all the nodes in your production cluster as virtual machines on your development workstation.

    B. Run the hadoop command with the –jt local and the –fs file:///options.

    C. Run the DataNode, TaskTracker, NameNode and JobTracker daemons on a single machine.

    D. Run simldooop, the Apache open-source software for simulating Hadoop clusters.

  • Question 8:

    Can you use MapReduce to perform a relational join on two large tables sharing a key? Assume that the two tables are formatted as comma-separated files in HDFS.

    A. Yes.

    B. Yes, but only if one of the tables fits into memory

    C. Yes, so long as both tables fit into memory.

    D. No, MapReduce cannot perform relational operations.

    E. No, but it can be done with either Pig or Hive.

  • Question 9:

    Which one of the following statements describes a Hive user-defined aggregate function?

    A. Operates on multiple input rows and creates a single row as output

    B. Operates on a single input row and produces a single row as output

    C. Operates on a single input row and produces a table as output

    D. Operates on multiple input rows and produces a table as output

  • Question 10:

    Assuming the following Hive query executes successfully:

    Which one of the following statements describes the result set?

    A. A bigram of the top 80 sentences that contain the substring "you are" in the lines column of the input data A1 table.

    B. An 80-value ngram of sentences that contain the words "you" or "are" in the lines column of the inputdata table.

    C. A trigram of the top 80 sentences that contain "you are" followed by a null space in the lines column of the inputdata table.

    D. A frequency distribution of the top 80 words that follow the subsequence "you are" in the lines column of the inputdata table.

Tips on How to Prepare for the Exams

Nowadays, the certification exams become more and more important and required by more and more enterprises when applying for a job. But how to prepare for the exam effectively? How to prepare for the exam in a short time with less efforts? How to get a ideal result and how to find the most reliable resources? Here on Vcedump.com, you will find all the answers. Vcedump.com provide not only Hortonworks exam questions, answers and explanations but also complete assistance on your exam preparation and certification application. If you are confused on your APACHE-HADOOP-DEVELOPER exam preparations and Hortonworks certification application, do not hesitate to visit our Vcedump.com to find your solutions here.