E20-007 Exam Details

  • Exam Code
    :E20-007
  • Exam Name
    :Data Science and Big Data Analytics
  • Certification
    :EMC Certifications
  • Vendor
    :EMC
  • Total Questions
    :198 Q&As
  • Last Updated
    :May 31, 2026

EMC E20-007 Online Questions & Answers

  • Question 141:

    Your colleague, who is new to Hadoop, approaches you with a question. They want to know how best to access their data. This colleague has previously worked extensively with SQL and databases. Which query interface would you recommend?

    A. Hive
    B. Pig
    C. Howl
    D. HBase

  • Question 142:

    Refer to the exhibit.

    You are assigned to do an end of the year sales analysis of 1, 000 different products, based on the transaction table. Which column in the end of year report requires the use of a window function?

    A. Total Sales to Date
    B. Daily Sales
    C. Average Daily Price
    D. Maximum Price

  • Question 143:

    Assume you are performing an analysis to determine fraud detection on credit card usage. You will need to ensure higher-risk transactions. These may indicate that fraudulent credit card activity is retained in your data for analysis and not dropped as outliers during pre- processing.

    What is the approach for loading data into the analytical sandbox for this analysis?

    A. ELT
    B. ETL
    C. EDW
    D. OLTP

  • Question 144:

    A data scientist plans to classify the sentiment polarity of 10, 000 product reviews collected from the Internet. What is the most appropriate model to use? Suppose labeled training data is available.

    A. Na飗e Bayesian classifier
    B. Linear regression
    C. Logistic regression
    D. K-means clustering

  • Question 145:

    In the MapReduce framework, what is the purpose of the Map Function?

    A. It processes the input and generates key-value pairs
    B. It collects the output of the Reduce function
    C. It sorts the results of the Reduce function
    D. It breaks the input into smaller components and distributes to other nodes in the cluster

  • Question 146:

    When is the GROUP BY ROLLUP clause used in an OLAP query?

    A. All subtotals and grand totals are to be included in the output
    B. Subtotals are only to be included in the output
    C. Grand totals are only to be included in the output D. Specific subtotals and grand totals for a combination of variables are only to be included in the output

  • Question 147:

    Refer to the exhibit.

    You have scored your Naive bayesian classifier model on a hold out test data for cross validation and determined the way the samples scored and tabulated them as shown in the exhibit. What are the the False Positive Rate (FPR) and the False Negative Rate (FNR) of the model?

    A. FPR = 15/262 FNR = 26/288
    B. FPR = 26/288 FNR = 15/262
    C. FPR = 262/15 FNR = 288/26
    D. FPR = 288/26 FNR = 262/15

  • Question 148:

    You have used k-means clustering to classify behavior of 100, 000 customers for a retail store. You decide to use household income, age, gender and yearly purchase amount as measures. You have chosen to use 8 clusters and notice that 2 clusters only have 3 customers assigned. What should you do?

    A. Decrease the number of clusters
    B. Increase the number of clusters
    C. Decrease the number of measures used
    D. Identify additional measures to add to the analysis

  • Question 149:

    You have been assigned to run a linear regression model for each of 5, 000 distinct districts, and all the data is currently stored in a PostgreSQL database. Which tool/library would you use to produce these models with the least effort?

    A. MADlib
    B. Mahout
    C. R
    D. HBase

  • Question 150:

    Which functionality do regular expressions provide?

    A. text pattern matching
    B. underflow prevention
    C. increased numerical precision
    D. decreased processing complexity

Tips on How to Prepare for the Exams

Nowadays, the certification exams become more and more important and required by more and more enterprises when applying for a job. But how to prepare for the exam effectively? How to prepare for the exam in a short time with less efforts? How to get a ideal result and how to find the most reliable resources? Here on Vcedump.com, you will find all the answers. Vcedump.com provide not only EMC exam questions, answers and explanations but also complete assistance on your exam preparation and certification application. If you are confused on your E20-007 exam preparations and EMC certification application, do not hesitate to visit our Vcedump.com to find your solutions here.