E20-007 Exam Details

  • Exam Code: E20-007
  • Exam Name: Data Science and Big Data Analytics
  • Certification: EMC Certifications
  • Vendor: EMC
  • Total Questions: 198 Q&As
  • Last Updated: May 31, 2026

EMC E20-007 Online Questions & Answers

  • Question 111:

    Consider the following itemsets:

    (hat, scarf, coat)

    (hat, scarf, coat, gloves)

    (hat, scarf, gloves)

    (hat, gloves)

    (scarf, coat, gloves)

    What is the confidence of the rule (hat, scarf) => gloves?

    A. 40%
    B. 50%
    C. 60%
    D. 66%
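
As a worked check for Question 111, here is a minimal sketch that assumes the usual definition of confidence: the number of itemsets containing both sides of the rule divided by the number containing the left-hand side. The helper names `support_count` and `confidence` are illustrative, not taken from any library.

```python
# Itemsets copied from Question 111.
transactions = [
    {"hat", "scarf", "coat"},
    {"hat", "scarf", "coat", "gloves"},
    {"hat", "scarf", "gloves"},
    {"hat", "gloves"},
    {"scarf", "coat", "gloves"},
]


def support_count(itemset, transactions):
    """Count the transactions that contain every item in `itemset`."""
    return sum(1 for t in transactions if itemset <= t)


def confidence(antecedent, consequent, transactions):
    """confidence(X => Y) = count(X and Y together) / count(X)."""
    both = support_count(antecedent | consequent, transactions)
    return both / support_count(antecedent, transactions)


rule_confidence = confidence({"hat", "scarf"}, {"gloves"}, transactions)
print(f"{rule_confidence:.1%}")  # prints 66.7%
```

Three itemsets contain (hat, scarf) and two of those also contain gloves, so the ratio is 2/3.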

  • Question 112:

    For which class of problem is MapReduce most suitable?

    A. Embarrassingly parallel
    B. Minimal result data
    C. Simple marginalization tasks
    D. Non-overlapping queries
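
To make the MapReduce pattern in Question 112 concrete, the sketch below runs a word count in a single Python process. It is not the Hadoop API: the function names (`map_phase`, `shuffle`, `reduce_phase`) and the input records are hypothetical. The point it illustrates is that each record is mapped independently of every other record, which is what allows the map step to be spread across many machines.

```python
from collections import defaultdict


def map_phase(record):
    """Emit a (word, 1) pair for each word; needs no other record."""
    return [(word.lower(), 1) for word in record.split()]


def shuffle(pairs):
    """Group emitted values by key, as the framework would between phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Combine all values for one key; each key is reduced independently."""
    return key, sum(values)


# Hypothetical input records, for illustration only.
records = ["hat scarf coat", "hat gloves", "scarf coat gloves"]

mapped = [pair for record in records for pair in map_phase(record)]
counts = dict(reduce_phase(k, v) for k, v in shuffle(mapped).items())
print(counts)
```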

  • Question 113:

    A data scientist wants to predict the probability of death from heart disease based on three risk factors: age, gender, and blood cholesterol level. What is the most appropriate method for this project?

    A. Logistic regression
    B. Linear regression
    C. K-means clustering
    D. Apriori algorithm
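
For Question 113, the sketch below shows why a logistic model suits a probability outcome: it passes a linear combination of the risk factors through the logistic function, so the result always lies between 0 and 1. The coefficients are made-up placeholders, not fitted values, and `predicted_probability` is a hypothetical helper.

```python
import math


def logistic(z):
    """Map any real number to the interval (0, 1)."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    # Equivalent form that avoids overflow for large negative z.
    e = math.exp(z)
    return e / (1.0 + e)


# Hypothetical coefficients, chosen only for illustration (not fitted to data).
INTERCEPT = -9.0
COEF_AGE = 0.08
COEF_MALE = 0.5
COEF_CHOLESTEROL = 0.01


def predicted_probability(age, is_male, cholesterol):
    """Probability from a linear score passed through the logistic function."""
    z = (INTERCEPT
         + COEF_AGE * age
         + COEF_MALE * is_male
         + COEF_CHOLESTEROL * cholesterol)
    return logistic(z)


p = predicted_probability(age=60, is_male=1, cholesterol=240)
print(round(p, 3))
```

A plain linear regression on the same inputs could return values below 0 or above 1, which cannot be read as probabilities.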

  • Question 114:

    Which graphical representation shows the distribution and multiple summary statistics of a continuous variable for each value of a corresponding discrete variable?

    A. box and whisker plot
    B. dotplot
    C. scatterplot
    D. binplot

  • Question 115:

    You are analyzing a time series and want to determine its stationarity. You also want to determine the order of autoregressive models. How are the autocorrelation functions used?

    A. ACF as an indication of stationarity, and PACF for the correlation between X_t and X_(t-k) not explained by their mutual correlation with X_1 through X_(k-1).
    B. PACF as an indication of stationarity, and ACF for the correlation between X_t and X_(t-k) not explained by their mutual correlation with X_1 through X_(k-1).
    C. ACF as an indication of stationarity, and PACF to determine the correlation of X_1 through X_(k-1).
    D. PACF as an indication of stationarity, and ACF to determine the correlation of X_1 through X_(k-1).
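
The two functions in Question 115 can be computed by hand, as in the sketch below. It assumes the standard sample autocorrelation estimator and obtains the partial autocorrelations with the Durbin-Levinson recursion; the simulated series and the names `sample_acf` and `pacf_from_acf` are illustrative. As a common rule of thumb, an ACF that decays very slowly hints at non-stationarity, while a PACF that drops to near zero after lag p suggests an autoregressive model of order p.

```python
import random


def sample_acf(series, max_lag):
    """Sample autocorrelations r_0 .. r_max_lag of a series."""
    n = len(series)
    mean = sum(series) / n
    dev = [x - mean for x in series]
    denom = sum(d * d for d in dev)
    return [
        sum(dev[t] * dev[t - k] for t in range(k, n)) / denom
        for k in range(max_lag + 1)
    ]


def pacf_from_acf(acf):
    """Partial autocorrelations for lags 1..K via Durbin-Levinson.

    `acf` must start with r_0 = 1. The lag-k value is the correlation
    left between two points k steps apart after removing the effect of
    the lags in between.
    """
    pacf = []
    prev = []  # coefficients of the order-(k-1) autoregressive fit
    for k in range(1, len(acf)):
        num = acf[k] - sum(prev[j] * acf[k - 1 - j] for j in range(k - 1))
        den = 1.0 - sum(prev[j] * acf[j + 1] for j in range(k - 1))
        phi_kk = num / den
        prev = [prev[j] - phi_kk * prev[k - 2 - j] for j in range(k - 1)]
        prev.append(phi_kk)
        pacf.append(phi_kk)
    return pacf


# Simulated AR(1) series with coefficient 0.7 (illustrative data only).
random.seed(0)
series = [0.0]
for _ in range(499):
    series.append(0.7 * series[-1] + random.gauss(0, 1))

acf = sample_acf(series, max_lag=5)
pacf = pacf_from_acf(acf)
print([round(r, 2) for r in acf])
print([round(p, 2) for p in pacf])
```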

  • Question 116:

    What is LOESS used for?

    A. It fits a smoothed curve to scatterplot data, to give a general sense of the data's behavior.
    B. It is a significance test for the correlation between two variables.
    C. It plots a continuous variable versus a discrete variable, to compare distributions across classes.
    D. It is run after a one-way ANOVA, to determine which population has the highest mean value.

  • Question 117:

    Which word or phrase completes the statement? A data warehouse is to a centralized database for reporting as an analytic sandbox is to a _______?

    A. Collection of data assets for modeling
    B. Collection of low-volume databases
    C. Centralized database of KPIs
    D. Collection of data assets for ETL

  • Question 118:

    Refer to the exhibit.

    You have created a density plot of purchase amounts from a retail website as shown. What should you do next?

    A. Recreate the plot using the barplot() function
    B. Use the rug() function to add elements to the plot
    C. Recreate the density plot using a log normal distribution of the purchase amount data
    D. Reduce the sample size of the purchase amount data used to create the plot

  • Question 119:

    A business colleague who is new to Hadoop approaches you with a question. The colleague wants to know the best approach to access their data. The colleague has previously worked extensively with SQL and databases. Which query interface should be recommended?

    A. Hive
    B. Pig
    C. Howl
    D. HBase

  • Question 120:

    In which lifecycle stage are test and training data sets created?

    A. Model building
    B. Model planning
    C. Discovery
    D. Data preparation

Tips on How to Prepare for the Exams

Certification exams are becoming increasingly important, and more and more employers require them of job applicants. But how do you prepare for an exam effectively, in a short time and with less effort? How do you get an ideal result, and where do you find the most reliable resources? On Vcedump.com, you will find the answers. Vcedump.com provides not only EMC exam questions, answers, and explanations but also assistance with your exam preparation and certification application. If you are unsure about your E20-007 exam preparation or EMC certification application, visit Vcedump.com to find a solution.