Exam Details

  • Exam Code: DATABRICKS-CERTIFIED-PROFESSIONAL-DATA-SCIENTIST
  • Exam Name: Databricks Certified Professional Data Scientist Exam
  • Certification: Databricks Certification
  • Vendor: Databricks
  • Total Questions: 138 Q&As
  • Last Updated: May 12, 2024

Databricks Databricks Certification DATABRICKS-CERTIFIED-PROFESSIONAL-DATA-SCIENTIST Questions & Answers

  • Question 41:

    Consider flipping a coin for which the probability of heads is p, where p is unknown, and our goal is to estimate p. The obvious approach is to count how many times the coin came up heads and divide by the total number of coin flips. If we flip the coin 1000 times and it comes up heads 367 times, it is very reasonable to estimate p as approximately 0.367. However, suppose we flip the coin only twice and we get heads both times. Is it reasonable to estimate p as 1.0? Intuitively, given that we only flipped the coin twice, it seems a bit rash to conclude that the coin will always come up heads, and ____________ is a way of avoiding such rash conclusions.

    A. Naive Bayes

    B. Laplace Smoothing

    C. Logistic Regression

    D. Linear Regression
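
The coin-flip arithmetic in this question can be checked with a few lines of Python. This is a minimal sketch, assuming add-one (Laplace) smoothing with a pseudo-count `alpha` added to each of the two outcomes; the function name `laplace_estimate` and the default `alpha=1.0` are illustrative choices, not part of the exam material.

```python
def laplace_estimate(heads, flips, alpha=1.0):
    """Smoothed estimate of P(heads).

    Adds `alpha` pseudo-counts to each of the two outcomes (heads, tails),
    so the denominator grows by 2 * alpha.
    """
    return (heads + alpha) / (flips + 2 * alpha)


# Two flips, two heads: the raw estimate is 1.0, the smoothed one is not.
raw_two_flips = 2 / 2
smoothed_two_flips = laplace_estimate(2, 2)

# With 1000 flips the pseudo-counts barely change the estimate.
smoothed_many_flips = laplace_estimate(367, 1000)

print(raw_two_flips, smoothed_two_flips, round(smoothed_many_flips, 4))
```

With only two flips the pseudo-counts pull the estimate well away from 1.0, while with 1000 flips the smoothed value stays close to the raw frequency of 0.367.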

  • Question 42:

    Suppose that we are interested in the factors that influence whether a political candidate wins an election. The outcome (response) variable is binary (0/1); win or lose. The predictor variables of interest are the amount of money spent on the campaign, the amount of time spent campaigning negatively and whether or not the candidate is an incumbent.

    Above is an example of:

    A. Linear Regression

    B. Logistic Regression

    C. Recommendation system

    D. Maximum likelihood estimation

    E. Hierarchical linear models
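
For a binary win/lose outcome, one common modelling choice is to pass a linear combination of the predictors through the logistic (sigmoid) function so the result is a probability between 0 and 1. The sketch below illustrates only that mapping; the coefficient values in `HYPOTHETICAL_COEFFICIENTS` and the units of the inputs are invented for illustration and are not fitted to any data.

```python
import math


def win_probability(money_spent, negative_hours, is_incumbent, coefficients):
    """Map the three predictors to a probability with the logistic function."""
    intercept, b_money, b_negative, b_incumbent = coefficients
    log_odds = (
        intercept
        + b_money * money_spent
        + b_negative * negative_hours
        + b_incumbent * is_incumbent
    )
    return 1.0 / (1.0 + math.exp(-log_odds))


# Hypothetical coefficients, chosen only to make the example concrete.
HYPOTHETICAL_COEFFICIENTS = (-1.0, 0.5, -0.2, 1.5)

p_incumbent = win_probability(2.0, 1.0, 1, HYPOTHETICAL_COEFFICIENTS)
p_challenger = win_probability(2.0, 1.0, 0, HYPOTHETICAL_COEFFICIENTS)
print(p_incumbent, p_challenger)
```

Whatever the inputs, the output stays strictly between 0 and 1, which is what makes this form suitable for a 0/1 response.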

  • Question 43:

    Suppose you have the following two cases for movie ratings:

    1. You recommend a movie with a predicted rating of four stars, but the user really doesn't like it and would rate it two stars.

    2. You recommend a movie with a predicted rating of three stars, but the user loves it and would rate it five stars.

    Which statement correctly applies?

    A. In both cases, the contribution to the RMSE is the same

    B. In both cases, the contribution to the RMSE is different

    C. In both cases, the contribution to the RMSE could vary

    D. None of the above
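
The two cases can be compared numerically. RMSE is the square root of the mean of squared errors, so each prediction contributes its squared difference from the actual rating. The helper names below are illustrative, and the sketch assumes "four stars" and "three stars" are the predicted ratings.

```python
import math


def squared_error(predicted, actual):
    """Contribution of one prediction to the sum inside the RMSE."""
    return (predicted - actual) ** 2


def rmse(pairs):
    """Root mean squared error over (predicted, actual) pairs."""
    return math.sqrt(sum(squared_error(p, a) for p, a in pairs) / len(pairs))


case_1 = squared_error(4, 2)  # predicted four stars, user would rate two
case_2 = squared_error(3, 5)  # predicted three stars, user would rate five
overall = rmse([(4, 2), (3, 5)])

print(case_1, case_2, overall)
```

Because the error is squared, over-prediction by two stars and under-prediction by two stars are penalized identically.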

  • Question 44:

    Refer to Exhibit

    In the exhibit, the x-axis represents the derived probability of a borrower defaulting on a loan. Also in the exhibit, the pink represents borrowers that are known to have not defaulted on their loan, and the blue represents borrowers that are known to have defaulted on their loan. Which analytical method could produce the probabilities needed to build this exhibit?

    A. Linear Regression

    B. Logistic Regression

    C. Discriminant Analysis

    D. Association Rules

  • Question 45:

    A. Naive Bayes classifier

    B. Collaborative filtering

    C. Logistic Regression

    D. Content-based filtering

  • Question 46:

    The method based on principal component analysis (PCA) evaluates the features according to:

    A. The projection of the largest eigenvector of the correlation matrix on the initial dimensions

    B. The magnitude of the components of the discriminant vector

    C. The projection of the smallest eigenvector of the correlation matrix on the initial dimensions

    D. None of the above

  • Question 47:

    You are working on an email spam filtering assignment. While working on it, you find that a new word, e.g. HadoopExam, appears in an email. Your solution has never come across this word before, so the estimated probability of this word appearing in either class of email could be zero. Which of the following can help you avoid the zero probability?

    A. Naive Bayes

    B. Laplace Smoothing

    C. Logistic Regression

    D. All of the above
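
The zero-probability problem described in this question can be reproduced with word counts. The sketch below assumes a bag-of-words model with add-`alpha` smoothing over a fixed vocabulary; the counts in `spam_counts` and the four-word `vocabulary` are made up for illustration.

```python
from collections import Counter


def word_probability(word, class_word_counts, vocabulary_size, alpha=1.0):
    """Estimate P(word | class), adding `alpha` pseudo-counts per vocabulary word."""
    total = sum(class_word_counts.values())
    return (class_word_counts[word] + alpha) / (total + alpha * vocabulary_size)


# Hypothetical training counts for the spam class; "HadoopExam" was never seen.
spam_counts = Counter({"offer": 3, "free": 2})
vocabulary = {"offer", "free", "meeting", "HadoopExam"}

unsmoothed = word_probability("HadoopExam", spam_counts, len(vocabulary), alpha=0.0)
smoothed = word_probability("HadoopExam", spam_counts, len(vocabulary))

print(unsmoothed, smoothed)
```

Without smoothing, a single unseen word drives the whole product of word probabilities to zero; with smoothing, every vocabulary word keeps a small positive probability and the probabilities still sum to one.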

  • Question 48:

    You are designing a recommendation engine for a website. The engine generates more personalized recommendations by analyzing information from the past activity of a specific user, or the history of other users deemed to have similar taste to that user. This information is used for user profiling and helps the site recommend content on a user-by-user basis. The more a given user makes use of the system, the better the recommendations become, as the system gains data to improve its model of that user.

    What kind of recommendation engine is this?

    A. Naive Bayes classifier

    B. Collaborative filtering

    C. Logistic Regression

    D. Content-based filtering

  • Question 49:

    You have collected hundreds of parameters about thousands of websites, e.g. daily hits, average time on the website, number of unique visitors, number of returning visitors, etc. Now you have to find the most important parameters that best describe a website. Which of the following techniques will you use?

    A. PCA (Principal component analysis)

    B. Linear Regression

    C. Logistic Regression

    D. Clustering
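
To make the idea of ranking parameters concrete, here is a minimal pure-Python sketch of one of the listed techniques, PCA: it builds the covariance matrix of a few website metrics and finds its leading eigenvector by power iteration. The metric values in `rows` are invented, the helper names are illustrative, and in practice the features would normally be standardized first (which amounts to using the correlation matrix instead).

```python
def covariance_matrix(rows):
    """Sample covariance matrix of the columns of `rows`."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - means[j] for j in range(d)] for r in rows]
    return [
        [sum(c[i] * c[j] for c in centered) / (n - 1) for j in range(d)]
        for i in range(d)
    ]


def leading_eigenvector(matrix, iterations=200):
    """Approximate the eigenvector with the largest eigenvalue by power iteration."""
    d = len(matrix)
    vector = [1.0] * d
    for _ in range(iterations):
        product = [sum(matrix[i][j] * vector[j] for j in range(d)) for i in range(d)]
        norm = sum(x * x for x in product) ** 0.5
        vector = [x / norm for x in product]
    return vector


# Hypothetical metrics per site: (daily hits, unique visitors, average minutes).
rows = [(100, 80, 3.0), (200, 150, 3.2), (300, 260, 2.9), (400, 310, 3.1)]
loadings = leading_eigenvector(covariance_matrix(rows))
print([round(abs(x), 3) for x in loadings])
```

The absolute size of each loading indicates how strongly that original parameter contributes to the first principal component. On this unscaled toy data the two traffic counts dominate and the time metric contributes almost nothing.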

  • Question 50:

    Suppose you have been given two random variables X and Y whose joint distribution is already known. The marginal distribution of X is simply the probability distribution of X averaging over information about Y; it is the probability distribution of X when the value of Y is not known. How do you calculate the marginal distribution of X?

    A. This is typically calculated by summing the joint probability distribution over Y.

    B. This is typically calculated by integrating the joint probability distribution over Y.

    C. This is typically calculated by summing (in the case of discrete variables) the joint probability distribution over Y.

    D. This is typically calculated by integrating (in the case of continuous variables) the joint probability distribution over Y.
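
For discrete variables, marginalizing over Y amounts to adding up the joint probabilities that share the same value of X; for continuous variables the sum becomes an integral. The sketch below covers only the discrete case, and the joint table `joint` is a made-up example.

```python
from collections import defaultdict


def marginal_of_x(joint):
    """Sum a discrete joint distribution {(x, y): p} over y to get {x: p}."""
    marginal = defaultdict(float)
    for (x, _y), probability in joint.items():
        marginal[x] += probability
    return dict(marginal)


# Hypothetical joint distribution of X (weather) and Y (temperature).
joint = {
    ("rain", "cold"): 0.3,
    ("rain", "warm"): 0.1,
    ("dry", "cold"): 0.2,
    ("dry", "warm"): 0.4,
}

marginal = marginal_of_x(joint)
print(marginal)
```

The resulting marginal probabilities still sum to one, since every entry of the joint table is counted exactly once.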
