Exam Details

  • Exam Code: DATABRICKS-CERTIFIED-DATA-ENGINEER-ASSOCIATE
  • Exam Name: Databricks Certified Data Engineer Associate
  • Certification: Databricks Certifications
  • Vendor: Databricks
  • Total Questions: 132 Q&As
  • Last Updated: Jul 02, 2025

Databricks Certified Data Engineer Associate (DATABRICKS-CERTIFIED-DATA-ENGINEER-ASSOCIATE) Questions & Answers

  • Question 41:

    A dataset has been defined using Delta Live Tables and includes an expectations clause:

    CONSTRAINT valid_timestamp EXPECT (timestamp > '2020-01-01') ON VIOLATION DROP ROW

    What is the expected behavior when a batch containing records that violate this constraint is processed?

    A. Records that violate the expectation are dropped from the target dataset and loaded into a quarantine table.

    B. Records that violate the expectation are added to the target dataset and flagged as invalid in a field added to the target dataset.

    C. Records that violate the expectation are dropped from the target dataset and recorded as invalid in the event log.

    D. Records that violate the expectation are added to the target dataset and recorded as invalid in the event log.

    E. Records that violate the expectation cause the job to fail.
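
As background for this question, the plain-Python sketch below mimics what an expectation with ON VIOLATION DROP ROW does: rows failing the predicate are left out of the target, and a count of them is kept as a data quality metric. It is only an illustration under stated assumptions, not Delta Live Tables code. The helper `apply_expect_or_drop`, the sample batch, and the shape of the `event` dictionary are all hypothetical stand-ins for the pipeline and its event log.

```python
from datetime import datetime

def apply_expect_or_drop(rows, name, predicate):
    """Keep rows that satisfy the predicate; summarize the rest as a metric record."""
    kept = [row for row in rows if predicate(row)]
    # Hypothetical, simplified stand-in for an event-log entry.
    event = {
        "expectation": name,
        "passed_records": len(kept),
        "dropped_records": len(rows) - len(kept),
    }
    return kept, event

# Made-up batch: ids 2 and 3 do not satisfy timestamp > '2020-01-01'.
batch = [
    {"id": 1, "timestamp": datetime(2021, 6, 1)},
    {"id": 2, "timestamp": datetime(2019, 12, 31)},
    {"id": 3, "timestamp": datetime(2020, 1, 1)},
]
cutoff = datetime(2020, 1, 1)

target, event = apply_expect_or_drop(
    batch, "valid_timestamp", lambda row: row["timestamp"] > cutoff
)
print([row["id"] for row in target])
print(event)
```

The pipeline keeps running in this mode; only the offending rows are left out of the target.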

  • Question 42:

    A data engineer needs to apply custom logic to identify employees with more than 5 years of experience in the array column employees of the table stores. The custom logic should create a new column exp_employees that, for each row, is an array of all employees with more than 5 years of experience. To apply this custom logic at scale, the data engineer wants to use the FILTER higher-order function.

    Which of the following code blocks successfully completes this task?

    A. Option A

    B. Option B

    C. Option C

    D. Option D

    E. Option E
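
For reference, Spark SQL's FILTER higher-order function takes an array and a lambda predicate, in the form FILTER(employees, e -> e.years_exp > 5), and returns the elements for which the predicate is true. The plain-Python sketch below mimics that per-row behavior. The field name `years_exp`, the helper names, and the sample rows are assumptions made for illustration, since the original answer options are not reproduced here.

```python
# Hypothetical sample of the `stores` table with an array column `employees`.
stores = [
    {"store_id": 1, "employees": [{"name": "Ana", "years_exp": 7},
                                  {"name": "Bo", "years_exp": 2}]},
    {"store_id": 2, "employees": [{"name": "Cy", "years_exp": 5},
                                  {"name": "Di", "years_exp": 9}]},
]

def filter_array(array, predicate):
    """Mimic FILTER(array, x -> predicate(x)): keep elements where the predicate holds."""
    return [element for element in array if predicate(element)]

def add_exp_employees(rows, min_years=5):
    """Return copies of the rows with a new `exp_employees` array column."""
    return [
        {**row,
         "exp_employees": filter_array(
             row["employees"], lambda e: e["years_exp"] > min_years)}
        for row in rows
    ]

result = add_exp_employees(stores)
for row in result:
    print(row["store_id"], [e["name"] for e in row["exp_employees"]])
```

Note that the comparison is strict, so an employee with exactly 5 years of experience is not included.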

  • Question 43:

    Which of the following commands will return the number of null values in the member_id column?

    A. SELECT count(member_id) FROM my_table;

    B. SELECT count(member_id) - count_null(member_id) FROM my_table;

    C. SELECT count_if(member_id IS NULL) FROM my_table;

    D. SELECT null(member_id) FROM my_table;

    E. SELECT count_null(member_id) FROM my_table;
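
Two facts are useful here: count(column) skips NULL values, and a conditional count tallies only the rows where a condition is true. The sketch below checks both using Python's built-in sqlite3 module as a stand-in engine, which is an assumption made so the example runs without a Databricks workspace. SQLite has no count_if, so sum(member_id IS NULL) plays that role here. The sample rows are made up.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE my_table (member_id INTEGER)")
# Made-up data: five rows, two of them NULL.
conn.executemany(
    "INSERT INTO my_table VALUES (?)",
    [(1,), (None,), (3,), (None,), (5,)],
)

# count(member_id) ignores NULLs, so it returns the number of non-null values.
non_null = conn.execute("SELECT count(member_id) FROM my_table").fetchone()[0]

# count(*) counts every row regardless of NULLs.
total = conn.execute("SELECT count(*) FROM my_table").fetchone()[0]

# Conditional count: (member_id IS NULL) evaluates to 1 or 0 in SQLite.
nulls = conn.execute("SELECT sum(member_id IS NULL) FROM my_table").fetchone()[0]

print(non_null, total, nulls)
conn.close()
```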

  • Question 44:

    A data engineer runs a statement every day to copy the previous day's sales into the table transactions. Each day's sales are in their own file in the location "/transactions/raw". Today, the data engineer runs the following command to complete this task:

    After running the command today, the data engineer notices that the number of records in table transactions has not changed. Which of the following describes why the statement might not have copied any new records into the table?

    A. The format of the files to be copied was not included with the FORMAT_OPTIONS keyword.

    B. The names of the files to be copied were not included with the FILES keyword.

    C. The previous day's file has already been copied into the table.

    D. The PARQUET file format does not support COPY INTO.

    E. The COPY INTO statement requires the table to be refreshed to view the copied rows.
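
COPY INTO is documented as idempotent: source files that were already loaded are skipped on later runs. The plain-Python sketch below mimics that bookkeeping. It is not the Databricks implementation; the function `copy_into`, the `loaded_files` set, and the file names under /transactions/raw are hypothetical.

```python
def copy_into(table, loaded_files, source_files):
    """Append rows from files not seen before; return how many rows were added."""
    added = 0
    for path in sorted(source_files):
        if path in loaded_files:
            continue  # already ingested on an earlier run, so skip it
        table.extend(source_files[path])
        loaded_files.add(path)
        added += len(source_files[path])
    return added

transactions = []      # stand-in for the target table
loaded_files = set()   # stand-in for the load history kept with the table

# Hypothetical directory contents: one file per day.
raw = {"/transactions/raw/day1.parquet": [{"sale": 10}, {"sale": 20}]}

first_run = copy_into(transactions, loaded_files, raw)
second_run = copy_into(transactions, loaded_files, raw)   # same files again

raw["/transactions/raw/day2.parquet"] = [{"sale": 30}]
third_run = copy_into(transactions, loaded_files, raw)    # only day2 is new

print(first_run, second_run, third_run, len(transactions))
```

Re-running against an unchanged directory adds nothing, which is one way a row count can stay the same after the command runs.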

  • Question 45:

    A data engineer wants to schedule their Databricks SQL dashboard to refresh once per day, but they only want the associated SQL endpoint to be running when it is necessary.

    Which of the following approaches can the data engineer use to minimize the total running time of the SQL endpoint used in the refresh schedule of their dashboard?

    A. They can ensure the dashboard's SQL endpoint matches each of the queries' SQL endpoints.

    B. They can set up the dashboard's SQL endpoint to be serverless.

    C. They can turn on the Auto Stop feature for the SQL endpoint.

    D. They can reduce the cluster size of the SQL endpoint.

    E. They can ensure the dashboard's SQL endpoint is not one of the included queries' SQL endpoints.

  • Question 46:

    Which of the following describes the relationship between Gold tables and Silver tables?

    A. Gold tables are more likely to contain aggregations than Silver tables.

    B. Gold tables are more likely to contain valuable data than Silver tables.

    C. Gold tables are more likely to contain a less refined view of data than Silver tables.

    D. Gold tables are more likely to contain more data than Silver tables.

    E. Gold tables are more likely to contain truthful data than Silver tables.

  • Question 47:

    Which of the following commands can be used to write data into a Delta table while avoiding the writing of duplicate records?

    A. DROP

    B. IGNORE

    C. MERGE

    D. APPEND

    E. INSERT

  • Question 48:

    Which of the following Git operations must be performed outside of Databricks Repos?

    A. Commit

    B. Pull

    C. Push

    D. Clone

    E. Merge

  • Question 49:

    A data engineer has a Python notebook in Databricks, but they need to use SQL to accomplish a specific task within a cell. They still want all of the other cells to use Python without making any changes to those cells.

    Which of the following describes how the data engineer can use SQL within a cell of their Python notebook?

    A. It is not possible to use SQL in a Python notebook

    B. They can attach the cell to a SQL endpoint rather than a Databricks cluster

    C. They can simply write SQL syntax in the cell

    D. They can add %sql to the first line of the cell

    E. They can change the default language of the notebook to SQL

  • Question 50:

    A data engineer has been using a Databricks SQL dashboard to monitor the cleanliness of the input data to an ELT job. The ELT job has an associated Databricks SQL query that returns the number of input records containing unexpected NULL values. The data engineer wants their entire team to be notified via a messaging webhook whenever this value reaches 100.

    Which of the following approaches can the data engineer use to notify their entire team via a messaging webhook whenever the number of NULL values reaches 100?

    A. They can set up an Alert with a custom template.

    B. They can set up an Alert with a new email alert destination.

    C. They can set up an Alert with a new webhook alert destination.

    D. They can set up an Alert with one-time notifications.

    E. They can set up an Alert without notifications.
