DATA-ENGINEER-ASSOCIATE Exam Details

  • Exam Code: DATA-ENGINEER-ASSOCIATE
  • Exam Name: AWS Certified Data Engineer - Associate (DEA-C01)
  • Certification: Amazon Certifications
  • Vendor: Amazon
  • Total Questions: 403 Q&As
  • Last Updated: May 29, 2026

Amazon DATA-ENGINEER-ASSOCIATE Online Questions & Answers

  • Question 311:

    A company runs a data platform on AWS.

    The data platform uses AWS Glue to provide a data catalog and to perform processing.

    The company notices quality issues in the data.

    The company needs to implement data quality validations.

    The validations must include rules for known issues.

    The validations must have the ability to automatically detect unexpected data quality issues.

    Which solution will meet these requirements with the LEAST operational overhead?

    A. Use AWS Glue jobs to implement AWS Glue Data Quality validations that include anomaly detection.
    B. Use AWS Glue jobs to implement data quality rules that use open source data quality frameworks.
    C. Use AWS Glue DataBrew to profile the data. Configure data quality rules based on the data quality results from the profiling.
    D. Use AWS Glue jobs to implement data quality validations that use SQL statements.
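
    Option A refers to AWS Glue Data Quality, whose rules are written in Data Quality Definition Language (DQDL). As a rough sketch of what a ruleset that combines rules for known issues with anomaly detection might look like, the helper below assembles a DQDL string. The helper name and the column names are hypothetical, and the exact DQDL syntax should be checked against the AWS Glue documentation.

```python
def build_dqdl_ruleset(required_columns, unique_columns, anomaly_metrics):
    """Assemble a DQDL ruleset string (illustrative helper, not an AWS API)."""
    # Static rules cover the issues that are already known.
    rules = [f'IsComplete "{col}"' for col in required_columns]
    rules += [f'IsUnique "{col}"' for col in unique_columns]
    # DetectAnomalies rules compare a metric against values from earlier runs.
    rules += [f'DetectAnomalies "{metric}"' for metric in anomaly_metrics]
    body = ",\n    ".join(rules)
    return f"Rules = [\n    {body}\n]"


# Hypothetical columns for an orders table.
ruleset = build_dqdl_ruleset(["order_id", "customer_id"], ["order_id"], ["RowCount"])
print(ruleset)
```

    The resulting string would be supplied as the ruleset of a Glue Data Quality evaluation in a Glue job.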

  • Question 312:

    A company has AWS resources in multiple AWS Regions. The company has an Amazon EFS file system in each Region where the company operates. The company's data science team operates within only a single Region. The data that the data science team works with must remain within the team's Region.

    A data engineer needs to create a single dataset by processing files that are in each of the company's Regional EFS file systems. The data engineer wants to use an AWS Step Functions state machine to orchestrate AWS Lambda functions to process the data.

    Which solution will meet these requirements with the LEAST effort?

    A. Peer the VPCs that host the EFS file systems in each Region with the VPC that is in the data science team's Region. Enable EFS file locking. Configure the Lambda functions in the data science team's Region to mount each of the Region-specific file systems. Use the Lambda functions to process the data.
    B. Configure each of the Regional EFS file systems to replicate data to the data science team's Region. In the data science team's Region, configure the Lambda functions to mount the replica file systems. Use the Lambda functions to process the data.
    C. Deploy the Lambda functions to each Region. Mount the Regional EFS file systems to the Lambda functions. Use the Lambda functions to process the data. Store the output in an Amazon S3 bucket in the data science team's Region.
    D. Use AWS DataSync to transfer files from each of the Regional EFS file systems to the file system that is in the data science team's Region. Configure the Lambda functions in the data science team's Region to mount the file system that is in the same Region. Use the Lambda functions to process the data.

  • Question 313:

    A company uses Amazon RDS to store transactional data. The company runs an RDS DB instance in a private subnet. A developer wrote an AWS Lambda function with default settings to insert, update, or delete data in the DB instance.

    The developer needs to give the Lambda function the ability to connect to the DB instance privately without using the public internet.

    Which combination of steps will meet this requirement with the LEAST operational overhead? (Choose two.)

    A. Turn on the public access setting for the DB instance.
    B. Update the security group of the DB instance to allow only Lambda function invocations on the database port.
    C. Configure the Lambda function to run in the same subnet that the DB instance uses.
    D. Attach the same security group to the Lambda function and the DB instance. Include a self-referencing rule that allows access through the database port.
    E. Update the network ACL of the private subnet to include a self-referencing rule that allows access through the database port.

  • Question 314:

    A company is using an AWS Transfer Family server to migrate data from an on-premises environment to AWS. Company policy mandates the use of TLS 1.2 or above to encrypt the data in transit.

    Which solution will meet these requirements?

    A. Generate new SSH keys for the Transfer Family server. Make the old keys and the new keys available for use.
    B. Update the security group rules for the on-premises network to allow only connections that use TLS 1.2 or above.
    C. Update the security policy of the Transfer Family server to specify a minimum protocol version of TLS 1.2.
    D. Install an SSL certificate on the Transfer Family server to encrypt data transfers by using TLS 1.2.

  • Question 315:

    A company plans to provision a log delivery stream within a VPC. The company configured the VPC flow logs to publish to Amazon CloudWatch Logs. The company needs to send the flow logs to Splunk in near real time for further analysis.

    Which solution will meet these requirements with the LEAST operational overhead?

    A. Configure an Amazon Kinesis Data Streams data stream to use Splunk as the destination. Create a CloudWatch Logs subscription filter to send log events to the data stream.
    B. Create an Amazon Kinesis Data Firehose delivery stream to use Splunk as the destination. Create a CloudWatch Logs subscription filter to send log events to the delivery stream.
    C. Create an Amazon Kinesis Data Firehose delivery stream to use Splunk as the destination. Create an AWS Lambda function to send the flow logs from CloudWatch Logs to the delivery stream.
    D. Configure an Amazon Kinesis Data Streams data stream to use Splunk as the destination. Create an AWS Lambda function to send the flow logs from CloudWatch Logs to the data stream.
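
    Several options mention a CloudWatch Logs subscription filter. As a minimal sketch of how such a filter could be defined, the function below builds the keyword arguments for the boto3 `put_subscription_filter` call. The function name, the filter name, and all ARNs are placeholders, and the call itself is left as a comment because it needs AWS credentials.

```python
def subscription_filter_params(log_group, destination_arn, role_arn):
    """Build keyword arguments for CloudWatch Logs put_subscription_filter."""
    return {
        "logGroupName": log_group,
        "filterName": "flow-logs-forwarder",  # hypothetical filter name
        "filterPattern": "",  # an empty pattern matches every log event
        "destinationArn": destination_arn,
        "roleArn": role_arn,  # role that lets CloudWatch Logs write to the destination
    }


params = subscription_filter_params(
    "/vpc/flow-logs",  # placeholder log group
    "arn:aws:firehose:us-east-1:123456789012:deliverystream/to-splunk",  # placeholder
    "arn:aws:iam::123456789012:role/CWLtoFirehose",  # placeholder
)
# With credentials configured, the filter could be created with:
#   boto3.client("logs").put_subscription_filter(**params)
print(params)
```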

  • Question 316:

    A data engineer maintains custom Python scripts that perform a data formatting process that many AWS Lambda functions use. When the data engineer needs to modify the Python scripts, the data engineer must manually update all the Lambda functions.

    The data engineer requires a less manual way to update the Lambda functions.

    Which solution will meet this requirement?

    A. Store a pointer to the custom Python scripts in the execution context object in a shared Amazon S3 bucket.
    B. Package the custom Python scripts into Lambda layers. Apply the Lambda layers to the Lambda functions.
    C. Store a pointer to the custom Python scripts in environment variables in a shared Amazon S3 bucket.
    D. Assign the same alias to each Lambda function. Call each Lambda function by specifying the function's alias.
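
    Option B mentions Lambda layers. For Python runtimes, a layer archive places shared modules under a top-level `python/` directory so that functions can import them. The sketch below builds such an archive in memory; the module name `formatting.py` and its contents are hypothetical.

```python
import io
import zipfile


def build_layer_zip(modules):
    """Return the bytes of a layer archive; modules maps file names to source text."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", zipfile.ZIP_DEFLATED) as archive:
        for file_name, source in modules.items():
            # Python layers expose the contents of the python/ directory for import.
            archive.writestr(f"python/{file_name}", source)
    return buffer.getvalue()


layer_bytes = build_layer_zip(
    {"formatting.py": "def clean(value):\n    return value.strip().lower()\n"}
)
# With credentials configured, the archive could be published with:
#   boto3.client("lambda").publish_layer_version(
#       LayerName="shared-formatting", Content={"ZipFile": layer_bytes})
print(len(layer_bytes), "bytes")
```

    Each function that references the layer would then pick up script changes by pointing at the new layer version instead of bundling its own copy.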

  • Question 317:

    A financial services company stores financial data in Amazon Redshift. A data engineer wants to run real-time queries on the financial data to support a web-based trading application. The data engineer wants to run the queries from within the trading application.

    Which solution will meet these requirements with the LEAST operational overhead?

    A. Establish WebSocket connections to Amazon Redshift.
    B. Use the Amazon Redshift Data API.
    C. Set up Java Database Connectivity (JDBC) connections to Amazon Redshift.
    D. Store frequently accessed data in Amazon S3. Use Amazon S3 Select to run the queries.
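
    Option B mentions the Amazon Redshift Data API, which accepts SQL over HTTPS rather than over a persistent database connection. As a rough illustration, the function below builds the arguments for the boto3 `redshift-data` client's `execute_statement` call. The table, column, cluster, and secret names are placeholders, and the call itself is shown only as a comment.

```python
def build_query_request(cluster_id, database, secret_arn, symbol):
    """Build keyword arguments for the Redshift Data API execute_statement call."""
    return {
        "ClusterIdentifier": cluster_id,
        "Database": database,
        "SecretArn": secret_arn,
        # Named parameters use the :name form, which avoids string concatenation.
        "Sql": "SELECT symbol, price FROM trades WHERE symbol = :symbol",
        "Parameters": [{"name": "symbol", "value": symbol}],
    }


request = build_query_request(
    "trading-cluster",  # placeholder cluster identifier
    "finance",  # placeholder database
    "arn:aws:secretsmanager:us-east-1:123456789012:secret:redshift-creds",  # placeholder
    "AMZN",
)
# With credentials configured:
#   client = boto3.client("redshift-data")
#   statement_id = client.execute_statement(**request)["Id"]
#   rows = client.get_statement_result(Id=statement_id)  # once the statement finishes
print(request["Sql"])
```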

  • Question 318:

    A company uses an Amazon Redshift cluster to manage data, including vendor sales data. The company wants to store a copy of the vendor data in an Amazon S3 bucket.

    A data engineer sets up an AWS Glue job to upload the data to the S3 bucket on a schedule. The data engineer set up a network connection to allow private traffic between Amazon Redshift and Amazon S3.

    What is the next step required to meet this requirement?

    A. Create an IAM role that has permission to write to the S3 bucket. Associate the IAM role with the Amazon Redshift cluster.
    B. Add the S3 bucket to an AWS Glue Data Catalog. Configure Amazon Redshift Spectrum to access the Data Catalog.
    C. Enable the Amazon Redshift data sharing feature. Set the S3 bucket as a target bucket for data sharing.
    D. Store login credentials for Amazon Redshift in AWS Secrets Manager. Add a reference to the secret to the Glue job configuration.

  • Question 319:

    A company receives marketing campaign data from a vendor. The company ingests the data into an Amazon S3 bucket every 40 to 60 minutes. The data is in CSV format. File sizes are between 100 KB and 300 KB.

    A data engineer needs to set up an extract, transform, and load (ETL) pipeline to upload the content of each file to Amazon Redshift.

    Which solution will meet these requirements with the LEAST operational overhead?

    A. Create an AWS Lambda function that connects to Amazon Redshift and runs a COPY command. Use Amazon EventBridge to invoke the Lambda function based on an Amazon S3 upload trigger.
    B. Create an Amazon Data Firehose stream. Configure the stream to use an AWS Lambda function as a source to pull data from the S3 bucket. Set Amazon Redshift as the destination.
    C. Use Amazon Redshift Spectrum to query the S3 bucket. Configure an AWS Glue Crawler for the S3 bucket to update metadata in an AWS Glue Data Catalog.
    D. Create an AWS Database Migration Service (AWS DMS) task. Specify an appropriate data schema to migrate. Specify the appropriate type of migration to use.
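
    Option A mentions a Lambda function that runs a COPY command when Amazon EventBridge reports an S3 upload. A minimal sketch of the part that turns the event into a COPY statement is shown below. It assumes the EventBridge "Object Created" event shape (`detail.bucket.name`, `detail.object.key`) and a CSV file with one header row; the table name and role ARN are placeholders.

```python
def copy_command_from_event(event, table, iam_role_arn):
    """Build a Redshift COPY statement for the object named in an S3 event."""
    bucket = event["detail"]["bucket"]["name"]
    key = event["detail"]["object"]["key"]
    # IGNOREHEADER 1 assumes the vendor's CSV files start with a header row.
    return (
        f"COPY {table} FROM 's3://{bucket}/{key}' "
        f"IAM_ROLE '{iam_role_arn}' FORMAT AS CSV IGNOREHEADER 1;"
    )


sample_event = {
    "detail": {
        "bucket": {"name": "campaign-data"},
        "object": {"key": "2026/05/file.csv"},
    }
}
statement = copy_command_from_event(
    sample_event,
    "marketing.campaigns",  # placeholder table
    "arn:aws:iam::123456789012:role/RedshiftCopy",  # placeholder role
)
print(statement)
```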

  • Question 320:

    A company is planning to migrate on-premises Apache Hadoop clusters to Amazon EMR. The company also needs to migrate a data catalog into a persistent storage solution.

    The company currently stores the data catalog in an on-premises Apache Hive metastore on the Hadoop clusters. The company requires a serverless solution to migrate the data catalog.

    Which solution will meet these requirements MOST cost-effectively?

    A. Use AWS Database Migration Service (AWS DMS) to migrate the Hive metastore into Amazon S3. Configure AWS Glue Data Catalog to scan Amazon S3 to produce the data catalog.
    B. Configure a Hive metastore in Amazon EMR. Migrate the existing on-premises Hive metastore into Amazon EMR. Use AWS Glue Data Catalog to store the company's data catalog as an external data catalog.
    C. Configure an external Hive metastore in Amazon EMR. Migrate the existing on-premises Hive metastore into Amazon EMR. Use Amazon Aurora MySQL to store the company's data catalog.
    D. Configure a new Hive metastore in Amazon EMR. Migrate the existing on-premises Hive metastore into Amazon EMR. Use the new metastore as the company's data catalog.

Tips on How to Prepare for the Exams

Certification exams are becoming increasingly important, and more and more employers require them of job applicants. But how do you prepare for an exam effectively, in a short time and with less effort? How do you get an ideal result, and where do you find the most reliable resources? Here on Vcedump.com, you will find all the answers. Vcedump.com provides not only Amazon exam questions, answers, and explanations but also complete assistance with your exam preparation and certification application. If you are unsure about your DATA-ENGINEER-ASSOCIATE exam preparation or Amazon certification application, do not hesitate to visit Vcedump.com to find your solutions.