MLS-C01 Exam Details

  • Exam Code
    :MLS-C01
  • Exam Name
    :AWS Certified Machine Learning - Specialty (MLS-C01)
  • Certification
    :Amazon Certifications
  • Vendor
    :Amazon
  • Total Questions
    :396 Q&As
  • Last Updated
    :May 26, 2026

Amazon MLS-C01 Online Questions & Answers

  • Question 211:

    A machine learning (ML) specialist is training a linear regression model. The specialist notices that the model is overfitting. The specialist applies an L1 regularization parameter and runs the model again. This change results in all features having zero weights.

    What should the ML specialist do to improve the model results?

    A. Increase the L1 regularization parameter. Do not change any other training parameters.
    B. Decrease the L1 regularization parameter. Do not change any other training parameters.
    C. Introduce a large L2 regularization parameter. Do not change the current L1 regularization value.
    D. Introduce a small L2 regularization parameter. Do not change the current L1 regularization value.

  • Question 212:

    An energy company has wind turbines, weather stations, and solar panels that generate telemetry data. The company wants to perform predictive maintenance on these devices. The devices are in various locations and have unstable internet connectivity.

    A team of data scientists is using the telemetry data to perform machine learning (ML) to conduct anomaly detection and predict maintenance before the devices start to deteriorate. The team needs a scalable, secure, high-velocity data ingestion mechanism. The team has decided to use Amazon S3 as the data storage location.

    Which approach meets these requirements?

    A. Ingest the data by using an HTTP API call to a web server that is hosted on Amazon EC2. Set up EC2 instances in an Auto Scaling configuration behind an Elastic Load Balancer to load the data into Amazon S3.
    B. Ingest the data over Message Queuing Telemetry Transport (MQTT) to AWS IoT Core. Set up a rule in AWS IoT Core to use Amazon Kinesis Data Firehose to send data to an Amazon Kinesis data stream that is configured to write to an S3 bucket.
    C. Ingest the data over Message Queuing Telemetry Transport (MQTT) to AWS IoT Core. Set up a rule in AWS IoT Core to direct all MQTT data to an Amazon Kinesis Data Firehose delivery stream that is configured to write to an S3 bucket.
    D. Ingest the data over Message Queuing Telemetry Transport (MQTT) to Amazon Kinesis data stream that is configured to write to an S3 bucket.

  • Question 213:

    A Machine Learning Specialist is building a model to predict future employment rates based on a wide range of economic factors While exploring the data, the Specialist notices that the magnitude of the input features vary greatly The Specialist does not want variables with a larger magnitude to dominate the model

    What should the Specialist do to prepare the data for model training'?

    A. Apply quantile binning to group the data into categorical bins to keep any relationships in the data by replacing the magnitude with distribution
    B. Apply the Cartesian product transformation to create new combinations of fields that are independent of the magnitude
    C. Apply normalization to ensure each field will have a mean of 0 and a variance of 1 to remove any significant magnitude
    D. Apply the orthogonal sparse Diagram (OSB) transformation to apply a fixed-size sliding window to generate new features of a similar magnitude.

  • Question 214:

    While reviewing the histogram for residuals on regression evaluation data a Machine Learning Specialist notices that the residuals do not form a zero-centered bell shape as shown. What does this mean?

    A. The model might have prediction errors over a range of target values.
    B. The dataset cannot be accurately represented using the regression model
    C. There are too many variables in the model
    D. The model is predicting its target values perfectly.

  • Question 215:

    A data scientist for a medical diagnostic testing company has developed a machine learning (ML) model to identify patients who have a specific disease. The dataset that the scientist used to train the model is imbalanced. The dataset contains a large number of healthy patients and only a small number of patients who have the disease. The model should consider that patients who are incorrectly identified as positive for the disease will increase costs for the company.

    Which metric will MOST accurately evaluate the performance of this model?

    A. Recall
    B. F1 score
    C. Accuracy
    D. Precision

  • Question 216:

    A Machine Learning Specialist previously trained a logistic regression model using scikit-learn on a local machine, and the Specialist now wants to deploy it to production for inference only.

    What steps should be taken to ensure Amazon SageMaker can host a model that was trained locally?

    A. Build the Docker image with the inference code. Tag the Docker image with the registry hostname and upload it to Amazon ECR.
    B. Serialize the trained model so the format is compressed for deployment. Tag the Docker image with the registry hostname and upload it to Amazon S3.
    C. Serialize the trained model so the format is compressed for deployment. Build the image and upload it to Docker Hub.
    D. Build the Docker image with the inference code. Configure Docker Hub and upload the image to Amazon ECR.

  • Question 217:

    A data engineer is evaluating customer data in Amazon SageMaker Data Wrangler. The data engineer will use the customer data to create a new model to predict customer behavior.

    The engineer needs to increase the model performance by checking for multicollinearity in the dataset.

    Which steps can the data engineer take to accomplish this with the LEAST operational effort? (Choose two.)

    A. Use SageMaker Data Wrangler to refit and transform the dataset by applying one-hot encoding to category-based variables.
    B. Use SageMaker Data Wrangler diagnostic visualization. Use principal components analysis (PCA) and singular value decomposition (SVD) to calculate singular values.
    C. Use the SageMaker Data Wrangler Quick Model visualization to quickly evaluate the dataset and to produce importance scores for each feature.
    D. Use the SageMaker Data Wrangler Min Max Scaler transform to normalize the data.
    E. Use SageMaker Data Wrangler diagnostic visualization. Use least absolute shrinkage and selection operator (LASSO) to plot coefficient values from a LASSO model that is trained on the dataset.

  • Question 218:

    A tourism company uses a machine learning (ML) model to make recommendations to customers. The company uses an Amazon SageMaker environment and set hyperparameter tuning completion criteria to MaxNumberOfTrainingJobs.

    An ML specialist wants to change the hyperparameter tuning completion criteria. The ML specialist wants to stop tuning immediately after an internal algorithm determines that tuning job is unlikely to improve more than 1% over the objective metric from the best training job.

    Which completion criteria will meet this requirement?

    A. MaxRuntimeInSeconds
    B. TargetObjectiveMetricValue
    C. CompleteOnConvergence
    D. MaxNumberOfTrainingJobsNotImproving

  • Question 219:

    A data scientist uses Amazon SageMaker Data Wrangler to analyze and visualize data. The data scientist wants to refine a training dataset by selecting predictor variables that are strongly predictive of the target variable. The target variable correlates with other predictor variables.

    The data scientist wants to understand the variance in the data along various directions in the feature space.

    Which solution will meet these requirements?

    A. Use the SageMaker Data Wrangler multicollinearity measurement features with a variance inflation factor (VIF) score. Use the VIF score as a measurement of how closely the variables are related to each other.
    B. Use the SageMaker Data Wrangler Data Quality and Insights Report quick model visualization to estimate the expected quality of a model that is trained on the data.
    C. Use the SageMaker Data Wrangler multicollinearity measurement features with the principal component analysis (PCA) algorithm to provide a feature space that includes all of the predictor variables.
    D. Use the SageMaker Data Wrangler Data Quality and Insights Report feature to review features by their predictive power.

  • Question 220:

    A company wants to predict stock market price trends. The company stores stock market data each business day in Amazon S3 in Apache Parquet format. The company stores 20 GB of data each day for each stock code.

    A data engineer must use Apache Spark to perform batch preprocessing data transformations quickly so the company can complete prediction jobs before the stock market opens the next day. The company plans to track more stock market codes and needs a way to scale the preprocessing data transformations. Which AWS service or feature will meet these requirements with the LEAST development effort over time?

    A. AWS Glue jobs
    B. Amazon EMR cluster
    C. Amazon Athena
    D. AWS Lambda

Tips on How to Prepare for the Exams

Nowadays, the certification exams become more and more important and required by more and more enterprises when applying for a job. But how to prepare for the exam effectively? How to prepare for the exam in a short time with less efforts? How to get a ideal result and how to find the most reliable resources? Here on Vcedump.com, you will find all the answers. Vcedump.com provide not only Amazon exam questions, answers and explanations but also complete assistance on your exam preparation and certification application. If you are confused on your MLS-C01 exam preparations and Amazon certification application, do not hesitate to visit our Vcedump.com to find your solutions here.