Exam Code | MLS-C01 |
Exam Name | AWS Certified Machine Learning - Specialty |
Questions | 208 Questions Answers With Explanation |
Update Date | November 08,2024 |
Price |
Was : |
Are you ready to take your career to the next level with AWS Certified Machine Learning - Specialty? At Prep4Certs, we're dedicated to helping you achieve your goals by providing high-quality MLS-C01 Dumps and resources for a wide range of certification exams.
At Prep4Certs, we're committed to your success in the Amazon MLS-C01 exam. Our comprehensive study materials and resources are designed to equip you with the knowledge and skills needed to ace the exam with confidence:
Start Your Certification Journey Today
Whether you're looking to advance your career, expand your skill set, or pursue new opportunities, Prep4Certs is here to support you on your certification journey. Explore our comprehensive study materials, take your exam preparation to the next level, and unlock new possibilities for professional growth and success.
Ready to achieve your certification goals? Begin your journey with Prep4Certs today!
A company is building a demand forecasting model based on machine learning (ML). In thedevelopment stage, an ML specialist uses an Amazon SageMaker notebook to performfeature engineering during work hours that consumes low amounts of CPU and memoryresources. A data engineer uses the same notebook to perform data preprocessing once aday on average that requires very high memory and completes in only 2 hours. The datapreprocessing is not configured to use GPU. All the processes are running well on anml.m5.4xlarge notebook instance.The company receives an AWS Budgets alert that the billing for this month exceeds theallocated budget.Which solution will result in the MOST cost savings?
A. Change the notebook instance type to a memory optimized instance with the samevCPU number as the ml.m5.4xlarge instance has. Stop the notebook when it is not in use.Run both data preprocessing and feature engineering development on that instance.
B. Keep the notebook instance type and size the same. Stop the notebook when it is not inuse. Run data preprocessing on a P3 instance type with the same memory as theml.m5.4xlarge instance by using Amazon SageMaker Processing.
C. Change the notebook instance type to a smaller general purpose instance. Stop thenotebook when it is not in use. Run data preprocessing on an ml.r5 instance with the samememory size as the ml.m5.4xlarge instance by using Amazon SageMaker Processing.
D. Change the notebook instance type to a smaller general purpose instance. Stop thenotebook when it is not in use. Run data preprocessing on an R5 instance with the samememory size as the ml.m5.4xlarge instance by using the Reserved Instance option.
A manufacturing company wants to use machine learning (ML) to automate quality controlin its facilities. The facilities are in remote locations and have limited internet connectivity.The company has 20 of training data that consists of labeled images of defective productparts. The training data is in the corporate on-premises data center.The company will use this data to train a model for real-time defect detection in new partsas the parts move on a conveyor belt in the facilities. The company needs a solution thatminimizes costs for compute infrastructure and that maximizes the scalability of resourcesfor training. The solution also must facilitate the company’s use of an ML model in the lowconnectivity environments.Which solution will meet these requirements?
A. Move the training data to an Amazon S3 bucket. Train and evaluate the model by usingAmazon SageMaker. Optimize the model by using SageMaker Neo. Deploy the model on aSageMaker hosting services endpoint.
B. Train and evaluate the model on premises. Upload the model to an Amazon S3 bucket.Deploy the model on an Amazon SageMaker hosting services endpoint.
C. Move the training data to an Amazon S3 bucket. Train and evaluate the model by usingAmazon SageMaker. Optimize the model by using SageMaker Neo. Set up an edge devicein the manufacturing facilities with AWS IoT Greengrass. Deploy the model on the edgedevice.
D. Train the model on premises. Upload the model to an Amazon S3 bucket. Set up anedge device in the manufacturing facilities with AWS IoT Greengrass. Deploy the model onthe edge device.
A company is building a predictive maintenance model based on machine learning (ML).The data is stored in a fully private Amazon S3 bucket that is encrypted at rest with AWSKey Management Service (AWS KMS) CMKs. An ML specialist must run datapreprocessing by using an Amazon SageMaker Processing job that is triggered from codein an Amazon SageMaker notebook. The job should read data from Amazon S3, process it,and upload it back to the same S3 bucket. The preprocessing code is stored in a containerimage in Amazon Elastic Container Registry (Amazon ECR). The ML specialist needs togrant permissions to ensure a smooth data preprocessing workflowWhich set of actions should the ML specialist take to meet these requirements?
A. Create an IAM role that has permissions to create Amazon SageMaker Processing jobs,S3 read and write access to the relevant S3 bucket, and appropriate KMS and ECRpermissions. Attach the role to the SageMaker notebook instance. Create an AmazonSageMaker Processing job from the notebook.
B. Create an IAM role that has permissions to create Amazon SageMaker Processing jobs.Attach the role to the SageMaker notebook instance. Create an Amazon SageMakerProcessing job with an IAM role that has read and write permissions to the relevant S3bucket, and appropriate KMS and ECR permissions.
C. Create an IAM role that has permissions to create Amazon SageMaker Processing jobsand to access Amazon ECR. Attach the role to the SageMaker notebook instance. Set upboth an S3 endpoint and a KMS endpoint in the default VPC. Create Amazon SageMakerProcessing jobs from the notebook.
D. Create an IAM role that has permissions to create Amazon SageMaker Processing jobs.Attach the role to the SageMaker notebook instance. Set up an S3 endpoint in the defaultVPC. Create Amazon SageMaker Processing jobs with the access key and secret key ofthe IAM user with appropriate KMS and ECR permissions.
A machine learning specialist is developing a proof of concept for government users whoseprimary concern is security. The specialist is using Amazon SageMaker to train aconvolutional neural network (CNN) model for a photo classifier application. The specialistwants to protect the data so that it cannot be accessed and transferred to a remote host bymalicious code accidentally installed on the training container.Which action will provide the MOST secure protection?
A. Remove Amazon S3 access permissions from the SageMaker execution role.
B. Encrypt the weights of the CNN model.
C. Encrypt the training and validation dataset.
D. Enable network isolation for training jobs.
A company wants to create a data repository in the AWS Cloud for machine learning (ML)projects. The company wants to use AWS to perform complete ML lifecycles and wants touse Amazon S3 for the data storage. All of the company’s data currently resides onpremises and is 40 in size.The company wants a solution that can transfer and automatically update data between theon-premises object storage and Amazon S3. The solution must support encryption,scheduling, monitoring, and data integrity validation.Which solution meets these requirements?
A. Use the S3 sync command to compare the source S3 bucket and the destination S3bucket. Determine which source files do not exist in the destination S3 bucket and whichsource files were modified.
B. Use AWS Transfer for FTPS to transfer the files from the on-premises storage toAmazon S3.
C. Use AWS DataSync to make an initial copy of the entire dataset. Schedule subsequentincremental transfers of changing data until the final cutover from on premises to AWS.
D. Use S3 Batch Operations to pull data periodically from the on-premises storage. EnableS3 Versioning on the S3 bucket to protect against accidental overwrites.
A machine learning (ML) specialist must develop a classification model for a financialservices company. A domain expert provides the dataset, which is tabular with 10,000 rowsand 1,020 features. During exploratory data analysis, the specialist finds no missing valuesand a small percentage of duplicate rows. There are correlation scores of > 0.9 for 200feature pairs. The mean value of each feature is similar to its 50th percentile.Which feature engineering strategy should the ML specialist use with Amazon SageMaker?
A. Apply dimensionality reduction by using the principal component analysis (PCA)algorithm.
B. Drop the features with low correlation scores by using a Jupyter notebook.
C. Apply anomaly detection by using the Random Cut Forest (RCF) algorithm.
D. Concatenate the features with high correlation scores by using a Jupyter notebook.
A Machine Learning Specialist is designing a scalable data storage solution for AmazonSageMaker. There is an existing TensorFlow-based model implemented as a train.py scriptthat relies on static training data that is currently stored as TFRecordsWhich method of providing training data to Amazon SageMaker would meet the businessrequirements with the LEAST development overhead?
A. Use Amazon SageMaker script mode and use train.py unchanged. Point the AmazonSageMaker training invocation to the local path of the data without reformatting the trainingdata.
B. Use Amazon SageMaker script mode and use train.py unchanged. Put the TFRecorddata into an Amazon S3 bucket. Point the Amazon SageMaker training invocation to the S3bucket without reformatting the training data.
C. Rewrite the train.py script to add a section that converts TFRecords to protobuf andingests the protobuf data instead of TFRecords.
D. Prepare the data in the format accepted by Amazon SageMaker. Use AWS Glue orAWS Lambda to reformat and store the data in an Amazon S3 bucket.
A data scientist is using the Amazon SageMaker Neural Topic Model (NTM) algorithm tobuild a model that recommends tags from blog posts. The raw blog post data is stored inan Amazon S3 bucket in JSON format. During model evaluation, the data scientistdiscovered that the model recommends certain stopwords such as "a," "an,” and "the" astags to certain blog posts, along with a few rare words that are present only in certain blogentries. After a few iterations of tag review with the content team, the data scientist noticesthat the rare words are unusual but feasible. The data scientist also must ensure that thetag recommendations of the generated model do not include the stopwords.What should the data scientist do to meet these requirements?
A. Use the Amazon Comprehend entity recognition API operations. Remove the detectedwords from the blog post data. Replace the blog post data source in the S3 bucket.
B. Run the SageMaker built-in principal component analysis (PCA) algorithm with the blogpost data from the S3 bucket as the data source. Replace the blog post data in the S3bucket with the results of the training job.
C. Use the SageMaker built-in Object Detection algorithm instead of the NTM algorithm forthe training job to process the blog post data.
D. Remove the stopwords from the blog post data by using the Count Vectorizer function inthe scikit-learn library. Replace the blog post data in the S3 bucket with the results of thevectorizer.
A Data Scientist received a set of insurance records, each consisting of a record ID, thefinal outcome among200 categories, and the date of the final outcome. Some partial information on claimcontents is also provided,but only for a few of the 200 categories. For each outcome category, there are hundreds ofrecords distributedover the past 3 years. The Data Scientist wants to predict how many claims to expect ineach category from month to month, a few months in advance.What type of machine learning model should be used?
A. Classification month-to-month using supervised learning of the 200 categories based onclaim contents.
B. Reinforcement learning using claim IDs and timestamps where the agent will identifyhow many claims in each category to expect from month to month.
C. Forecasting using claim IDs and timestamps to identify how many claims in eachcategory to expect from month to month.
D. Classification with supervised learning of the categories for which partial information onclaim contents is provided, and forecasting using claim IDs and timestamps for all other categories.
A Machine Learning Specialist uploads a dataset to an Amazon S3 bucket protected withserver-sideencryption using AWS KMS.How should the ML Specialist define the Amazon SageMaker notebook instance so it canread the samedataset from Amazon S3?
A. Define security group(s) to allow all HTTP inbound/outbound traffic and assign thosesecurity group(s) to the Amazon SageMaker notebook instance.
B. onfigure the Amazon SageMaker notebook instance to have access to the VPC. Grantpermission in the KMS key policy to the notebook’s KMS role.
C. Assign an IAM role to the Amazon SageMaker notebook with S3 read access to thedataset. Grant permission in the KMS key policy to that role.
D. Assign the same KMS key used to encrypt data in Amazon S3 to the AmazonSageMaker notebook instance.