{"id":128173,"date":"2026-06-08T08:36:02","date_gmt":"2026-06-08T08:36:02","guid":{"rendered":"https:\/\/www.dumpsbase.com\/freedumps\/?p=128173"},"modified":"2026-06-08T08:36:06","modified_gmt":"2026-06-08T08:36:06","slug":"amazon-dea-c01-exam-questions-2026-v13-02-free-dumps-part-1-q1-q40-for-aws-data-engineer-prep","status":"publish","type":"post","link":"https:\/\/www.dumpsbase.com\/freedumps\/amazon-dea-c01-exam-questions-2026-v13-02-free-dumps-part-1-q1-q40-for-aws-data-engineer-prep.html","title":{"rendered":"Amazon DEA-C01 Exam Questions 2026 V13.02: Free Dumps (Part 1, Q1-Q40) for AWS Data Engineer Prep"},"content":{"rendered":"\n<p>DumpsBase address your challenge of AWS Certified Data Engineer &#8211; Associate (DEA-C01) exam preparation by offering the most updated DEA-C01 dumps. We have updated the dumps to V13.02, which are designed to reflect modern question formats, helping you focus only on relevant topics instead of wasting time on outdated content. We have 294 practice questions and answers in V13.02, offering you a more targeted approach to build your confidence and maintain consistency throughout your preparation journey. To help you know more about the DEA-C01 dumps (V13.02), we will share our free demo questions, as a demo of V13.02. These free questions will help you check the quality of these exam questions. Also, you can find that our updated DEA-C01 dumps (V13.02) ensure you are studying content aligned with the latest exam objectives and difficulty levels. Smart preparation with current Amazon DEA-C01 dumps ultimately leads to better retention and improved exam readiness.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Start with Amazon DEA-C01 free dumps (Part 1, Q1-Q40) of V13.02 below:<\/h2>\n\n\n<script>\n\t  window.fbAsyncInit = function() {\n\t    FB.init({\n\t      appId            : '622169541470367',\n\t      autoLogAppEvents : true,\n\t      xfbml            : true,\n\t      version          : 'v3.1'\n\t    });\n\t  };\n\t\n\t  (function(d, s, id){\n\t     var js, fjs = d.getElementsByTagName(s)[0];\n\t     if (d.getElementById(id)) {return;}\n\t     js = d.createElement(s); js.id = id;\n\t     js.src = \"https:\/\/connect.facebook.net\/en_US\/sdk.js\";\n\t     fjs.parentNode.insertBefore(js, fjs);\n\t   }(document, 'script', 'facebook-jssdk'));\n\t<\/script><script type=\"text\/javascript\" >\ndocument.addEventListener(\"DOMContentLoaded\", function(event) { \nif(!window.jQuery) alert(\"The important jQuery library is not properly loaded in your site. Your WordPress theme is probably missing the essential wp_head() call. You can switch to another theme and you will see that the plugin works fine and this notice disappears. If you are still not sure what to do you can contact us for help.\");\n});\n<\/script>  \n  \n<div  id=\"watupro_quiz\" class=\"quiz-area single-page-quiz\">\n<p id=\"submittingExam12348\" style=\"display:none;text-align:center;\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/www.dumpsbase.com\/freedumps\/wp-content\/plugins\/watupro\/img\/loading.gif\" width=\"16\" height=\"16\"><\/p>\n\n<div class=\"watupro-exam-description\" id=\"description-quiz-12348\"><\/div>\n\n<form action=\"\" method=\"post\" class=\"quiz-form\" id=\"quiz-12348\"  enctype=\"multipart\/form-data\" >\n<div class='watu-question ' id='question-1' style=';'><div id='questionWrap-1'  class='   watupro-question-id-482189'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>1. <\/span>A transportation company wants to track vehicle movements by capturing geolocation records. The records are 10 bytes in size. The company receives up to 10,000 records every second. Data transmission delays of a few minutes are acceptable because of unreliable network conditions. <br \/>\r<br>The transportation company wants to use Amazon Kinesis Data Streams to ingest the geolocation data. The company needs a reliable mechanism to send data to Kinesis Data Streams. The company needs to maximize the throughput efficiency of the Kinesis shards. <br \/>\r<br>Which solution will meet these requirements in the MOST operationally efficient way?<\/div><input type='hidden' name='question_id[]' id='qID_1' value='482189' \/><input type='hidden' id='answerType482189' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482189[]' id='answer-id-1862659' class='answer   answerof-482189 ' value='1862659'   \/><label for='answer-id-1862659' id='answer-label-1862659' class=' answer'><span>Kinesis Agent<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482189[]' id='answer-id-1862660' class='answer   answerof-482189 ' value='1862660'   \/><label for='answer-id-1862660' id='answer-label-1862660' class=' answer'><span>Kinesis Producer Library (KPL)<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482189[]' id='answer-id-1862661' class='answer   answerof-482189 ' value='1862661'   \/><label for='answer-id-1862661' id='answer-label-1862661' class=' answer'><span>Amazon Data Firehose<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482189[]' id='answer-id-1862662' class='answer   answerof-482189 ' value='1862662'   \/><label for='answer-id-1862662' id='answer-label-1862662' class=' answer'><span>Kinesis SDK<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-2' style=';'><div id='questionWrap-2'  class='   watupro-question-id-482190'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>2. <\/span>A retail company uses Amazon Aurora PostgreSQL to process and store live transactional data. The company uses an Amazon Redshift cluster for a data warehouse. <br \/>\r<br>An extract, transform, and load (ETL) job runs every morning to update the Redshift cluster with new data from the PostgreSQL database. The company has grown rapidly and needs to cost optimize the Redshift cluster. <br \/>\r<br>A data engineer needs to create a solution to archive historical data. The data engineer must be able to run analytics queries that effectively combine data from live transactional data in PostgreSQL, current data in Redshift, and archived historical data. The solution must keep only the most recent 15 months of data in Amazon Redshift to reduce costs. <br \/>\r<br>Which combination of steps will meet these requirements? (Select TWO.)<\/div><input type='hidden' name='question_id[]' id='qID_2' value='482190' \/><input type='hidden' id='answerType482190' value='checkbox'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-482190[]' id='answer-id-1862663' class='answer   answerof-482190 ' value='1862663'   \/><label for='answer-id-1862663' id='answer-label-1862663' class=' answer'><span>Configure the Amazon Redshift Federated Query feature to query live transactional data that is in the PostgreSQL database.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-482190[]' id='answer-id-1862664' class='answer   answerof-482190 ' value='1862664'   \/><label for='answer-id-1862664' id='answer-label-1862664' class=' answer'><span>Configure Amazon Redshift Spectrum to query live transactional data that is in the PostgreSQL database.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-482190[]' id='answer-id-1862665' class='answer   answerof-482190 ' value='1862665'   \/><label for='answer-id-1862665' id='answer-label-1862665' class=' answer'><span>Schedule a monthly job to copy data that is older than 15 months to Amazon S3 by using the UNLOAD command. Delete the old data from the Redshift cluster. Configure Amazon Redshift Spectrum to access historical data in Amazon S3.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-482190[]' id='answer-id-1862666' class='answer   answerof-482190 ' value='1862666'   \/><label for='answer-id-1862666' id='answer-label-1862666' class=' answer'><span>Schedule a monthly job to copy data that is older than 15 months to Amazon S3 Glacier Flexible Retrieval by using the UNLOAD command. Delete the old data from the Redshift duster. Configure Redshift Spectrum to access historical data from S3 Glacier Flexible Retrieval.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-482190[]' id='answer-id-1862667' class='answer   answerof-482190 ' value='1862667'   \/><label for='answer-id-1862667' id='answer-label-1862667' class=' answer'><span>Create a materialized view in Amazon Redshift that combines live, current, and historical data from different sources.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-3' style=';'><div id='questionWrap-3'  class='   watupro-question-id-482191'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>3. <\/span>A company needs a solution that restricts access to Amazon S3 data and encrypts the data by using AWS managed keys. The solution must manage database credentials that an AWS Lambda function uses and must rotate the credentials automatically. <br \/>\r<br>Which solution will meet these requirements?<\/div><input type='hidden' name='question_id[]' id='qID_3' value='482191' \/><input type='hidden' id='answerType482191' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482191[]' id='answer-id-1862668' class='answer   answerof-482191 ' value='1862668'   \/><label for='answer-id-1862668' id='answer-label-1862668' class=' answer'><span>Use S3 bucket policies to control access. Use server-side encryption with Amazon S3 managed keys (SSE-S3) to encrypt the data. Store the database credentials as Lambda environment variables.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482191[]' id='answer-id-1862669' class='answer   answerof-482191 ' value='1862669'   \/><label for='answer-id-1862669' id='answer-label-1862669' class=' answer'><span>Use IAM policies to control access. Use server-side encryption with AWS KMS keys (SSE-KMS) to encrypt the data. Configure AWS Secrets Manager to store and automatically rotate the credentials by using a Lambda function.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482191[]' id='answer-id-1862670' class='answer   answerof-482191 ' value='1862670'   \/><label for='answer-id-1862670' id='answer-label-1862670' class=' answer'><span>Use S3 ACLs to control access. Use server-side encryption with AWS KMS keys (SSE-KMS) to encrypt the data. Store the credentials in AWS Systems Manager Parameter Store and automatically rotate the credentials by using a Lambda function.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482191[]' id='answer-id-1862671' class='answer   answerof-482191 ' value='1862671'   \/><label for='answer-id-1862671' id='answer-label-1862671' class=' answer'><span>Use IAM policies to control access. Use server-side encryption with Amazon S3 managed keys (SSE-S3) to encrypt the data. Store the credentials in AWS Systems Manager Parameter Store. Configure a scheduled Lambda function to rotate the credentials.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-4' style=';'><div id='questionWrap-4'  class='   watupro-question-id-482192'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>4. <\/span>A company receives a data file from a partner each day in an Amazon S3 bucket. The company uses a daily AW5 Glue extract, transform, and load (ETL) pipeline to clean and transform each data file. The output of the ETL pipeline is written to a CSV file named Dairy.csv in a second 53 bucket. <br \/>\r<br>Occasionally, the daily data file is empty or is missing values for required fields. When the file is missing data, the company can use the previous day's CSV file. <br \/>\r<br>A data engineer needs to ensure that the previous day's data file is overwritten only if the new daily file is complete and valid. <br \/>\r<br>Which solution will meet these requirements with the LEAST effort?<\/div><input type='hidden' name='question_id[]' id='qID_4' value='482192' \/><input type='hidden' id='answerType482192' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482192[]' id='answer-id-1862672' class='answer   answerof-482192 ' value='1862672'   \/><label for='answer-id-1862672' id='answer-label-1862672' class=' answer'><span>Invoke an AWS Lambda function to check the file for missing data and to fill in missing values in required fields.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482192[]' id='answer-id-1862673' class='answer   answerof-482192 ' value='1862673'   \/><label for='answer-id-1862673' id='answer-label-1862673' class=' answer'><span>Configure the AWS Glue ETL pipeline to use AWS Glue Data Quality rules. Develop rules in Data Quality Definition Language (DQDL) to check for missing values in required files and empty files.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482192[]' id='answer-id-1862674' class='answer   answerof-482192 ' value='1862674'   \/><label for='answer-id-1862674' id='answer-label-1862674' class=' answer'><span>Use AWS Glue Studio to change the code in the ETL pipeline to fill in any missing values in the required fields with the most common values for each field.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482192[]' id='answer-id-1862675' class='answer   answerof-482192 ' value='1862675'   \/><label for='answer-id-1862675' id='answer-label-1862675' class=' answer'><span>Run a SQL query in Amazon Athena to read the CSV file and drop missing rows. Copy the corrected CSV file to the second S3 bucket.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-5' style=';'><div id='questionWrap-5'  class='   watupro-question-id-482193'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>5. <\/span>A company uses an Amazon Redshift provisioned cluster as its database. The Redshift cluster has five reserved ra3.4xlarge nodes and uses key distribution. <br \/>\r<br>A data engineer notices that one of the nodes frequently has a CPU load over 90%. SQL Queries that run on the node are queued. The other four nodes usually have a CPU load under 15% during daily operations. <br \/>\r<br>The data engineer wants to maintain the current number of compute nodes. The data engineer also wants to balance the load more evenly across all five compute nodes. <br \/>\r<br>Which solution will meet these requirements?<\/div><input type='hidden' name='question_id[]' id='qID_5' value='482193' \/><input type='hidden' id='answerType482193' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482193[]' id='answer-id-1862676' class='answer   answerof-482193 ' value='1862676'   \/><label for='answer-id-1862676' id='answer-label-1862676' class=' answer'><span>Change the sort key to be the data column that is most often used in a WHERE clause of the SQL SELECT statement.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482193[]' id='answer-id-1862677' class='answer   answerof-482193 ' value='1862677'   \/><label for='answer-id-1862677' id='answer-label-1862677' class=' answer'><span>Change the distribution key to the table column that has the largest dimension.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482193[]' id='answer-id-1862678' class='answer   answerof-482193 ' value='1862678'   \/><label for='answer-id-1862678' id='answer-label-1862678' class=' answer'><span>Upgrade the reserved node from ra3.4xlarqe to ra3.16xlarqe.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482193[]' id='answer-id-1862679' class='answer   answerof-482193 ' value='1862679'   \/><label for='answer-id-1862679' id='answer-label-1862679' class=' answer'><span>Change the primary key to be the data column that is most often used in a WHERE clause of the SQL SELECT statement.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-6' style=';'><div id='questionWrap-6'  class='   watupro-question-id-482194'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>6. <\/span>A company uses Amazon Redshift as its data warehouse service. A data engineer needs to design a physical data model. <br \/>\r<br>The data engineer encounters a de-normalized table that is growing in size. The table does not have a suitable column to use as the distribution key. <br \/>\r<br>Which distribution style should the data engineer use to meet these requirements with the LEAST maintenance overhead?<\/div><input type='hidden' name='question_id[]' id='qID_6' value='482194' \/><input type='hidden' id='answerType482194' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482194[]' id='answer-id-1862680' class='answer   answerof-482194 ' value='1862680'   \/><label for='answer-id-1862680' id='answer-label-1862680' class=' answer'><span>ALL distribution<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482194[]' id='answer-id-1862681' class='answer   answerof-482194 ' value='1862681'   \/><label for='answer-id-1862681' id='answer-label-1862681' class=' answer'><span>EVEN distribution<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482194[]' id='answer-id-1862682' class='answer   answerof-482194 ' value='1862682'   \/><label for='answer-id-1862682' id='answer-label-1862682' class=' answer'><span>AUTO distribution<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482194[]' id='answer-id-1862683' class='answer   answerof-482194 ' value='1862683'   \/><label for='answer-id-1862683' id='answer-label-1862683' class=' answer'><span>KEY distribution<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-7' style=';'><div id='questionWrap-7'  class='   watupro-question-id-482195'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>7. <\/span>A company receives a daily file that contains customer data in .xls format. The company stores the file in Amazon S3. The daily file is approximately 2 GB in size. <br \/>\r<br>A data engineer concatenates the column in the file that contains customer first names and the column that contains customer last names. The data engineer needs to determine the number of distinct customers in the file. <br \/>\r<br>Which solution will meet this requirement with the LEAST operational effort?<\/div><input type='hidden' name='question_id[]' id='qID_7' value='482195' \/><input type='hidden' id='answerType482195' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482195[]' id='answer-id-1862684' class='answer   answerof-482195 ' value='1862684'   \/><label for='answer-id-1862684' id='answer-label-1862684' class=' answer'><span>Create and run an Apache Spark job in an AWS Glue notebook. Configure the job to read the S3 file and calculate the number of distinct customers.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482195[]' id='answer-id-1862685' class='answer   answerof-482195 ' value='1862685'   \/><label for='answer-id-1862685' id='answer-label-1862685' class=' answer'><span>Create an AWS Glue crawler to create an AWS Glue Data Catalog of the S3 file. Run SQL queries from Amazon Athena to calculate the number of distinct customers.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482195[]' id='answer-id-1862686' class='answer   answerof-482195 ' value='1862686'   \/><label for='answer-id-1862686' id='answer-label-1862686' class=' answer'><span>Create and run an Apache Spark job in Amazon EMR Serverless to calculate the number of distinct customers.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482195[]' id='answer-id-1862687' class='answer   answerof-482195 ' value='1862687'   \/><label for='answer-id-1862687' id='answer-label-1862687' class=' answer'><span>Use AWS Glue DataBrew to create a recipe that uses the COUNT_DISTINCT aggregate function to calculate the number of distinct customers.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-8' style=';'><div id='questionWrap-8'  class='   watupro-question-id-482196'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>8. <\/span>A data engineer needs to deploy a complex pipeline. The stages of the pipeline must run scripts, but only fully managed and serverless services can be used.<\/div><input type='hidden' name='question_id[]' id='qID_8' value='482196' \/><input type='hidden' id='answerType482196' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482196[]' id='answer-id-1862688' class='answer   answerof-482196 ' value='1862688'   \/><label for='answer-id-1862688' id='answer-label-1862688' class=' answer'><span>Deploy AWS Glue jobs and workflows. Use AWS Glue to run the jobs and workflows on a schedule.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482196[]' id='answer-id-1862689' class='answer   answerof-482196 ' value='1862689'   \/><label for='answer-id-1862689' id='answer-label-1862689' class=' answer'><span>Use Amazon MWAA to build and schedule the pipeline.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482196[]' id='answer-id-1862690' class='answer   answerof-482196 ' value='1862690'   \/><label for='answer-id-1862690' id='answer-label-1862690' class=' answer'><span>Deploy the script to EC2. Use EventBridge to schedule it.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482196[]' id='answer-id-1862691' class='answer   answerof-482196 ' value='1862691'   \/><label for='answer-id-1862691' id='answer-label-1862691' class=' answer'><span>Use AWS Glue DataBrew and EventBridge to run on a schedule.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-9' style=';'><div id='questionWrap-9'  class='   watupro-question-id-482197'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>9. <\/span>A data engineer develops an AWS Glue Apache Spark ETL job to perform transformations on a dataset. When the data engineer runs the job, the job returns an error that reads, &quot;No space left on device.&quot; <br \/>\r<br>The data engineer needs to identify the source of the error and provide a solution. <br \/>\r<br>Which combinations of steps will meet this requirement MOST cost-effectively? (Select TWO.)<\/div><input type='hidden' name='question_id[]' id='qID_9' value='482197' \/><input type='hidden' id='answerType482197' value='checkbox'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-482197[]' id='answer-id-1862692' class='answer   answerof-482197 ' value='1862692'   \/><label for='answer-id-1862692' id='answer-label-1862692' class=' answer'><span>Scale out the workers vertically to address data skewness.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-482197[]' id='answer-id-1862693' class='answer   answerof-482197 ' value='1862693'   \/><label for='answer-id-1862693' id='answer-label-1862693' class=' answer'><span>Use the Spark UI and AWS Glue metrics to monitor data skew in the Spark executors.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-482197[]' id='answer-id-1862694' class='answer   answerof-482197 ' value='1862694'   \/><label for='answer-id-1862694' id='answer-label-1862694' class=' answer'><span>Scale out the number of workers horizontally to address data skewness.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-482197[]' id='answer-id-1862695' class='answer   answerof-482197 ' value='1862695'   \/><label for='answer-id-1862695' id='answer-label-1862695' class=' answer'><span>Enable the --write-shuffle-files-to-s3 job parameter. Use the salting technique.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-482197[]' id='answer-id-1862696' class='answer   answerof-482197 ' value='1862696'   \/><label for='answer-id-1862696' id='answer-label-1862696' class=' answer'><span>Use error logs in Amazon CloudWatch to monitor data skew.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-10' style=';'><div id='questionWrap-10'  class='   watupro-question-id-482198'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>10. <\/span>A company needs to build a data lake in AWS. The company must provide row-level data access and column-level data access to specific teams. The teams will access the data by using Amazon Athena, Amazon Redshift Spectrum, and Apache Hive from Amazon EMR. <br \/>\r<br>Which solution will meet these requirements with the LEAST operational overhead?<\/div><input type='hidden' name='question_id[]' id='qID_10' value='482198' \/><input type='hidden' id='answerType482198' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482198[]' id='answer-id-1862697' class='answer   answerof-482198 ' value='1862697'   \/><label for='answer-id-1862697' id='answer-label-1862697' class=' answer'><span>Use Amazon S3 for data lake storage. Use S3 access policies to restrict data access by rows and columns. Provide data access through Amazon S3.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482198[]' id='answer-id-1862698' class='answer   answerof-482198 ' value='1862698'   \/><label for='answer-id-1862698' id='answer-label-1862698' class=' answer'><span>Use Amazon S3 for data lake storage. Use Apache Ranger through Amazon EMR to restrict data access by rows and columns. Provide data access by using Apache Pig.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482198[]' id='answer-id-1862699' class='answer   answerof-482198 ' value='1862699'   \/><label for='answer-id-1862699' id='answer-label-1862699' class=' answer'><span>Use Amazon Redshift for data lake storage. Use Redshift security policies to restrict data access by rows and columns. Provide data access by using Apache Spark and Amazon Athena federated queries.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482198[]' id='answer-id-1862700' class='answer   answerof-482198 ' value='1862700'   \/><label for='answer-id-1862700' id='answer-label-1862700' class=' answer'><span>Use Amazon S3 for data lake storage. Use AWS Lake Formation to restrict data access by rows and columns. Provide data access through AWS Lake Formation.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-11' style=';'><div id='questionWrap-11'  class='   watupro-question-id-482199'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>11. <\/span>A company has an on-premises PostgreSQL database that contains customer data. The company wants to migrate the customer data to an Amazon Redshift data warehouse. The company has established a VPN connection between the on-premises database and AWS. <br \/>\r<br>The on-premises database is continuously updated. The company must ensure that the data in Amazon Redshift is updated as quickly as possible. <br \/>\r<br>Which solution will meet these requirements?<\/div><input type='hidden' name='question_id[]' id='qID_11' value='482199' \/><input type='hidden' id='answerType482199' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482199[]' id='answer-id-1862701' class='answer   answerof-482199 ' value='1862701'   \/><label for='answer-id-1862701' id='answer-label-1862701' class=' answer'><span>Use the pg_dump utility to generate a backup of the PostgreSQL database. Use the AWS Schema Conversion Tool (AWS SCT) to upload the backup to Amazon Redshift. Set up a cron job to perform a backup. Upload the backup to Amazon Redshift every night.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482199[]' id='answer-id-1862702' class='answer   answerof-482199 ' value='1862702'   \/><label for='answer-id-1862702' id='answer-label-1862702' class=' answer'><span>Create an AWS Database Migration Service (AWS DMS) full-load task. Set Amazon Redshift as the target. Configure the task to use the change data capture (CDC) feature.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482199[]' id='answer-id-1862703' class='answer   answerof-482199 ' value='1862703'   \/><label for='answer-id-1862703' id='answer-label-1862703' class=' answer'><span>Use the pg_dump utility to generate a backup of the PostgreSQL database. Upload the backup to an Amazon S3 bucket. Use the COPY command to import the data into Amazon Redshift.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482199[]' id='answer-id-1862704' class='answer   answerof-482199 ' value='1862704'   \/><label for='answer-id-1862704' id='answer-label-1862704' class=' answer'><span>Create an AWS Database Migration Service (AWS DMS) full-load task. Set Amazon Redshift as the target. Configure the task to perform a full load of the database to Amazon Redshift every night.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-12' style=';'><div id='questionWrap-12'  class='   watupro-question-id-482200'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>12. <\/span>A company has an application that uses a microservice architecture. The company hosts the application on an Amazon Elastic Kubernetes Services (Amazon EKS) cluster. <br \/>\r<br>The company wants to set up a robust monitoring system for the application. The company needs to analyze the logs from the EKS cluster and the application. The company needs to correlate the cluster's logs with the application's traces to identify points of failure in the whole application request flow. <br \/>\r<br>Which combination of steps will meet these requirements with the LEAST development effort? (Select TWO.)<\/div><input type='hidden' name='question_id[]' id='qID_12' value='482200' \/><input type='hidden' id='answerType482200' value='checkbox'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-482200[]' id='answer-id-1862705' class='answer   answerof-482200 ' value='1862705'   \/><label for='answer-id-1862705' id='answer-label-1862705' class=' answer'><span>Use FluentBit to collect logs. Use Open Telemetry to collect traces.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-482200[]' id='answer-id-1862706' class='answer   answerof-482200 ' value='1862706'   \/><label for='answer-id-1862706' id='answer-label-1862706' class=' answer'><span>Use Amazon CloudWatch to collect logs. Use Amazon Kinesis to collect traces.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-482200[]' id='answer-id-1862707' class='answer   answerof-482200 ' value='1862707'   \/><label for='answer-id-1862707' id='answer-label-1862707' class=' answer'><span>Use Amazon CloudWatch to collect logs. Use Amazon Managed Streaming for Apache Kafka (Amazon MSK) to collect traces.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-482200[]' id='answer-id-1862708' class='answer   answerof-482200 ' value='1862708'   \/><label for='answer-id-1862708' id='answer-label-1862708' class=' answer'><span>Use Amazon OpenSearch to correlate the logs and traces.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-482200[]' id='answer-id-1862709' class='answer   answerof-482200 ' value='1862709'   \/><label for='answer-id-1862709' id='answer-label-1862709' class=' answer'><span>Use AWS Glue to correlate the logs and traces.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-13' style=';'><div id='questionWrap-13'  class='   watupro-question-id-482201'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>13. <\/span>A company uses AWS Glue ETL pipelines to process data. The company uses Amazon Athena to analyze data in an Amazon S3 bucket. <br \/>\r<br>To better understand shipping timelines, the company decides to collect and store shipping dates and delivery dates in addition to order data. The company adds a data quality check to ensure that the shipping date is later than the order date and that the delivery date is later than the shipping date. Orders that fail the quality check must be stored in a second Amazon S3 bucket. <br \/>\r<br>Which solution will meet these requirements in the MOST cost-effective way?<\/div><input type='hidden' name='question_id[]' id='qID_13' value='482201' \/><input type='hidden' id='answerType482201' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482201[]' id='answer-id-1862710' class='answer   answerof-482201 ' value='1862710'   \/><label for='answer-id-1862710' id='answer-label-1862710' class=' answer'><span>Use AWS Glue DataBrew DATEDIFF functions to create two additional columns. Validate the new columns. Write failed records to a second S3 bucket.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482201[]' id='answer-id-1862711' class='answer   answerof-482201 ' value='1862711'   \/><label for='answer-id-1862711' id='answer-label-1862711' class=' answer'><span>Use Amazon Athena to query the three date columns and compare the values. Export failed records to a second S3 bucket.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482201[]' id='answer-id-1862712' class='answer   answerof-482201 ' value='1862712'   \/><label for='answer-id-1862712' id='answer-label-1862712' class=' answer'><span>Use AWS Glue Data Quality to create a custom rule that validates the three date columns. Route records that fail the rule to a second S3 bucket.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482201[]' id='answer-id-1862713' class='answer   answerof-482201 ' value='1862713'   \/><label for='answer-id-1862713' id='answer-label-1862713' class=' answer'><span>Use an AWS Glue crawler to populate the AWS Glue Data Catalog. Use the three date columns to create a filter.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-14' style=';'><div id='questionWrap-14'  class='   watupro-question-id-482202'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>14. <\/span>A company uses Amazon S3 buckets, AWS Glue tables, and Amazon Athena as components of a data lake. Recently, the company expanded its sales range to multiple new states. The company wants to introduce state names as a new partition to the existing S3 bucket, which is currently partitioned by date. <br \/>\r<br>The company needs to ensure that additional partitions will not disrupt daily synchronization between the AWS Glue Data Catalog and the S3 buckets. <br \/>\r<br>Which solution will meet these requirements with the LEAST operational overhead?<\/div><input type='hidden' name='question_id[]' id='qID_14' value='482202' \/><input type='hidden' id='answerType482202' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482202[]' id='answer-id-1862714' class='answer   answerof-482202 ' value='1862714'   \/><label for='answer-id-1862714' id='answer-label-1862714' class=' answer'><span>Use the AWS Glue API to manually update the Data Catalog.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482202[]' id='answer-id-1862715' class='answer   answerof-482202 ' value='1862715'   \/><label for='answer-id-1862715' id='answer-label-1862715' class=' answer'><span>Run an MSCK REPAIR TABLE command in Athena.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482202[]' id='answer-id-1862716' class='answer   answerof-482202 ' value='1862716'   \/><label for='answer-id-1862716' id='answer-label-1862716' class=' answer'><span>Schedule an AWS Glue crawler to periodically update the Data Catalog.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482202[]' id='answer-id-1862717' class='answer   answerof-482202 ' value='1862717'   \/><label for='answer-id-1862717' id='answer-label-1862717' class=' answer'><span>Run a REFRESH TABLE command in Athena.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-15' style=';'><div id='questionWrap-15'  class='   watupro-question-id-482203'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>15. <\/span>A company uses an Amazon Redshift cluster as a data warehouse that is shared across two departments. To comply with a security policy, each department must have unique access permissions. <br \/>\r<br>Department A must have access to tables and views for Department A. Department B must have access to tables and views for Department B. <br \/>\r<br>The company often runs SQL queries that use objects from both departments in one query. <br \/>\r<br>Which solution will meet these requirements with the LEAST operational overhead?<\/div><input type='hidden' name='question_id[]' id='qID_15' value='482203' \/><input type='hidden' id='answerType482203' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482203[]' id='answer-id-1862718' class='answer   answerof-482203 ' value='1862718'   \/><label for='answer-id-1862718' id='answer-label-1862718' class=' answer'><span>Group tables and views for each department into dedicated schemas. Manage permissions at the schema level.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482203[]' id='answer-id-1862719' class='answer   answerof-482203 ' value='1862719'   \/><label for='answer-id-1862719' id='answer-label-1862719' class=' answer'><span>Group tables and views for each department into dedicated databases. Manage permissions at the database level.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482203[]' id='answer-id-1862720' class='answer   answerof-482203 ' value='1862720'   \/><label for='answer-id-1862720' id='answer-label-1862720' class=' answer'><span>Update the names of the tables and views to follow a naming convention that contains the department names. Manage permissions based on the new naming convention.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482203[]' id='answer-id-1862721' class='answer   answerof-482203 ' value='1862721'   \/><label for='answer-id-1862721' id='answer-label-1862721' class=' answer'><span>Create an IAM user group for each department. Use identity-based IAM policies to grant table and view permissions based on the IAM user group.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-16' style=';'><div id='questionWrap-16'  class='   watupro-question-id-482204'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>16. <\/span>A company uses Amazon Redshift as its data warehouse. Data encoding is applied to the existing tables of the data warehouse. A data engineer discovers that the compression encoding applied to some of the tables is not the best fit for the data. <br \/>\r<br>The data engineer needs to improve the data encoding for the tables that have sub-optimal encoding. <br \/>\r<br>Which solution will meet this requirement?<\/div><input type='hidden' name='question_id[]' id='qID_16' value='482204' \/><input type='hidden' id='answerType482204' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482204[]' id='answer-id-1862722' class='answer   answerof-482204 ' value='1862722'   \/><label for='answer-id-1862722' id='answer-label-1862722' class=' answer'><span>Run the ANALYZE command against the identified tables. Manually update the compression encoding of columns based on the output of the command.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482204[]' id='answer-id-1862723' class='answer   answerof-482204 ' value='1862723'   \/><label for='answer-id-1862723' id='answer-label-1862723' class=' answer'><span>Run the ANALYZE COMPRESSION command against the identified tables. Manually update the compression encoding of columns based on the output of the command.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482204[]' id='answer-id-1862724' class='answer   answerof-482204 ' value='1862724'   \/><label for='answer-id-1862724' id='answer-label-1862724' class=' answer'><span>Run the VACUUM REINDEX command against the identified tables.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482204[]' id='answer-id-1862725' class='answer   answerof-482204 ' value='1862725'   \/><label for='answer-id-1862725' id='answer-label-1862725' class=' answer'><span>Run the VACUUM RECLUSTER command against the identified tables.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-17' style=';'><div id='questionWrap-17'  class='   watupro-question-id-482205'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>17. <\/span>A global finance company needs to implement near real-time cross-Region synchronization of trading data between trading centers in the us-east-1 Region, the eu-west-2 Region, and the ap-northeast-1 Region. The company must ensure that data is encrypted in transit. The solution must ensure data ordering and consistency and must support cross-Region disaster recovery. The solution must provide data latency of less than 500 milliseconds. <br \/>\r<br>Which solution will meet these requirements with the LEAST operational effort?<\/div><input type='hidden' name='question_id[]' id='qID_17' value='482205' \/><input type='hidden' id='answerType482205' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482205[]' id='answer-id-1862726' class='answer   answerof-482205 ' value='1862726'   \/><label for='answer-id-1862726' id='answer-label-1862726' class=' answer'><span>Deploy Apache Kafka Connect in each AWS Region. Use custom-developed connectors to set up cross-Region data replication. Configure the SSL security protocol.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482205[]' id='answer-id-1862727' class='answer   answerof-482205 ' value='1862727'   \/><label for='answer-id-1862727' id='answer-label-1862727' class=' answer'><span>Use Amazon Managed Streaming for Apache Kafka (Amazon MSK) Replicator to establish fully interconnected replication relationships between MSK clusters in the three AWS Regions. Enable TLS encryption and IAM authentication. Set up cross-Region backup configurations.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482205[]' id='answer-id-1862728' class='answer   answerof-482205 ' value='1862728'   \/><label for='answer-id-1862728' id='answer-label-1862728' class=' answer'><span>Deploy Apache Kafka MirrorMaker 2.0 in each AWS Region. Set up custom replication policies to handle cross-Region data synchronization. Configure the SSL security protocol.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482205[]' id='answer-id-1862729' class='answer   answerof-482205 ' value='1862729'   \/><label for='answer-id-1862729' id='answer-label-1862729' class=' answer'><span>Use Amazon Kinesis Data Streams to receive trading data from each AWS Region. Use Amazon Data Firehose to replicate data between Amazon Managed Streaming for Apache Kafka (Amazon MSK) clusters in each Region. Configure AWS Key Management Service (AWS KMS) encryption and IAM roles to manage access.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-18' style=';'><div id='questionWrap-18'  class='   watupro-question-id-482206'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>18. <\/span>A data engineer wants to orchestrate a set of extract, transform, and load (ETL) jobs that run on AWS. The ETL jobs contain tasks that must run Apache Spark jobs on Amazon EMR, make API calls to Salesforce, and load data into Amazon Redshift. <br \/>\r<br>The ETL jobs need to handle failures and retries automatically. The data engineer needs to use Python to orchestrate the jobs. <br \/>\r<br>Which service will meet these requirements?<\/div><input type='hidden' name='question_id[]' id='qID_18' value='482206' \/><input type='hidden' id='answerType482206' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482206[]' id='answer-id-1862730' class='answer   answerof-482206 ' value='1862730'   \/><label for='answer-id-1862730' id='answer-label-1862730' class=' answer'><span>Amazon Managed Workflows for Apache Airflow (Amazon MWAA)<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482206[]' id='answer-id-1862731' class='answer   answerof-482206 ' value='1862731'   \/><label for='answer-id-1862731' id='answer-label-1862731' class=' answer'><span>AWS Step Functions<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482206[]' id='answer-id-1862732' class='answer   answerof-482206 ' value='1862732'   \/><label for='answer-id-1862732' id='answer-label-1862732' class=' answer'><span>AWS Glue<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482206[]' id='answer-id-1862733' class='answer   answerof-482206 ' value='1862733'   \/><label for='answer-id-1862733' id='answer-label-1862733' class=' answer'><span>Amazon EventBridge<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-19' style=';'><div id='questionWrap-19'  class='   watupro-question-id-482207'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>19. <\/span>A company needs to store semi-structured transactional data for an application in a database. The database must be serverless. The application writes the data infrequently, but it reads the data frequently. The application must retrieve the data within milliseconds. <br \/>\r<br>Which solution will meet these requirements with the LEAST operational overhead?<\/div><input type='hidden' name='question_id[]' id='qID_19' value='482207' \/><input type='hidden' id='answerType482207' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482207[]' id='answer-id-1862734' class='answer   answerof-482207 ' value='1862734'   \/><label for='answer-id-1862734' id='answer-label-1862734' class=' answer'><span>Store the data in an Amazon S3 Standard bucket. Enable S3 Transfer Acceleration.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482207[]' id='answer-id-1862735' class='answer   answerof-482207 ' value='1862735'   \/><label for='answer-id-1862735' id='answer-label-1862735' class=' answer'><span>Store the data in an Amazon S3 Apache Iceberg table. Enable S3 Transfer Acceleration.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482207[]' id='answer-id-1862736' class='answer   answerof-482207 ' value='1862736'   \/><label for='answer-id-1862736' id='answer-label-1862736' class=' answer'><span>Store the data in an Amazon RDS for MySQL cluster. Configure RDS Optimized Reads for the cluster.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482207[]' id='answer-id-1862737' class='answer   answerof-482207 ' value='1862737'   \/><label for='answer-id-1862737' id='answer-label-1862737' class=' answer'><span>Store the data in an Amazon DynamoDB table. Configure a DynamoDB Accelerator cache.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-20' style=';'><div id='questionWrap-20'  class='   watupro-question-id-482208'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>20. <\/span>A data engineer is configuring Amazon SageMaker Studio to use AWS Glue interactive sessions to prepare data for machine learning (ML) models. <br \/>\r<br>The data engineer receives an access denied error when the data engineer tries to prepare the data by using SageMaker Studio. <br \/>\r<br>Which change should the engineer make to gain access to SageMaker Studio?<\/div><input type='hidden' name='question_id[]' id='qID_20' value='482208' \/><input type='hidden' id='answerType482208' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482208[]' id='answer-id-1862738' class='answer   answerof-482208 ' value='1862738'   \/><label for='answer-id-1862738' id='answer-label-1862738' class=' answer'><span>Add the AWSGlueServiceRole managed policy to the data engineer's IAM user.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482208[]' id='answer-id-1862739' class='answer   answerof-482208 ' value='1862739'   \/><label for='answer-id-1862739' id='answer-label-1862739' class=' answer'><span>Add a policy to the data engineer's IAM user that includes the sts:AssumeRole action for the AWS Glue and SageMaker service principals in the trust policy.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482208[]' id='answer-id-1862740' class='answer   answerof-482208 ' value='1862740'   \/><label for='answer-id-1862740' id='answer-label-1862740' class=' answer'><span>Add the AmazonSageMakerFullAccess managed policy to the data engineer's IAM user.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482208[]' id='answer-id-1862741' class='answer   answerof-482208 ' value='1862741'   \/><label for='answer-id-1862741' id='answer-label-1862741' class=' answer'><span>Add a policy to the data engineer's IAM user that allows the sts:AddAssociation action for the AWS Glue and SageMaker service principals in the trust policy.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-21' style=';'><div id='questionWrap-21'  class='   watupro-question-id-482209'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>21. <\/span>A company that operates globally must follow regulations that require data from an AWS Region to be accessible only within that Region. <br \/>\r<br>A data engineer is creating a data pipeline that will create resources in the Region where the data engineer works. The data pipeline should have access to data only from the Region where the data engineer works. The pipeline uses Active Directory as an identity and authentication system. The pipeline uses a custom identity broker application to verify that employees are signed in to Active Directory and to obtain temporary credentials by using the AssumeRole API operation. <br \/>\r<br>Which solution will meet the locality requirements with the LEAST administrative effort?<\/div><input type='hidden' name='question_id[]' id='qID_21' value='482209' \/><input type='hidden' id='answerType482209' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482209[]' id='answer-id-1862742' class='answer   answerof-482209 ' value='1862742'   \/><label for='answer-id-1862742' id='answer-label-1862742' class=' answer'><span>Create an IAM role that has permissions to create resources. Create a policy for each Region that ensures users can create resources only in that Region. Pass the policy as the session policy when employees obtain the temporary credentials.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482209[]' id='answer-id-1862743' class='answer   answerof-482209 ' value='1862743'   \/><label for='answer-id-1862743' id='answer-label-1862743' class=' answer'><span>Create an IAM role for data engineers in each Region separately. Instruct each data engineer to obtain temporary credentials by assuming the appropriate Region-specific IAM role.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482209[]' id='answer-id-1862744' class='answer   answerof-482209 ' value='1862744'   \/><label for='answer-id-1862744' id='answer-label-1862744' class=' answer'><span>Create an IAM group for each Region. Include the required IAM policies for each IAM group. Add users to each IAM group so that when users log in by obtaining the temporary credentials, the users will receive the appropriate access based on the IAM group.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482209[]' id='answer-id-1862745' class='answer   answerof-482209 ' value='1862745'   \/><label for='answer-id-1862745' id='answer-label-1862745' class=' answer'><span>Create individual IAM policies that allow users to create resources in a specific Region. Assign the policies to each data engineer. Allow users to assume the individually assigned role when the users log in to AW<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-22' style=';'><div id='questionWrap-22'  class='   watupro-question-id-482210'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>22. <\/span>A company runs an AWS Glue workflow every day to process time series data from an Amazon S3 bucket. The workflow loads the data into an Amazon Redshift Serverless table. The company observes that some of the jobs in the workflow occasionally fail. <br \/>\r<br>A data engineer must receive a notification when the Redshift table does not contain the most recent data. <br \/>\r<br>Which solution will meet this requirement in the MOST operationally efficient way?<\/div><input type='hidden' name='question_id[]' id='qID_22' value='482210' \/><input type='hidden' id='answerType482210' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482210[]' id='answer-id-1862746' class='answer   answerof-482210 ' value='1862746'   \/><label for='answer-id-1862746' id='answer-label-1862746' class=' answer'><span>Configure an Amazon EventBridge Scheduler to run an Amazon Macie job to scan the Redshift table for data freshness. Configure Macie to notify an Amazon Simple Notification Service (Amazon SNS) topic when an AWS Glue job fails.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482210[]' id='answer-id-1862747' class='answer   answerof-482210 ' value='1862747'   \/><label for='answer-id-1862747' id='answer-label-1862747' class=' answer'><span>Schedule an AWS Glue Data Quality job to check the freshness of the data. Create an Amazon EventBridge rule to notify an Amazon Simple Notification Service (Amazon SNS) topic when a data quality rule fails.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482210[]' id='answer-id-1862748' class='answer   answerof-482210 ' value='1862748'   \/><label for='answer-id-1862748' id='answer-label-1862748' class=' answer'><span>Load AWS Glue job logs to an Amazon S3 bucket. Configure an Amazon CloudWatch alarm to send a notification when the job logs in the S3 bucket contain Job.State=FAILE<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482210[]' id='answer-id-1862749' class='answer   answerof-482210 ' value='1862749'   \/><label for='answer-id-1862749' id='answer-label-1862749' class=' answer'><span>Create an Amazon CloudWatch dashboard that displays a metric named Failed AWS Glue Jobs that counts AWS Glue job failures during the previous day. Set a CloudWatch alarm to send a notification when the metric value exceeds zero.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-23' style=';'><div id='questionWrap-23'  class='   watupro-question-id-482211'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>23. <\/span>A data engineer is designing a new data lake architecture for a company. The data engineer plans to use Apache Iceberg tables and AWS Glue Data Catalog to achieve fast query performance and enhanced metadata handling. The data engineer needs to query historical data for trend analysis and optimize storage costs for a large volume of event data. <br \/>\r<br>Which solution will meet these requirements with the LEAST development effort?<\/div><input type='hidden' name='question_id[]' id='qID_23' value='482211' \/><input type='hidden' id='answerType482211' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482211[]' id='answer-id-1862750' class='answer   answerof-482211 ' value='1862750'   \/><label for='answer-id-1862750' id='answer-label-1862750' class=' answer'><span>Store Iceberg table data files in Amazon S3 Intelligent-Tiering.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482211[]' id='answer-id-1862751' class='answer   answerof-482211 ' value='1862751'   \/><label for='answer-id-1862751' id='answer-label-1862751' class=' answer'><span>Define partitioning schemes based on event type and event date.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482211[]' id='answer-id-1862752' class='answer   answerof-482211 ' value='1862752'   \/><label for='answer-id-1862752' id='answer-label-1862752' class=' answer'><span>Use AWS Glue Data Catalog to automatically optimize Iceberg storage.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482211[]' id='answer-id-1862753' class='answer   answerof-482211 ' value='1862753'   \/><label for='answer-id-1862753' id='answer-label-1862753' class=' answer'><span>Run a custom AWS Glue job to compact Iceberg table data files.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-24' style=';'><div id='questionWrap-24'  class='   watupro-question-id-482212'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>24. <\/span>A car sales company maintains data about cars that are listed for sale in an area. The company receives data about new car listings from vendors who upload the data daily as compressed files into Amazon S3. The compressed files are up to 5 KB in size. The company wants to see the most up-to-date listings as soon as the data is uploaded to Amazon S3. <br \/>\r<br>A data engineer must automate and orchestrate the data processing workflow of the listings to feed a dashboard. The data engineer must also provide the ability to perform one-time queries and analytical reporting. The query solution must be scalable. <br \/>\r<br>Which solution will meet these requirements MOST cost-effectively?<\/div><input type='hidden' name='question_id[]' id='qID_24' value='482212' \/><input type='hidden' id='answerType482212' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482212[]' id='answer-id-1862754' class='answer   answerof-482212 ' value='1862754'   \/><label for='answer-id-1862754' id='answer-label-1862754' class=' answer'><span>Use an Amazon EMR cluster to process incoming data. Use AWS Step Functions to orchestrate workflows. Use Apache Hive for one-time queries and analytical reporting. Use Amazon OpenSearch Service to bulk ingest the data into compute optimized instances. Use OpenSearch Dashboards in OpenSearch Service for the dashboard.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482212[]' id='answer-id-1862755' class='answer   answerof-482212 ' value='1862755'   \/><label for='answer-id-1862755' id='answer-label-1862755' class=' answer'><span>Use a provisioned Amazon EMR cluster to process incoming data. Use AWS Step Functions to orchestrate workflows. Use Amazon Athena for one-time queries and analytical reporting. Use Amazon QuickSight for the dashboard.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482212[]' id='answer-id-1862756' class='answer   answerof-482212 ' value='1862756'   \/><label for='answer-id-1862756' id='answer-label-1862756' class=' answer'><span>Use AWS Glue to process incoming data. Use AWS Step Functions to orchestrate workflows. Use Amazon Redshift Spectrum for one-time queries and analytical reporting. Use OpenSearch Dashboards in Amazon OpenSearch Service for the dashboard.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482212[]' id='answer-id-1862757' class='answer   answerof-482212 ' value='1862757'   \/><label for='answer-id-1862757' id='answer-label-1862757' class=' answer'><span>Use AWS Glue to process incoming data. Use AWS Lambda and S3 Event Notifications to orchestrate workflows. Use Amazon Athena for one-time queries and analytical reporting. Use Amazon QuickSight for the dashboard.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-25' style=';'><div id='questionWrap-25'  class='   watupro-question-id-482213'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>25. <\/span>A data engineer needs to query data from multiple sources to generate an annual report. The analytics team uses Amazon Redshift for analysis. The data engineer needs to integrate Amazon Redshift data with 10 years of historical data from Amazon RDS for PostgreSQL and RDS for MySQL. All the databases are in the same VPC. The data engineer needs a solution that provides seamless data integration with Amazon Redshift. <br \/>\r<br>Which solution will meet these requirements in the MOST cost-effective way?<\/div><input type='hidden' name='question_id[]' id='qID_25' value='482213' \/><input type='hidden' id='answerType482213' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482213[]' id='answer-id-1862758' class='answer   answerof-482213 ' value='1862758'   \/><label for='answer-id-1862758' id='answer-label-1862758' class=' answer'><span>Use federated queries in Amazon Redshift to fetch data from RDS for PostgreSQL and RDS for MySQ<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482213[]' id='answer-id-1862759' class='answer   answerof-482213 ' value='1862759'   \/><label for='answer-id-1862759' id='answer-label-1862759' class=' answer'><span>Apply the necessary transformations within Amazon Redshift.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482213[]' id='answer-id-1862760' class='answer   answerof-482213 ' value='1862760'   \/><label for='answer-id-1862760' id='answer-label-1862760' class=' answer'><span>Use the SELECT INTO OUTFILE S3 statement to export data from Amazon RDS to Amazon S3. Use the COPY command to load the data into Amazon Redshift.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482213[]' id='answer-id-1862761' class='answer   answerof-482213 ' value='1862761'   \/><label for='answer-id-1862761' id='answer-label-1862761' class=' answer'><span>Create a visual extract, transform, and load (ETL) job in AWS Glue to extract the required data and load it to Amazon Redshift.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482213[]' id='answer-id-1862762' class='answer   answerof-482213 ' value='1862762'   \/><label for='answer-id-1862762' id='answer-label-1862762' class=' answer'><span>Use AWS Database Migration Service (AWS DMS) to ingest data from RDS for PostgreSQL and RDS for MySQ<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482213[]' id='answer-id-1862763' class='answer   answerof-482213 ' value='1862763'   \/><label for='answer-id-1862763' id='answer-label-1862763' class=' answer'><span>Implement the necessary transformations within Amazon Redshift.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-26' style=';'><div id='questionWrap-26'  class='   watupro-question-id-482214'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>26. <\/span>A data engineer needs to create an Amazon Athena table based on a subset of data from an existing Athena table named cities_world. The cities_world table contains cities that are located around the world. The data engineer must create a new table named cities_us to contain only the cities from cities_world that are located in the US. <br \/>\r<br><br><img decoding=\"async\" width=649 height=244 id=\"\u56fe\u7247 6\" src=\"https:\/\/www.dumpsbase.com\/freedumps\/wp-content\/uploads\/2026\/05\/image002-24.jpg\"><br><\/div><input type='hidden' name='question_id[]' id='qID_26' value='482214' \/><input type='hidden' id='answerType482214' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482214[]' id='answer-id-1862764' class='answer   answerof-482214 ' value='1862764'   \/><label for='answer-id-1862764' id='answer-label-1862764' class=' answer'><span>Option A<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482214[]' id='answer-id-1862765' class='answer   answerof-482214 ' value='1862765'   \/><label for='answer-id-1862765' id='answer-label-1862765' class=' answer'><span>Option B<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482214[]' id='answer-id-1862766' class='answer   answerof-482214 ' value='1862766'   \/><label for='answer-id-1862766' id='answer-label-1862766' class=' answer'><span>Option C<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482214[]' id='answer-id-1862767' class='answer   answerof-482214 ' value='1862767'   \/><label for='answer-id-1862767' id='answer-label-1862767' class=' answer'><span>Option D<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-27' style=';'><div id='questionWrap-27'  class='   watupro-question-id-482215'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>27. <\/span>A company has multiple applications that use datasets that are stored in an Amazon S3 bucket. The company has an ecommerce application that generates a dataset that contains personally identifiable information (PII). The company has an internal analytics application that does not require access to the PII. <br \/>\r<br>To comply with regulations, the company must not share PII unnecessarily. A data engineer needs to implement a solution that with redact PII dynamically, based on the needs of each application that accesses the dataset. <br \/>\r<br>Which solution will meet the requirements with the LEAST operational overhead?<\/div><input type='hidden' name='question_id[]' id='qID_27' value='482215' \/><input type='hidden' id='answerType482215' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482215[]' id='answer-id-1862768' class='answer   answerof-482215 ' value='1862768'   \/><label for='answer-id-1862768' id='answer-label-1862768' class=' answer'><span>Create an S3 bucket policy to limit the access each application has. Create multiple copies of the dataset. Give each dataset copy the appropriate level of redaction for the needs of the application that accesses the copy.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482215[]' id='answer-id-1862769' class='answer   answerof-482215 ' value='1862769'   \/><label for='answer-id-1862769' id='answer-label-1862769' class=' answer'><span>Create an S3 Object Lambda endpoint. Use the S3 Object Lambda endpoint to read data from the S3 bucket. Implement redaction logic within an S3 Object Lambda function to dynamically redact PII based on the needs of each application that accesses the data.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482215[]' id='answer-id-1862770' class='answer   answerof-482215 ' value='1862770'   \/><label for='answer-id-1862770' id='answer-label-1862770' class=' answer'><span>Use AWS Glue to transform the data for each application. Create multiple copies of the dataset. Give each dataset copy the appropriate level of redaction for the needs of the application that accesses the copy.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482215[]' id='answer-id-1862771' class='answer   answerof-482215 ' value='1862771'   \/><label for='answer-id-1862771' id='answer-label-1862771' class=' answer'><span>Create an API Gateway endpoint that has custom authorizers. Use the API Gateway endpoint to read data from the S3 bucket. Initiate a REST API call to dynamically redact PII based on the needs of each application that accesses the data.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-28' style=';'><div id='questionWrap-28'  class='   watupro-question-id-482216'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>28. <\/span>A data engineer needs to make tabular data available in an Amazon S3Cbased data lake. Users must be able to query the data by using SQL queries in Amazon Redshift, Amazon Athena, and Amazon EMR. The data is updated daily. The data engineer must ensure that updates and deletions are reflected in the data lake. <br \/>\r<br>Which solution will meet these requirements with the LEAST operational overhead?<\/div><input type='hidden' name='question_id[]' id='qID_28' value='482216' \/><input type='hidden' id='answerType482216' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482216[]' id='answer-id-1862772' class='answer   answerof-482216 ' value='1862772'   \/><label for='answer-id-1862772' id='answer-label-1862772' class=' answer'><span>Store the data in S3 Standard. Configure Apache Hudi with merge-on-read in Amazon EM<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482216[]' id='answer-id-1862773' class='answer   answerof-482216 ' value='1862773'   \/><label for='answer-id-1862773' id='answer-label-1862773' class=' answer'><span>Use Apache Spark SQL in Amazon EMR to perform the daily updates and deletions. Use Amazon EMR to schedule compaction jobs. Use AWS Glue to create a data catalog of Hudi tables that are stored in Amazon S3.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482216[]' id='answer-id-1862774' class='answer   answerof-482216 ' value='1862774'   \/><label for='answer-id-1862774' id='answer-label-1862774' class=' answer'><span>Create S3 tables for the tabular data. Use AWS Glue and an S3 tables catalog for Apache Iceberg JAR to perform the daily updates and deletions. Configure a compaction size target. Set up snapshot management and unreferenced file removal for the S3 tables bucket.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482216[]' id='answer-id-1862775' class='answer   answerof-482216 ' value='1862775'   \/><label for='answer-id-1862775' id='answer-label-1862775' class=' answer'><span>Load the data into an Amazon Redshift cluster. Use SQL to perform the daily updates and deletions. Upload the data to an Amazon S3 bucket in Apache Parquet format to create the data lake.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482216[]' id='answer-id-1862776' class='answer   answerof-482216 ' value='1862776'   \/><label for='answer-id-1862776' id='answer-label-1862776' class=' answer'><span>Load the data into an Amazon EMR cluster. Use Apache Spark to perform the daily updates and deletions. Upload the data into an Amazon S3 bucket in Apache Parquet format to create the data \r\nlake.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-29' style=';'><div id='questionWrap-29'  class='   watupro-question-id-482217'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>29. <\/span>A data engineer uses AWS Lake Formation to manage access to data that is stored in an Amazon S3 bucket. The data engineer configures an AWS Glue crawler to discover data at a specific file location in the bucket, s3:\/\/examplepath. The crawler execution fails with the following error: <br \/>\r<br>&quot;The S3 location: s3:\/\/examplepath is not registered.&quot; <br \/>\r<br>The data engineer needs to resolve the error.<\/div><input type='hidden' name='question_id[]' id='qID_29' value='482217' \/><input type='hidden' id='answerType482217' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482217[]' id='answer-id-1862777' class='answer   answerof-482217 ' value='1862777'   \/><label for='answer-id-1862777' id='answer-label-1862777' class=' answer'><span>Attach an appropriate IAM policy to the IAM role of the AWS Glue crawler to grant the crawler permission to read the S3 location.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482217[]' id='answer-id-1862778' class='answer   answerof-482217 ' value='1862778'   \/><label for='answer-id-1862778' id='answer-label-1862778' class=' answer'><span>Register the S3 location in Lake Formation to allow the crawler to access the data.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482217[]' id='answer-id-1862779' class='answer   answerof-482217 ' value='1862779'   \/><label for='answer-id-1862779' id='answer-label-1862779' class=' answer'><span>Create a new AWS Glue database. Assign the correct permissions to the database for the crawler.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482217[]' id='answer-id-1862780' class='answer   answerof-482217 ' value='1862780'   \/><label for='answer-id-1862780' id='answer-label-1862780' class=' answer'><span>Configure the S3 bucket policy to allow cross-account access.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-30' style=';'><div id='questionWrap-30'  class='   watupro-question-id-482218'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>30. <\/span>A company maintains multiple extract, transform, and load (ETL) workflows that ingest data from the company's operational databases into an Amazon S3 based data lake. The ETL workflows use AWS Glue and Amazon EMR to process data. <br \/>\r<br>The company wants to improve the existing architecture to provide automated orchestration and to require minimal manual effort. <br \/>\r<br>Which solution will meet these requirements with the LEAST operational overhead?<\/div><input type='hidden' name='question_id[]' id='qID_30' value='482218' \/><input type='hidden' id='answerType482218' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482218[]' id='answer-id-1862781' class='answer   answerof-482218 ' value='1862781'   \/><label for='answer-id-1862781' id='answer-label-1862781' class=' answer'><span>AWS Glue workflows<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482218[]' id='answer-id-1862782' class='answer   answerof-482218 ' value='1862782'   \/><label for='answer-id-1862782' id='answer-label-1862782' class=' answer'><span>AWS Step Functions tasks<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482218[]' id='answer-id-1862783' class='answer   answerof-482218 ' value='1862783'   \/><label for='answer-id-1862783' id='answer-label-1862783' class=' answer'><span>AWS Lambda functions<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482218[]' id='answer-id-1862784' class='answer   answerof-482218 ' value='1862784'   \/><label for='answer-id-1862784' id='answer-label-1862784' class=' answer'><span>Amazon Managed Workflows for Apache Airflow (Amazon MWAA) workflows<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-31' style=';'><div id='questionWrap-31'  class='   watupro-question-id-482219'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>31. <\/span>A company wants to use Apache Spark jobs that run on an Amazon EMR cluster to process streaming data. The Spark jobs will transform and store the data in an Amazon S3 bucket. The company will use Amazon Athena to perform analysis. <br \/>\r<br>The company needs to optimize the data format for analytical queries. <br \/>\r<br>Which solutions will meet these requirements with the SHORTEST query times? (Select TWO.)<\/div><input type='hidden' name='question_id[]' id='qID_31' value='482219' \/><input type='hidden' id='answerType482219' value='checkbox'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-482219[]' id='answer-id-1862785' class='answer   answerof-482219 ' value='1862785'   \/><label for='answer-id-1862785' id='answer-label-1862785' class=' answer'><span>Use Avro format. Use AWS Glue Data Catalog to track schema changes.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-482219[]' id='answer-id-1862786' class='answer   answerof-482219 ' value='1862786'   \/><label for='answer-id-1862786' id='answer-label-1862786' class=' answer'><span>Use ORC format. Use AWS Glue Data Catalog to track schema changes.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-482219[]' id='answer-id-1862787' class='answer   answerof-482219 ' value='1862787'   \/><label for='answer-id-1862787' id='answer-label-1862787' class=' answer'><span>Use Apache Parquet format. Use an external Amazon DynamoDB table to track schema changes.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-482219[]' id='answer-id-1862788' class='answer   answerof-482219 ' value='1862788'   \/><label for='answer-id-1862788' id='answer-label-1862788' class=' answer'><span>Use Apache Parquet format. Use AWS Glue Data Catalog to track schema changes.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-482219[]' id='answer-id-1862789' class='answer   answerof-482219 ' value='1862789'   \/><label for='answer-id-1862789' id='answer-label-1862789' class=' answer'><span>Use ORC format. Store schema definitions in separate files in Amazon S3.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-32' style=';'><div id='questionWrap-32'  class='   watupro-question-id-482220'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>32. <\/span>A company maintains a data warehouse in an on-premises Oracle database. The company wants to build a data lake on AWS. The company wants to load data warehouse tables into Amazon S3 and synchronize the tables with incremental data that arrives from the data warehouse every day. <br \/>\r<br>Each table has a column that contains monotonically increasing values. The size of each table is less than 50 GB. The data warehouse tables are refreshed every night between 1 AM and 2 AM. A business intelligence team queries the tables between 10 AM and 8 PM every day. <br \/>\r<br>Which solution will meet these requirements in the MOST operationally efficient way?<\/div><input type='hidden' name='question_id[]' id='qID_32' value='482220' \/><input type='hidden' id='answerType482220' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482220[]' id='answer-id-1862790' class='answer   answerof-482220 ' value='1862790'   \/><label for='answer-id-1862790' id='answer-label-1862790' class=' answer'><span>Use an AWS Database Migration Service (AWS DMS) full load plus CDC job to load tables that contain monotonically increasing data columns from the on-premises data warehouse to Amazon S3. Use custom logic in AWS Glue to append the daily incremental data to a full-load copy that is in Amazon S3.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482220[]' id='answer-id-1862791' class='answer   answerof-482220 ' value='1862791'   \/><label for='answer-id-1862791' id='answer-label-1862791' class=' answer'><span>Use an AWS Glue Java Database Connectivity (JDBC) connection. Configure a job bookmark for a column that contains monotonically increasing values. Write custom logic to append the daily incremental data to a full-load copy that is in Amazon S3.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482220[]' id='answer-id-1862792' class='answer   answerof-482220 ' value='1862792'   \/><label for='answer-id-1862792' id='answer-label-1862792' class=' answer'><span>Use an AWS Database Migration Service (AWS DMS) full load migration to load the data warehouse tables into Amazon S3 every day Overwrite the previous day's full-load copy every day.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482220[]' id='answer-id-1862793' class='answer   answerof-482220 ' value='1862793'   \/><label for='answer-id-1862793' id='answer-label-1862793' class=' answer'><span>Use AWS Glue to load a full copy of the data warehouse tables into Amazon S3 every day. \r\nOverwrite the previous day's full-load copy every day.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-33' style=';'><div id='questionWrap-33'  class='   watupro-question-id-482221'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>33. <\/span>A company uses an Amazon QuickSight dashboard to monitor usage of one of the company's applications. The company uses AWS Glue jobs to process data for the dashboard. The company stores the data in a single Amazon S3 bucket. The company adds new data every day. <br \/>\r<br>A data engineer discovers that dashboard queries are becoming slower over time. The data engineer determines that the root cause of the slowing queries is long-running AWS Glue jobs. <br \/>\r<br>Which actions should the data engineer take to improve the performance of the AWS Glue jobs? (Choose two.)<\/div><input type='hidden' name='question_id[]' id='qID_33' value='482221' \/><input type='hidden' id='answerType482221' value='checkbox'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-482221[]' id='answer-id-1862794' class='answer   answerof-482221 ' value='1862794'   \/><label for='answer-id-1862794' id='answer-label-1862794' class=' answer'><span>Partition the data that is in the S3 bucket. Organize the data by year, month, and day.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-482221[]' id='answer-id-1862795' class='answer   answerof-482221 ' value='1862795'   \/><label for='answer-id-1862795' id='answer-label-1862795' class=' answer'><span>Increase the AWS Glue instance size by scaling up the worker type.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-482221[]' id='answer-id-1862796' class='answer   answerof-482221 ' value='1862796'   \/><label for='answer-id-1862796' id='answer-label-1862796' class=' answer'><span>Convert the AWS Glue schema to the DynamicFrame schema class.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-482221[]' id='answer-id-1862797' class='answer   answerof-482221 ' value='1862797'   \/><label for='answer-id-1862797' id='answer-label-1862797' class=' answer'><span>Adjust AWS Glue job scheduling frequency so the jobs run half as many times each day.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-482221[]' id='answer-id-1862798' class='answer   answerof-482221 ' value='1862798'   \/><label for='answer-id-1862798' id='answer-label-1862798' class=' answer'><span>Modify the 1AM role that grants access to AWS glue to grant access to all S3 features.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-34' style=';'><div id='questionWrap-34'  class='   watupro-question-id-482222'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>34. <\/span>A company stores a large dataset in an Amazon S3 bucket. A data engineer frequently runs complex queries on the dataset by using Amazon Athena. The data engineer needs to optimize query performance and optimize costs for queries that are run multiple times with the same parameters. <br \/>\r<br>Which solution will meet these requirements?<\/div><input type='hidden' name='question_id[]' id='qID_34' value='482222' \/><input type='hidden' id='answerType482222' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482222[]' id='answer-id-1862799' class='answer   answerof-482222 ' value='1862799'   \/><label for='answer-id-1862799' id='answer-label-1862799' class=' answer'><span>Convert the dataset to JSON format before running Athena queries.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482222[]' id='answer-id-1862800' class='answer   answerof-482222 ' value='1862800'   \/><label for='answer-id-1862800' id='answer-label-1862800' class=' answer'><span>Use Amazon EMR to pre-process the data before running Athena queries.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482222[]' id='answer-id-1862801' class='answer   answerof-482222 ' value='1862801'   \/><label for='answer-id-1862801' id='answer-label-1862801' class=' answer'><span>Configure query result reuse settings in the Athena workgroup.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482222[]' id='answer-id-1862802' class='answer   answerof-482222 ' value='1862802'   \/><label for='answer-id-1862802' id='answer-label-1862802' class=' answer'><span>Use Amazon Redshift Spectrum to query the data in Amazon S3.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-35' style=';'><div id='questionWrap-35'  class='   watupro-question-id-482223'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>35. <\/span>A data engineer must manage the ingestion of real-time streaming data into AWS. The data engineer wants to perform real-time analytics on the incoming streaming data by using time-based aggregations over a window of up to 30 minutes. The data engineer needs a solution that is highly fault tolerant. <br \/>\r<br>Which solution will meet these requirements with the LEAST operational overhead?<\/div><input type='hidden' name='question_id[]' id='qID_35' value='482223' \/><input type='hidden' id='answerType482223' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482223[]' id='answer-id-1862803' class='answer   answerof-482223 ' value='1862803'   \/><label for='answer-id-1862803' id='answer-label-1862803' class=' answer'><span>Use an AWS Lambda function that includes both the business and the analytics logic to perform time-based aggregations over a window of up to 30 minutes for the data in Amazon Kinesis Data Streams.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482223[]' id='answer-id-1862804' class='answer   answerof-482223 ' value='1862804'   \/><label for='answer-id-1862804' id='answer-label-1862804' class=' answer'><span>Use Amazon Managed Service for Apache Flink (previously known as Amazon Kinesis Data Analytics) to analyze the data that might occasionally contain duplicates by using multiple types of aggregations.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482223[]' id='answer-id-1862805' class='answer   answerof-482223 ' value='1862805'   \/><label for='answer-id-1862805' id='answer-label-1862805' class=' answer'><span>Use an AWS Lambda function that includes both the business and the analytics logic to perform aggregations for a tumbling window of up to 30 minutes, based on the event timestamp.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482223[]' id='answer-id-1862806' class='answer   answerof-482223 ' value='1862806'   \/><label for='answer-id-1862806' id='answer-label-1862806' class=' answer'><span>Use Amazon Managed Service for Apache Flink (previously known as Amazon Kinesis Data Analytics) to analyze the data by using multiple types of aggregations to perform time-based analytics over a window of up to 30 minutes.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-36' style=';'><div id='questionWrap-36'  class='   watupro-question-id-482224'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>36. <\/span>A company uses a variety of AWS and third-party data stores. The company wants to consolidate all the data into a central data warehouse to perform analytics. Users need fast response times for analytics queries. <br \/>\r<br>The company uses Amazon QuickSight in direct query mode to visualize the data. Users normally run queries during a few hours each day with unpredictable spikes. <br \/>\r<br>Which solution will meet these requirements with the LEAST operational overhead?<\/div><input type='hidden' name='question_id[]' id='qID_36' value='482224' \/><input type='hidden' id='answerType482224' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482224[]' id='answer-id-1862807' class='answer   answerof-482224 ' value='1862807'   \/><label for='answer-id-1862807' id='answer-label-1862807' class=' answer'><span>Use Amazon Redshift Serverless to load all the data into Amazon Redshift managed storage (RMS).<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482224[]' id='answer-id-1862808' class='answer   answerof-482224 ' value='1862808'   \/><label for='answer-id-1862808' id='answer-label-1862808' class=' answer'><span>Use Amazon Athena to load all the data into Amazon S3 in Apache Parquet format.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482224[]' id='answer-id-1862809' class='answer   answerof-482224 ' value='1862809'   \/><label for='answer-id-1862809' id='answer-label-1862809' class=' answer'><span>Use Amazon Redshift provisioned clusters to load all the data into Amazon Redshift managed storage (RMS).<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482224[]' id='answer-id-1862810' class='answer   answerof-482224 ' value='1862810'   \/><label for='answer-id-1862810' id='answer-label-1862810' class=' answer'><span>Use Amazon Aurora PostgreSQL to load all the data into Aurora.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-37' style=';'><div id='questionWrap-37'  class='   watupro-question-id-482225'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>37. <\/span>A data engineer needs Amazon Athena queries to finish faster. The data engineer notices that all the files the Athena queries use are currently stored in uncompressed .csv format. The data engineer also notices that users perform most queries by selecting a specific column. <br \/>\r<br>Which solution will MOST speed up the Athena query performance?<\/div><input type='hidden' name='question_id[]' id='qID_37' value='482225' \/><input type='hidden' id='answerType482225' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482225[]' id='answer-id-1862811' class='answer   answerof-482225 ' value='1862811'   \/><label for='answer-id-1862811' id='answer-label-1862811' class=' answer'><span>Change the data format from .csvto JSON format. Apply Snappy compression.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482225[]' id='answer-id-1862812' class='answer   answerof-482225 ' value='1862812'   \/><label for='answer-id-1862812' id='answer-label-1862812' class=' answer'><span>Compress the .csv files by using Snappy compression.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482225[]' id='answer-id-1862813' class='answer   answerof-482225 ' value='1862813'   \/><label for='answer-id-1862813' id='answer-label-1862813' class=' answer'><span>Change the data format from .csvto Apache Parquet. Apply Snappy compression.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482225[]' id='answer-id-1862814' class='answer   answerof-482225 ' value='1862814'   \/><label for='answer-id-1862814' id='answer-label-1862814' class=' answer'><span>Compress the .csv files by using gzjg compression.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-38' style=';'><div id='questionWrap-38'  class='   watupro-question-id-482226'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>38. <\/span>A hotel management company receives daily data files from each of its hotels. The company wants to upload its data to AWS. The company plans to use Amazon Athena to access the files. The company needs to protect the files from accidental deletion. The company will develop an application on its on-premises servers to automatically forward the files to a fully managed AWS ingestion service. <br \/>\r<br>Which solution will meet these requirements with the LEAST operational overhead?<\/div><input type='hidden' name='question_id[]' id='qID_38' value='482226' \/><input type='hidden' id='answerType482226' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482226[]' id='answer-id-1862815' class='answer   answerof-482226 ' value='1862815'   \/><label for='answer-id-1862815' id='answer-label-1862815' class=' answer'><span>Use AWS DataSync to replicate data from the on-premises servers to Amazon Elastic File System (Amazon EFS). Configure automatic backups in AWS Backup.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482226[]' id='answer-id-1862816' class='answer   answerof-482226 ' value='1862816'   \/><label for='answer-id-1862816' id='answer-label-1862816' class=' answer'><span>Use the Amazon Kinesis Agent on the on-premises servers to send data to Amazon Data Firehose. \r\nStore the data in an Amazon S3 bucket that has versioning enabled.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482226[]' id='answer-id-1862817' class='answer   answerof-482226 ' value='1862817'   \/><label for='answer-id-1862817' id='answer-label-1862817' class=' answer'><span>Use AWS Glue jobs to ingest data from the on-premises servers into Amazon RD<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482226[]' id='answer-id-1862818' class='answer   answerof-482226 ' value='1862818'   \/><label for='answer-id-1862818' id='answer-label-1862818' class=' answer'><span>Enable automated backups for data protection.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482226[]' id='answer-id-1862819' class='answer   answerof-482226 ' value='1862819'   \/><label for='answer-id-1862819' id='answer-label-1862819' class=' answer'><span>Use a self-managed Apache Kafka agent on the on-premises servers to stream data to Amazon Managed Streaming for Apache Kafka (Amazon MSK). Store the data in an Amazon S3 bucket with versioning enabled.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-39' style=';'><div id='questionWrap-39'  class='   watupro-question-id-482227'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>39. <\/span>A company has used an Amazon Redshift table that is named Orders for 6 months. The company performs weekly updates and deletes on the table. The table has an interleaved sort key on a column that contains AWS Regions.<br \/>\r\n<br \/>\r\nThe company wants to reclaim disk space so that the company will not run out of storage space. The company also wants to analyze the sort key column.<br \/>\r\n<br \/>\r\nWhich Amazon Redshift command will meet these requirements?<\/div><input type='hidden' name='question_id[]' id='qID_39' value='482227' \/><input type='hidden' id='answerType482227' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482227[]' id='answer-id-1862820' class='answer   answerof-482227 ' value='1862820'   \/><label for='answer-id-1862820' id='answer-label-1862820' class=' answer'><span>VACUUM FULL Orders<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482227[]' id='answer-id-1881644' class='answer   answerof-482227 ' value='1881644'   \/><label for='answer-id-1881644' id='answer-label-1881644' class=' answer'><span>VACUUM DELETE ONLY Orders<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482227[]' id='answer-id-1881645' class='answer   answerof-482227 ' value='1881645'   \/><label for='answer-id-1881645' id='answer-label-1881645' class=' answer'><span>VACUUM REINDEX Orders<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482227[]' id='answer-id-1881646' class='answer   answerof-482227 ' value='1881646'   \/><label for='answer-id-1881646' id='answer-label-1881646' class=' answer'><span>VACUUM SORT ONLY Orders<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-40' style=';'><div id='questionWrap-40'  class='   watupro-question-id-482228'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>40. <\/span>A company wants to ingest streaming data into an Amazon Redshift data warehouse from an Amazon Managed Streaming for Apache Kafka (Amazon MSK) cluster. A data engineer needs to develop a solution that provides low data access time and that optimizes storage costs. <br \/>\r<br>Which solution will meet these requirements with the LEAST operational overhead?<\/div><input type='hidden' name='question_id[]' id='qID_40' value='482228' \/><input type='hidden' id='answerType482228' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482228[]' id='answer-id-1862821' class='answer   answerof-482228 ' value='1862821'   \/><label for='answer-id-1862821' id='answer-label-1862821' class=' answer'><span>Create an external schema that maps to the MSK cluster. Create a materialized view that references the external schema to consume the streaming data from the MSK topic.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482228[]' id='answer-id-1862822' class='answer   answerof-482228 ' value='1862822'   \/><label for='answer-id-1862822' id='answer-label-1862822' class=' answer'><span>Develop an AWS Glue streaming extract, transform, and load (ETL) job to process the incoming data from Amazon MS<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482228[]' id='answer-id-1862823' class='answer   answerof-482228 ' value='1862823'   \/><label for='answer-id-1862823' id='answer-label-1862823' class=' answer'><span>Load the data into Amazon S3. Use Amazon Redshift Spectrum to read the data from Amazon S3.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482228[]' id='answer-id-1862824' class='answer   answerof-482228 ' value='1862824'   \/><label for='answer-id-1862824' id='answer-label-1862824' class=' answer'><span>Create an external schema that maps to the streaming data source. Create a new Amazon Redshift table that references the external schema.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482228[]' id='answer-id-1862825' class='answer   answerof-482228 ' value='1862825'   \/><label for='answer-id-1862825' id='answer-label-1862825' class=' answer'><span>Create an Amazon S3 bucket. Ingest the data from Amazon MS<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-482228[]' id='answer-id-1862826' class='answer   answerof-482228 ' value='1862826'   \/><label for='answer-id-1862826' id='answer-label-1862826' class=' answer'><span>Create an event-driven AWS Lambda function to load the data from the S3 bucket to a new Amazon Redshift table.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div style='display:none' id='question-41'>\n\t<div class='question-content'>\n\t\t<img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/www.dumpsbase.com\/freedumps\/wp-content\/plugins\/watupro\/img\/loading.gif\" width=\"16\" height=\"16\" alt=\"Loading...\" title=\"Loading...\" \/>&nbsp;Loading...\t<\/div>\n<\/div>\n\n<br \/>\n\t\n\t\t\t<div class=\"watupro_buttons flex \" id=\"watuPROButtons12348\" >\n\t\t  <div id=\"prev-question\" style=\"display:none;\"><input type=\"button\" value=\"&lt; Previous\" onclick=\"WatuPRO.nextQuestion(event, 'previous');\"\/><\/div>\t\t  \t\t  \t\t   \n\t\t   \t  \t\t<div><input type=\"button\" name=\"action\" class=\"watupro-submit-button\" onclick=\"WatuPRO.submitResult(event)\" id=\"action-button\" value=\"View Results\"  \/>\n\t\t<\/div>\n\t\t<\/div>\n\t\t\n\t<input type=\"hidden\" name=\"quiz_id\" value=\"12348\" id=\"watuPROExamID\"\/>\n\t<input type=\"hidden\" name=\"start_time\" id=\"startTime\" value=\"2026-06-08 11:15:24\" \/>\n\t<input type=\"hidden\" name=\"start_timestamp\" id=\"startTimeStamp\" value=\"1780917324\" \/>\n\t<input type=\"hidden\" name=\"question_ids\" value=\"\" \/>\n\t<input type=\"hidden\" name=\"watupro_questions\" value=\"482189:1862659,1862660,1862661,1862662 | 482190:1862663,1862664,1862665,1862666,1862667 | 482191:1862668,1862669,1862670,1862671 | 482192:1862672,1862673,1862674,1862675 | 482193:1862676,1862677,1862678,1862679 | 482194:1862680,1862681,1862682,1862683 | 482195:1862684,1862685,1862686,1862687 | 482196:1862688,1862689,1862690,1862691 | 482197:1862692,1862693,1862694,1862695,1862696 | 482198:1862697,1862698,1862699,1862700 | 482199:1862701,1862702,1862703,1862704 | 482200:1862705,1862706,1862707,1862708,1862709 | 482201:1862710,1862711,1862712,1862713 | 482202:1862714,1862715,1862716,1862717 | 482203:1862718,1862719,1862720,1862721 | 482204:1862722,1862723,1862724,1862725 | 482205:1862726,1862727,1862728,1862729 | 482206:1862730,1862731,1862732,1862733 | 482207:1862734,1862735,1862736,1862737 | 482208:1862738,1862739,1862740,1862741 | 482209:1862742,1862743,1862744,1862745 | 482210:1862746,1862747,1862748,1862749 | 482211:1862750,1862751,1862752,1862753 | 482212:1862754,1862755,1862756,1862757 | 482213:1862758,1862759,1862760,1862761,1862762,1862763 | 482214:1862764,1862765,1862766,1862767 | 482215:1862768,1862769,1862770,1862771 | 482216:1862772,1862773,1862774,1862775,1862776 | 482217:1862777,1862778,1862779,1862780 | 482218:1862781,1862782,1862783,1862784 | 482219:1862785,1862786,1862787,1862788,1862789 | 482220:1862790,1862791,1862792,1862793 | 482221:1862794,1862795,1862796,1862797,1862798 | 482222:1862799,1862800,1862801,1862802 | 482223:1862803,1862804,1862805,1862806 | 482224:1862807,1862808,1862809,1862810 | 482225:1862811,1862812,1862813,1862814 | 482226:1862815,1862816,1862817,1862818,1862819 | 482227:1862820,1881644,1881645,1881646 | 482228:1862821,1862822,1862823,1862824,1862825,1862826\" \/>\n\t<input type=\"hidden\" name=\"no_ajax\" value=\"0\">\t\t\t<\/form>\n\t<p>&nbsp;<\/p>\n<\/div>\n\n<script type=\"text\/javascript\">\n\/\/jQuery(document).ready(function(){\ndocument.addEventListener(\"DOMContentLoaded\", function(event) { \t\nvar question_ids = \"482189,482190,482191,482192,482193,482194,482195,482196,482197,482198,482199,482200,482201,482202,482203,482204,482205,482206,482207,482208,482209,482210,482211,482212,482213,482214,482215,482216,482217,482218,482219,482220,482221,482222,482223,482224,482225,482226,482227,482228\";\nWatuPROSettings[12348] = {};\nWatuPRO.qArr = question_ids.split(',');\nWatuPRO.exam_id = 12348;\t    \nWatuPRO.post_id = 128173;\nWatuPRO.store_progress = 0;\nWatuPRO.curCatPage = 1;\nWatuPRO.requiredIDs=\"0\".split(\",\");\nWatuPRO.hAppID = \"0.47464700 1780917324\";\nvar url = \"https:\/\/www.dumpsbase.com\/freedumps\/wp-content\/plugins\/watupro\/show_exam.php\";\nWatuPRO.examMode = 1;\nWatuPRO.siteURL=\"https:\/\/www.dumpsbase.com\/freedumps\/wp-admin\/admin-ajax.php\";\nWatuPRO.emailIsNotRequired = 0;\nWatuPROIntel.init(12348);\nWatuPRO.inCategoryPages=1;});    \t \n<\/script>\n","protected":false},"excerpt":{"rendered":"<p>DumpsBase address your challenge of AWS Certified Data Engineer &#8211; Associate (DEA-C01) exam preparation by offering the most updated DEA-C01 dumps. We have updated the dumps to V13.02, which are designed to reflect modern question formats, helping you focus only on relevant topics instead of wasting time on outdated content. We have 294 practice questions [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[175,18249],"tags":[18538,21317],"class_list":["post-128173","post","type-post","status-publish","format-standard","hentry","category-amazon","category-data-engineer-associate","tag-aws-certified-data-engineer-associate-dea-c01","tag-dea-c01-exam-questions"],"_links":{"self":[{"href":"https:\/\/www.dumpsbase.com\/freedumps\/wp-json\/wp\/v2\/posts\/128173","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.dumpsbase.com\/freedumps\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.dumpsbase.com\/freedumps\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.dumpsbase.com\/freedumps\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.dumpsbase.com\/freedumps\/wp-json\/wp\/v2\/comments?post=128173"}],"version-history":[{"count":2,"href":"https:\/\/www.dumpsbase.com\/freedumps\/wp-json\/wp\/v2\/posts\/128173\/revisions"}],"predecessor-version":[{"id":128175,"href":"https:\/\/www.dumpsbase.com\/freedumps\/wp-json\/wp\/v2\/posts\/128173\/revisions\/128175"}],"wp:attachment":[{"href":"https:\/\/www.dumpsbase.com\/freedumps\/wp-json\/wp\/v2\/media?parent=128173"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.dumpsbase.com\/freedumps\/wp-json\/wp\/v2\/categories?post=128173"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.dumpsbase.com\/freedumps\/wp-json\/wp\/v2\/tags?post=128173"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}