{"id":96045,"date":"2025-01-16T02:12:17","date_gmt":"2025-01-16T02:12:17","guid":{"rendered":"https:\/\/www.dumpsbase.com\/freedumps\/?p=96045"},"modified":"2025-01-16T02:42:52","modified_gmt":"2025-01-16T02:42:52","slug":"aws-certified-data-engineer-associate-dea-c01-dumps-v8-02-the-latest-study-materials-for-your-exam-preparation","status":"publish","type":"post","link":"https:\/\/www.dumpsbase.com\/freedumps\/aws-certified-data-engineer-associate-dea-c01-dumps-v8-02-the-latest-study-materials-for-your-exam-preparation.html","title":{"rendered":"AWS Certified Data Engineer &#8211; Associate DEA-C01 Dumps (V8.02) &#8211; The Latest Study Materials for Your Exam Preparation"},"content":{"rendered":"<p>The AWS Certified Data Engineer &#8211; Associate certification demonstrates your ability to design data models, manage data lifecycles, and ensure data quality. This valuable Amazon Web Services (AWS) certification validates your expertise in core data-related AWS services, including data ingestion, transformation, and pipeline orchestration with programming concepts. Acquiring the latest study materials is a great way to make preparations. DumpsBase offers the DEA-C01 dumps (V8.02) to help you study the AWS Certified Data Engineer &#8211; Associate exam. The <a href=\"https:\/\/www.dumpsbase.com\/amazon.html\"><em><strong>Amazon<\/strong><\/em><\/a> DEA-C01 exam dumps from DumpsBase are a decisive decision toward achieving authentic success. This asset is a powerful tool that promises a 100% success rate, instilling belief in candidates as they navigate through their AWS Certified Data Engineer &#8211; Associate certification preparation. The AWS Certified Data Engineer &#8211; Associate DEA-C01 dumps offered by DumpsBase stand out as a valuable source of study material, expertly crafted to equip you with the expertise and skills necessary to overcome the challenges posed by the AWS Certified Data Engineer &#8211; Associate certification exam.<\/p>\n<h2>Below are the Amazon Web Services (AWS) <em><span style=\"background-color: #00ffff;\">DEA-C01 free dumps<\/span><\/em> for checking:<\/h2>\n<script>\n\t  window.fbAsyncInit = function() {\n\t    FB.init({\n\t      appId            : '622169541470367',\n\t      autoLogAppEvents : true,\n\t      xfbml            : true,\n\t      version          : 'v3.1'\n\t    });\n\t  };\n\t\n\t  (function(d, s, id){\n\t     var js, fjs = d.getElementsByTagName(s)[0];\n\t     if (d.getElementById(id)) {return;}\n\t     js = d.createElement(s); js.id = id;\n\t     js.src = \"https:\/\/connect.facebook.net\/en_US\/sdk.js\";\n\t     fjs.parentNode.insertBefore(js, fjs);\n\t   }(document, 'script', 'facebook-jssdk'));\n\t<\/script><script type=\"text\/javascript\" >\ndocument.addEventListener(\"DOMContentLoaded\", function(event) { \nif(!window.jQuery) alert(\"The important jQuery library is not properly loaded in your site. Your WordPress theme is probably missing the essential wp_head() call. You can switch to another theme and you will see that the plugin works fine and this notice disappears. If you are still not sure what to do you can contact us for help.\");\n});\n<\/script>  \n  \n<div  id=\"watupro_quiz\" class=\"quiz-area single-page-quiz\">\n<p id=\"submittingExam9379\" style=\"display:none;text-align:center;\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/www.dumpsbase.com\/freedumps\/wp-content\/plugins\/watupro\/img\/loading.gif\" width=\"16\" height=\"16\"><\/p>\n\n<div class=\"watupro-exam-description\" id=\"description-quiz-9379\"><\/div>\n\n<form action=\"\" method=\"post\" class=\"quiz-form\" id=\"quiz-9379\"  enctype=\"multipart\/form-data\" >\n<div class='watu-question ' id='question-1' style=';'><div id='questionWrap-1'  class='   watupro-question-id-374669'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>1. <\/span>A data engineer is configuring an AWS Glue job to read data from an Amazon S3 bucket. The data engineer has set up the necessary AWS Glue connection details and an associated IAM role. However, when the data engineer attempts to run the AWS Glue job, the data engineer receives an error message that indicates that there are problems with the Amazon S3 VPC gateway endpoint. The data engineer must resolve the error and connect the AWS Glue job to the S3 bucket. <br \/>\r<br>Which solution will meet this requirement?<\/div><input type='hidden' name='question_id[]' id='qID_1' value='374669' \/><input type='hidden' id='answerType374669' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374669[]' id='answer-id-1458257' class='answer   answerof-374669 ' value='1458257'   \/><label for='answer-id-1458257' id='answer-label-1458257' class=' answer'><span>Update the AWS Glue security group to allow inbound traffic from the Amazon S3 VPC gateway endpoint.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374669[]' id='answer-id-1458258' class='answer   answerof-374669 ' value='1458258'   \/><label for='answer-id-1458258' id='answer-label-1458258' class=' answer'><span>Configure an S3 bucket policy to explicitly grant the AWS Glue job permissions to access the S3 bucket.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374669[]' id='answer-id-1458259' class='answer   answerof-374669 ' value='1458259'   \/><label for='answer-id-1458259' id='answer-label-1458259' class=' answer'><span>Review the AWS Glue job code to ensure that the AWS Glue connection details include a fully qualified domain name.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374669[]' id='answer-id-1458260' class='answer   answerof-374669 ' value='1458260'   \/><label for='answer-id-1458260' id='answer-label-1458260' class=' answer'><span>Verify that the VPC's route table includes inbound and outbound routes for the Amazon S3 VPC gateway endpoint.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-2' style=';'><div id='questionWrap-2'  class='   watupro-question-id-374670'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>2. <\/span>A retail company has a customer data hub in an Amazon S3 bucket. Employees from many countries use the data hub to support company-wide analytics. A governance team must ensure that the <br \/>\r<br>company's data analysts can access data only for customers who are within the same country as the analysts. <br \/>\r<br>Which solution will meet these requirements with the LEAST operational effort?<\/div><input type='hidden' name='question_id[]' id='qID_2' value='374670' \/><input type='hidden' id='answerType374670' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374670[]' id='answer-id-1458261' class='answer   answerof-374670 ' value='1458261'   \/><label for='answer-id-1458261' id='answer-label-1458261' class=' answer'><span>Create a separate table for each country's customer data. Provide access to each analyst based on the country that the analyst serves.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374670[]' id='answer-id-1458262' class='answer   answerof-374670 ' value='1458262'   \/><label for='answer-id-1458262' id='answer-label-1458262' class=' answer'><span>Register the S3 bucket as a data lake location in AWS Lake Formation. Use the Lake Formation row-level security features to enforce the company's access policies.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374670[]' id='answer-id-1458263' class='answer   answerof-374670 ' value='1458263'   \/><label for='answer-id-1458263' id='answer-label-1458263' class=' answer'><span>Move the data to AWS Regions that are close to the countries where the customers are. Provide access to each analyst based on the country that the analyst serves.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374670[]' id='answer-id-1458264' class='answer   answerof-374670 ' value='1458264'   \/><label for='answer-id-1458264' id='answer-label-1458264' class=' answer'><span>Load the data into Amazon Redshift. Create a view for each country. Create separate 1AM roles for each country to provide access to data from each country. Assign the appropriate roles to the analysts.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-3' style=';'><div id='questionWrap-3'  class='   watupro-question-id-374671'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>3. <\/span>A media company wants to improve a system that recommends media content to customer based on user behavior and preferences. To improve the recommendation system, the company needs to incorporate insights from third-party datasets into the company's existing analytics platform. <br \/>\r<br>The company wants to minimize the effort and time required to incorporate third-party datasets. <br \/>\r<br>Which solution will meet these requirements with the LEAST operational overhead?<\/div><input type='hidden' name='question_id[]' id='qID_3' value='374671' \/><input type='hidden' id='answerType374671' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374671[]' id='answer-id-1458265' class='answer   answerof-374671 ' value='1458265'   \/><label for='answer-id-1458265' id='answer-label-1458265' class=' answer'><span>Use API calls to access and integrate third-party datasets from AWS Data Exchange.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374671[]' id='answer-id-1458266' class='answer   answerof-374671 ' value='1458266'   \/><label for='answer-id-1458266' id='answer-label-1458266' class=' answer'><span>Use API calls to access and integrate third-party datasets from AWS<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374671[]' id='answer-id-1458267' class='answer   answerof-374671 ' value='1458267'   \/><label for='answer-id-1458267' id='answer-label-1458267' class=' answer'><span>Use Amazon Kinesis Data Streams to access and integrate third-party datasets from AWS CodeCommit repositories.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374671[]' id='answer-id-1458268' class='answer   answerof-374671 ' value='1458268'   \/><label for='answer-id-1458268' id='answer-label-1458268' class=' answer'><span>Use Amazon Kinesis Data Streams to access and integrate third-party datasets from Amazon Elastic Container Registry (Amazon ECR).<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-4' style=';'><div id='questionWrap-4'  class='   watupro-question-id-374672'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>4. <\/span>A financial company wants to implement a data mesh. The data mesh must support centralized data governance, data analysis, and data access control. The company has decided to use AWS Glue for data catalogs and extract, transform, and load (ETL) operations. <br \/>\r<br>Which combination of AWS services will implement a data mesh? (Choose two.)<\/div><input type='hidden' name='question_id[]' id='qID_4' value='374672' \/><input type='hidden' id='answerType374672' value='checkbox'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374672[]' id='answer-id-1458269' class='answer   answerof-374672 ' value='1458269'   \/><label for='answer-id-1458269' id='answer-label-1458269' class=' answer'><span>Use Amazon Aurora for data storage. Use an Amazon Redshift provisioned cluster for data analysis.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374672[]' id='answer-id-1458270' class='answer   answerof-374672 ' value='1458270'   \/><label for='answer-id-1458270' id='answer-label-1458270' class=' answer'><span>Use Amazon S3 for data storage. Use Amazon Athena for data analysis.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374672[]' id='answer-id-1458271' class='answer   answerof-374672 ' value='1458271'   \/><label for='answer-id-1458271' id='answer-label-1458271' class=' answer'><span>Use AWS Glue DataBrewfor centralized data governance and access control.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374672[]' id='answer-id-1458272' class='answer   answerof-374672 ' value='1458272'   \/><label for='answer-id-1458272' id='answer-label-1458272' class=' answer'><span>Use Amazon RDS for data storage. Use Amazon EMR for data analysis.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374672[]' id='answer-id-1458273' class='answer   answerof-374672 ' value='1458273'   \/><label for='answer-id-1458273' id='answer-label-1458273' class=' answer'><span>Use AWS Lake Formation for centralized data governance and access control.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-5' style=';'><div id='questionWrap-5'  class='   watupro-question-id-374673'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>5. <\/span>A data engineer maintains custom Python scripts that perform a data formatting process that many AWS Lambda functions use. When the data engineer needs to modify the Python scripts, the data engineer must manually update all the Lambda functions. <br \/>\r<br>The data engineer requires a less manual way to update the Lambda functions. <br \/>\r<br>Which solution will meet this requirement?<\/div><input type='hidden' name='question_id[]' id='qID_5' value='374673' \/><input type='hidden' id='answerType374673' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374673[]' id='answer-id-1458274' class='answer   answerof-374673 ' value='1458274'   \/><label for='answer-id-1458274' id='answer-label-1458274' class=' answer'><span>Store a pointer to the custom Python scripts in the execution context object in a shared Amazon \r\nS3 bucket.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374673[]' id='answer-id-1458275' class='answer   answerof-374673 ' value='1458275'   \/><label for='answer-id-1458275' id='answer-label-1458275' class=' answer'><span>Package the custom Python scripts into Lambda layers. Apply the Lambda layers to the Lambda \r\nfunctions.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374673[]' id='answer-id-1458276' class='answer   answerof-374673 ' value='1458276'   \/><label for='answer-id-1458276' id='answer-label-1458276' class=' answer'><span>Store a pointer to the custom Python scripts in environment variables in a shared Amazon S3 bucket.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374673[]' id='answer-id-1458277' class='answer   answerof-374673 ' value='1458277'   \/><label for='answer-id-1458277' id='answer-label-1458277' class=' answer'><span>Assign the same alias to each Lambda function. Call reach Lambda function by specifying the function's alias.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-6' style=';'><div id='questionWrap-6'  class='   watupro-question-id-374674'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>6. <\/span>A company created an extract, transform, and load (ETL) data pipeline in AWS Glue. A data engineer must crawl a table that is in Microsoft SQL Server. The data engineer needs to extract, transform, and load the output of the crawl to an Amazon S3 bucket. The data engineer also must orchestrate the data pipeline. <br \/>\r<br>Which AWS service or feature will meet these requirements MOST cost-effectively?<\/div><input type='hidden' name='question_id[]' id='qID_6' value='374674' \/><input type='hidden' id='answerType374674' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374674[]' id='answer-id-1458278' class='answer   answerof-374674 ' value='1458278'   \/><label for='answer-id-1458278' id='answer-label-1458278' class=' answer'><span>AWS Step Functions<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374674[]' id='answer-id-1458279' class='answer   answerof-374674 ' value='1458279'   \/><label for='answer-id-1458279' id='answer-label-1458279' class=' answer'><span>AWS Glue workflows<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374674[]' id='answer-id-1458280' class='answer   answerof-374674 ' value='1458280'   \/><label for='answer-id-1458280' id='answer-label-1458280' class=' answer'><span>AWS Glue Studio<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374674[]' id='answer-id-1458281' class='answer   answerof-374674 ' value='1458281'   \/><label for='answer-id-1458281' id='answer-label-1458281' class=' answer'><span>Amazon Managed Workflows for Apache Airflow (Amazon MWAA)<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-7' style=';'><div id='questionWrap-7'  class='   watupro-question-id-374675'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>7. <\/span>A financial services company stores financial data in Amazon Redshift. A data engineer wants to run real-time queries on the financial data to support a web-based trading application. The data engineer wants to run the queries from within the trading application. <br \/>\r<br>Which solution will meet these requirements with the LEAST operational overhead?<\/div><input type='hidden' name='question_id[]' id='qID_7' value='374675' \/><input type='hidden' id='answerType374675' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374675[]' id='answer-id-1458282' class='answer   answerof-374675 ' value='1458282'   \/><label for='answer-id-1458282' id='answer-label-1458282' class=' answer'><span>Establish WebSocket connections to Amazon Redshift.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374675[]' id='answer-id-1458283' class='answer   answerof-374675 ' value='1458283'   \/><label for='answer-id-1458283' id='answer-label-1458283' class=' answer'><span>Use the Amazon Redshift Data AP<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374675[]' id='answer-id-1458284' class='answer   answerof-374675 ' value='1458284'   \/><label for='answer-id-1458284' id='answer-label-1458284' class=' answer'><span>Set up Java Database Connectivity (JDBC) connections to Amazon Redshift.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374675[]' id='answer-id-1458285' class='answer   answerof-374675 ' value='1458285'   \/><label for='answer-id-1458285' id='answer-label-1458285' class=' answer'><span>Store frequently accessed data in Amazon S3. Use Amazon S3 Select to run the queries.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-8' style=';'><div id='questionWrap-8'  class='   watupro-question-id-374676'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>8. <\/span>A company uses Amazon Athena for one-time queries against data that is in Amazon S3. The company has several use cases. The company must implement permission controls to separate query processes and access to query history among users, teams, and applications that are in the same AWS account. <br \/>\r<br>Which solution will meet these requirements?<\/div><input type='hidden' name='question_id[]' id='qID_8' value='374676' \/><input type='hidden' id='answerType374676' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374676[]' id='answer-id-1458286' class='answer   answerof-374676 ' value='1458286'   \/><label for='answer-id-1458286' id='answer-label-1458286' class=' answer'><span>Create an S3 bucket for each use case. Create an S3 bucket policy that grants permissions to appropriate individual IAM users. Apply the S3 bucket policy to the S3 bucket.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374676[]' id='answer-id-1458287' class='answer   answerof-374676 ' value='1458287'   \/><label for='answer-id-1458287' id='answer-label-1458287' class=' answer'><span>Create an Athena workgroup for each use case. Apply tags to the workgroup. Create an 1AM policy that uses the tags to apply appropriate permissions to the workgroup.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374676[]' id='answer-id-1458288' class='answer   answerof-374676 ' value='1458288'   \/><label for='answer-id-1458288' id='answer-label-1458288' class=' answer'><span>Create an JAM role for each use case. Assign appropriate permissions to the role for each use case. \r\nAssociate the role with Athena.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374676[]' id='answer-id-1458289' class='answer   answerof-374676 ' value='1458289'   \/><label for='answer-id-1458289' id='answer-label-1458289' class=' answer'><span>Create an AWS Glue Data Catalog resource policy that grants permissions to appropriate individual IAM users for each use case. Apply the resource policy to the specific tables that Athena uses.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-9' style=';'><div id='questionWrap-9'  class='   watupro-question-id-374677'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>9. <\/span>A data engineer needs to schedule a workflow that runs a set of AWS Glue jobs every day. The data engineer does not require the Glue jobs to run or finish at a specific time. <br \/>\r<br>Which solution will run the Glue jobs in the MOST cost-effective way?<\/div><input type='hidden' name='question_id[]' id='qID_9' value='374677' \/><input type='hidden' id='answerType374677' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374677[]' id='answer-id-1458290' class='answer   answerof-374677 ' value='1458290'   \/><label for='answer-id-1458290' id='answer-label-1458290' class=' answer'><span>Choose the FLEX execution class in the Glue job properties.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374677[]' id='answer-id-1458291' class='answer   answerof-374677 ' value='1458291'   \/><label for='answer-id-1458291' id='answer-label-1458291' class=' answer'><span>Use the Spot Instance type in Glue job properties.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374677[]' id='answer-id-1458292' class='answer   answerof-374677 ' value='1458292'   \/><label for='answer-id-1458292' id='answer-label-1458292' class=' answer'><span>Choose the STANDARD execution class in the Glue job properties.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374677[]' id='answer-id-1458293' class='answer   answerof-374677 ' value='1458293'   \/><label for='answer-id-1458293' id='answer-label-1458293' class=' answer'><span>Choose the latest version in the GlueVersion field in the Glue job properties.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-10' style=';'><div id='questionWrap-10'  class='   watupro-question-id-374678'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>10. <\/span>A data engineer needs to create an AWS Lambda function that converts the format of data from .csv to Apache Parquet. The Lambda function must run only if a user uploads a .csv file to an Amazon S3 bucket. <br \/>\r<br>Which solution will meet these requirements with the LEAST operational overhead?<\/div><input type='hidden' name='question_id[]' id='qID_10' value='374678' \/><input type='hidden' id='answerType374678' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374678[]' id='answer-id-1458294' class='answer   answerof-374678 ' value='1458294'   \/><label for='answer-id-1458294' id='answer-label-1458294' class=' answer'><span>Create an S3 event notification that has an event type of s3:ObjectCreated:*. Use a filter rule to generate notifications only when the suffix includes .csv. Set the Amazon Resource Name (ARN) of the Lambda function as the destination for the event notification.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374678[]' id='answer-id-1458295' class='answer   answerof-374678 ' value='1458295'   \/><label for='answer-id-1458295' id='answer-label-1458295' class=' answer'><span>Create an S3 event notification that has an event type of s3:ObjectTagging:* for objects that have a tag set to .csv. Set the Amazon Resource Name (ARN) of the Lambda function as the destination for the event notification.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374678[]' id='answer-id-1458296' class='answer   answerof-374678 ' value='1458296'   \/><label for='answer-id-1458296' id='answer-label-1458296' class=' answer'><span>Create an S3 event notification that has an event type of s3:*. Use a filter rule to generate notifications only when the suffix includes .csv. Set the Amazon Resource Name (ARN) of the Lambda function as the destination for the event notification.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374678[]' id='answer-id-1458297' class='answer   answerof-374678 ' value='1458297'   \/><label for='answer-id-1458297' id='answer-label-1458297' class=' answer'><span>Create an S3 event notification that has an event type of s3:ObjectCreated:*. Use a filter rule to generate notifications only when the suffix includes .csv. Set an Amazon Simple Notification Service (Amazon SNS) topic as the destination for the event notification. Subscribe the Lambda function to the SNS topic.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-11' style=';'><div id='questionWrap-11'  class='   watupro-question-id-374679'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>11. <\/span>A data engineer needs Amazon Athena queries to finish faster. The data engineer notices that all the files the Athena queries use are currently stored in uncompressed .csv format. The data engineer also notices that users perform most queries by selecting a specific column. <br \/>\r<br>Which solution will MOST speed up the Athena query performance?<\/div><input type='hidden' name='question_id[]' id='qID_11' value='374679' \/><input type='hidden' id='answerType374679' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374679[]' id='answer-id-1458298' class='answer   answerof-374679 ' value='1458298'   \/><label for='answer-id-1458298' id='answer-label-1458298' class=' answer'><span>Change the data format from .csvto JSON format. Apply Snappy compression.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374679[]' id='answer-id-1458299' class='answer   answerof-374679 ' value='1458299'   \/><label for='answer-id-1458299' id='answer-label-1458299' class=' answer'><span>Compress the .csv files by using Snappy compression.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374679[]' id='answer-id-1458300' class='answer   answerof-374679 ' value='1458300'   \/><label for='answer-id-1458300' id='answer-label-1458300' class=' answer'><span>Change the data format from .csvto Apache Parquet. Apply Snappy compression.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374679[]' id='answer-id-1458301' class='answer   answerof-374679 ' value='1458301'   \/><label for='answer-id-1458301' id='answer-label-1458301' class=' answer'><span>Compress the .csv files by using gzjg compression.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-12' style=';'><div id='questionWrap-12'  class='   watupro-question-id-374680'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>12. <\/span>A manufacturing company collects sensor data from its factory floor to monitor and enhance operational efficiency. The company uses Amazon Kinesis Data Streams to publish the data that the sensors collect to a data stream. Then Amazon Kinesis Data Firehose writes the data to an Amazon S3 bucket. <br \/>\r<br>The company needs to display a real-time view of operational efficiency on a large screen in the manufacturing facility. <br \/>\r<br>Which solution will meet these requirements with the LOWEST latency? <br \/>\r<br>A. Use Amazon Managed Service for Apache Flink (previously known as Amazon Kinesis Data Analytics) to process the sensor data. Use a connector for Apache Flink to write data to an Amazon Timestream database. Use the Timestream database as a source to create a Grafana dashboard. <br \/>\r<br>B. Configure the S3 bucket to send a notification to an AWS Lambda function when any new object is created. Use the Lambda function to publish the data to Amazon Aurora. Use Aurora as a source to create an Amazon QuickSight dashboard. <br \/>\r<br>C. Use Amazon Managed Service for Apache Flink (previously known as Amazon Kinesis Data Analytics) to process the sensor data. Create a new Data Firehose delivery stream to publish data directly to an Amazon Timestream database. Use the Timestream database as a source to create an Amazon QuickSight dashboard. <br \/>\r<br>D. Use AWS Glue bookmarks to read sensor data from the S3 bucket in real time. Publish the data to an Amazon Timestream database. Use the Timestream database as a source to create a Grafana dashboard.<\/div><input type='hidden' name='question_id[]' id='qID_12' value='374680' \/><input type='hidden' id='answerType374680' value='textarea'><!-- end question-content--><\/div><div class='question-choices '><p><textarea name='answer-374680[]' id='textarea_q_374680' class='watupro-textarea-medium' rows='5' cols='80'><\/textarea>\n<\/p><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-13' style=';'><div id='questionWrap-13'  class='   watupro-question-id-374681'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>13. <\/span>A company stores daily records of the financial performance of investment portfolios in .csv format in an Amazon S3 bucket. A data engineer uses AWS Glue crawlers to crawl the S3 data. The data engineer must make the S3 data accessible daily in the AWS Glue Data Catalog. <br \/>\r<br>Which solution will meet these requirements?<\/div><input type='hidden' name='question_id[]' id='qID_13' value='374681' \/><input type='hidden' id='answerType374681' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374681[]' id='answer-id-1458303' class='answer   answerof-374681 ' value='1458303'   \/><label for='answer-id-1458303' id='answer-label-1458303' class=' answer'><span>Create an IAM role that includes the AmazonS3FullAccess policy. Associate the role with the crawler. Specify the S3 bucket path of the source data as the crawler's data store. Create a daily schedule to run the crawler. Configure the output destination to a new path in the existing S3 bucket.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374681[]' id='answer-id-1458304' class='answer   answerof-374681 ' value='1458304'   \/><label for='answer-id-1458304' id='answer-label-1458304' class=' answer'><span>Create an IAM role that includes the AWSGlueServiceRole policy. Associate the role with the crawler. Specify the S3 bucket path of the source data as the crawler's data store. Create a daily schedule to run the crawler. Specify a database name for the output.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374681[]' id='answer-id-1458305' class='answer   answerof-374681 ' value='1458305'   \/><label for='answer-id-1458305' id='answer-label-1458305' class=' answer'><span>Create an IAM role that includes the AmazonS3FullAccess policy. Associate the role with the crawler. Specify the S3 bucket path of the source data as the crawler's data store. Allocate data processing units (DPUs) to run the crawler every day. Specify a database name for the output.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374681[]' id='answer-id-1458306' class='answer   answerof-374681 ' value='1458306'   \/><label for='answer-id-1458306' id='answer-label-1458306' class=' answer'><span>Create an IAM role that includes the AWSGlueServiceRole policy. Associate the role with the crawler. Specify the S3 bucket path of the source data as the crawler's data store. Allocate data processing units (DPUs) to run the crawler every day. Configure the output destination to a new path in the existing S3 bucket.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-14' style=';'><div id='questionWrap-14'  class='   watupro-question-id-374682'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>14. <\/span>A company loads transaction data for each day into Amazon Redshift tables at the end of each day. The company wants to have the ability to track which tables have been loaded and which tables still need to be loaded. <br \/>\r<br>A data engineer wants to store the load statuses of Redshift tables in an Amazon DynamoDB table. The data engineer creates an AWS Lambda function to publish the details of the load statuses to DynamoDB. <br \/>\r<br>How should the data engineer invoke the Lambda function to write load statuses to the DynamoDB table?<\/div><input type='hidden' name='question_id[]' id='qID_14' value='374682' \/><input type='hidden' id='answerType374682' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374682[]' id='answer-id-1458307' class='answer   answerof-374682 ' value='1458307'   \/><label for='answer-id-1458307' id='answer-label-1458307' class=' answer'><span>Use a second Lambda function to invoke the first Lambda function based on Amazon CloudWatch events.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374682[]' id='answer-id-1458308' class='answer   answerof-374682 ' value='1458308'   \/><label for='answer-id-1458308' id='answer-label-1458308' class=' answer'><span>Use the Amazon Redshift Data API to publish an event to Amazon EventBridqe. Configure an EventBridge rule to invoke the Lambda function.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374682[]' id='answer-id-1458309' class='answer   answerof-374682 ' value='1458309'   \/><label for='answer-id-1458309' id='answer-label-1458309' class=' answer'><span>Use the Amazon Redshift Data API to publish a message to an Amazon Simple Queue Service (Amazon SQS) queue. Configure the SQS queue to invoke the Lambda function.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374682[]' id='answer-id-1458310' class='answer   answerof-374682 ' value='1458310'   \/><label for='answer-id-1458310' id='answer-label-1458310' class=' answer'><span>Use a second Lambda function to invoke the first Lambda function based on AWS CloudTrail events.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-15' style=';'><div id='questionWrap-15'  class='   watupro-question-id-374683'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>15. <\/span>A data engineer needs to securely transfer 5 TB of data from an on-premises data center to an Amazon S3 bucket. Approximately 5% of the data changes every day. Updates to the data need to be regularly proliferated to the S3 bucket. The data includes files that are in multiple formats. The data engineer needs to automate the transfer process and must schedule the process to run periodically. <br \/>\r<br>Which AWS service should the data engineer use to transfer the data in the MOST operationally efficient way?<\/div><input type='hidden' name='question_id[]' id='qID_15' value='374683' \/><input type='hidden' id='answerType374683' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374683[]' id='answer-id-1458311' class='answer   answerof-374683 ' value='1458311'   \/><label for='answer-id-1458311' id='answer-label-1458311' class=' answer'><span>AWS DataSync<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374683[]' id='answer-id-1458312' class='answer   answerof-374683 ' value='1458312'   \/><label for='answer-id-1458312' id='answer-label-1458312' class=' answer'><span>AWS Glue<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374683[]' id='answer-id-1458313' class='answer   answerof-374683 ' value='1458313'   \/><label for='answer-id-1458313' id='answer-label-1458313' class=' answer'><span>AWS Direct Connect<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374683[]' id='answer-id-1458314' class='answer   answerof-374683 ' value='1458314'   \/><label for='answer-id-1458314' id='answer-label-1458314' class=' answer'><span>Amazon S3 Transfer Acceleration<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-16' style=';'><div id='questionWrap-16'  class='   watupro-question-id-374684'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>16. <\/span>A company uses an on-premises Microsoft SQL Server database to store financial transaction data. The company migrates the transaction data from the on-premises database to AWS at the end of each month. The company has noticed that the cost to migrate data from the on-premises database to an Amazon RDS for SQL Server database has increased recently. <br \/>\r<br>The company requires a cost-effective solution to migrate the data to AWS. The solution must cause minimal downtown for the applications that access the database. <br \/>\r<br>Which AWS service should the company use to meet these requirements?<\/div><input type='hidden' name='question_id[]' id='qID_16' value='374684' \/><input type='hidden' id='answerType374684' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374684[]' id='answer-id-1458315' class='answer   answerof-374684 ' value='1458315'   \/><label for='answer-id-1458315' id='answer-label-1458315' class=' answer'><span>AWS Lambda<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374684[]' id='answer-id-1458316' class='answer   answerof-374684 ' value='1458316'   \/><label for='answer-id-1458316' id='answer-label-1458316' class=' answer'><span>AWS Database Migration Service (AWS DMS)<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374684[]' id='answer-id-1458317' class='answer   answerof-374684 ' value='1458317'   \/><label for='answer-id-1458317' id='answer-label-1458317' class=' answer'><span>AWS Direct Connect<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374684[]' id='answer-id-1458318' class='answer   answerof-374684 ' value='1458318'   \/><label for='answer-id-1458318' id='answer-label-1458318' class=' answer'><span>AWS DataSync<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-17' style=';'><div id='questionWrap-17'  class='   watupro-question-id-374685'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>17. <\/span>A data engineer is building a data pipeline on AWS by using AWS Glue extract, transform, and load (ETL) jobs. The data engineer needs to process data from Amazon RDS and MongoDB, perform transformations, and load the transformed data into Amazon Redshift for analytics. The data updates must occur every hour. <br \/>\r<br>Which combination of tasks will meet these requirements with the LEAST operational overhead? (Choose two.)<\/div><input type='hidden' name='question_id[]' id='qID_17' value='374685' \/><input type='hidden' id='answerType374685' value='checkbox'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374685[]' id='answer-id-1458319' class='answer   answerof-374685 ' value='1458319'   \/><label for='answer-id-1458319' id='answer-label-1458319' class=' answer'><span>Configure AWS Glue triggers to run the ETL jobs even\/ hour.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374685[]' id='answer-id-1458320' class='answer   answerof-374685 ' value='1458320'   \/><label for='answer-id-1458320' id='answer-label-1458320' class=' answer'><span>Use AWS Glue DataBrewto clean and prepare the data for analytics.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374685[]' id='answer-id-1458321' class='answer   answerof-374685 ' value='1458321'   \/><label for='answer-id-1458321' id='answer-label-1458321' class=' answer'><span>Use AWS Lambda functions to schedule and run the ETL jobs even\/ hour.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374685[]' id='answer-id-1458322' class='answer   answerof-374685 ' value='1458322'   \/><label for='answer-id-1458322' id='answer-label-1458322' class=' answer'><span>Use AWS Glue connections to establish connectivity between the data sources and Amazon Redshift.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374685[]' id='answer-id-1458323' class='answer   answerof-374685 ' value='1458323'   \/><label for='answer-id-1458323' id='answer-label-1458323' class=' answer'><span>Use the Redshift Data API to load transformed data into Amazon Redshift.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-18' style=';'><div id='questionWrap-18'  class='   watupro-question-id-374686'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>18. <\/span>A company uses an Amazon Redshift cluster that runs on RA3 nodes. The company wants to scale read and write capacity to meet demand. A data engineer needs to identify a solution that will turn on concurrency scaling. <br \/>\r<br>Which solution will meet this requirement?<\/div><input type='hidden' name='question_id[]' id='qID_18' value='374686' \/><input type='hidden' id='answerType374686' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374686[]' id='answer-id-1458324' class='answer   answerof-374686 ' value='1458324'   \/><label for='answer-id-1458324' id='answer-label-1458324' class=' answer'><span>Turn on concurrency scaling in workload management (WLM) for Redshift Serverless workgroups.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374686[]' id='answer-id-1458325' class='answer   answerof-374686 ' value='1458325'   \/><label for='answer-id-1458325' id='answer-label-1458325' class=' answer'><span>Turn on concurrency scaling at the workload management (WLM) queue level in the Redshift cluster.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374686[]' id='answer-id-1458326' class='answer   answerof-374686 ' value='1458326'   \/><label for='answer-id-1458326' id='answer-label-1458326' class=' answer'><span>Turn on concurrency scaling in the settings during the creation of and new Redshift cluster.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374686[]' id='answer-id-1458327' class='answer   answerof-374686 ' value='1458327'   \/><label for='answer-id-1458327' id='answer-label-1458327' class=' answer'><span>Turn on concurrency scaling for the daily usage quota for the Redshift cluster.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-19' style=';'><div id='questionWrap-19'  class='   watupro-question-id-374687'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>19. <\/span>A data engineer must orchestrate a series of Amazon Athena queries that will run every day. Each query can run for more than 15 minutes. <br \/>\r<br>Which combination of steps will meet these requirements MOST cost-effectively? (Choose two.)<\/div><input type='hidden' name='question_id[]' id='qID_19' value='374687' \/><input type='hidden' id='answerType374687' value='checkbox'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374687[]' id='answer-id-1458328' class='answer   answerof-374687 ' value='1458328'   \/><label for='answer-id-1458328' id='answer-label-1458328' class=' answer'><span>Use an AWS Lambda function and the Athena Boto3 client start_query_execution API call to invoke the Athena queries programmatically.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374687[]' id='answer-id-1458329' class='answer   answerof-374687 ' value='1458329'   \/><label for='answer-id-1458329' id='answer-label-1458329' class=' answer'><span>Create an AWS Step Functions workflow and add two states. Add the first state before the Lambda function. Configure the second state as a Wait state to periodically check whether the Athena query has finished using the Athena Boto3 get_query_execution API call. Configure the workflow to invoke the next query when the current query has finished running.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374687[]' id='answer-id-1458330' class='answer   answerof-374687 ' value='1458330'   \/><label for='answer-id-1458330' id='answer-label-1458330' class=' answer'><span>Use an AWS Glue Python shell job and the Athena Boto3 client start_query_execution API call to invoke the Athena queries programmatically.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374687[]' id='answer-id-1458331' class='answer   answerof-374687 ' value='1458331'   \/><label for='answer-id-1458331' id='answer-label-1458331' class=' answer'><span>Use an AWS Glue Python shell script to run a sleep timer that checks every 5 minutes to determine whether the current Athena query has finished running successfully. Configure the Python shell script to invoke the next query when the current query has finished running.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374687[]' id='answer-id-1458332' class='answer   answerof-374687 ' value='1458332'   \/><label for='answer-id-1458332' id='answer-label-1458332' class=' answer'><span>Use Amazon Managed Workflows for Apache Airflow (Amazon MWAA) to orchestrate the Athena queries in AWS Batch.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-20' style=';'><div id='questionWrap-20'  class='   watupro-question-id-374688'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>20. <\/span>A company is migrating on-premises workloads to AWS. The company wants to reduce overall operational overhead. The company also wants to explore serverless options. <br \/>\r<br>The company's current workloads use Apache Pig, Apache Oozie, Apache Spark, Apache Hbase, and Apache Flink. The on-premises workloads process petabytes of data in seconds. The company must maintain similar or better performance after the migration to AWS. <br \/>\r<br>Which extract, transform, and load (ETL) service will meet these requirements?<\/div><input type='hidden' name='question_id[]' id='qID_20' value='374688' \/><input type='hidden' id='answerType374688' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374688[]' id='answer-id-1458333' class='answer   answerof-374688 ' value='1458333'   \/><label for='answer-id-1458333' id='answer-label-1458333' class=' answer'><span>AWS Glue<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374688[]' id='answer-id-1458334' class='answer   answerof-374688 ' value='1458334'   \/><label for='answer-id-1458334' id='answer-label-1458334' class=' answer'><span>Amazon EMR<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374688[]' id='answer-id-1458335' class='answer   answerof-374688 ' value='1458335'   \/><label for='answer-id-1458335' id='answer-label-1458335' class=' answer'><span>AWS Lambda<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374688[]' id='answer-id-1458336' class='answer   answerof-374688 ' value='1458336'   \/><label for='answer-id-1458336' id='answer-label-1458336' class=' answer'><span>Amazon Redshift<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-21' style=';'><div id='questionWrap-21'  class='   watupro-question-id-374689'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>21. <\/span>A data engineer must use AWS services to ingest a dataset into an Amazon S3 data lake. The data engineer profiles the dataset and discovers that the dataset contains personally identifiable information (PII). The data engineer must implement a solution to profile the dataset and obfuscate the PII. <br \/>\r<br>Which solution will meet this requirement with the LEAST operational effort?<\/div><input type='hidden' name='question_id[]' id='qID_21' value='374689' \/><input type='hidden' id='answerType374689' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374689[]' id='answer-id-1458337' class='answer   answerof-374689 ' value='1458337'   \/><label for='answer-id-1458337' id='answer-label-1458337' class=' answer'><span>Use an Amazon Kinesis Data Firehose delivery stream to process the dataset. Create an AWS Lambda transform function to identify the PI<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374689[]' id='answer-id-1458338' class='answer   answerof-374689 ' value='1458338'   \/><label for='answer-id-1458338' id='answer-label-1458338' class=' answer'><span>Use an AWS SDK to obfuscate the PI<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374689[]' id='answer-id-1458339' class='answer   answerof-374689 ' value='1458339'   \/><label for='answer-id-1458339' id='answer-label-1458339' class=' answer'><span>Set the S3 data lake as the target for the delivery stream.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374689[]' id='answer-id-1458340' class='answer   answerof-374689 ' value='1458340'   \/><label for='answer-id-1458340' id='answer-label-1458340' class=' answer'><span>Use the Detect PII transform in AWS Glue Studio to identify the PI<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374689[]' id='answer-id-1458341' class='answer   answerof-374689 ' value='1458341'   \/><label for='answer-id-1458341' id='answer-label-1458341' class=' answer'><span>Obfuscate the PI<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374689[]' id='answer-id-1458342' class='answer   answerof-374689 ' value='1458342'   \/><label for='answer-id-1458342' id='answer-label-1458342' class=' answer'><span>Use an AWS Step Functions state machine to orchestrate a data pipeline to ingest the data into the S3 data lake.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374689[]' id='answer-id-1458343' class='answer   answerof-374689 ' value='1458343'   \/><label for='answer-id-1458343' id='answer-label-1458343' class=' answer'><span>Use the Detect PII transform in AWS Glue Studio to identify the PI<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374689[]' id='answer-id-1458344' class='answer   answerof-374689 ' value='1458344'   \/><label for='answer-id-1458344' id='answer-label-1458344' class=' answer'><span>Create a rule in AWS Glue Data Quality to obfuscate the PI<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374689[]' id='answer-id-1458345' class='answer   answerof-374689 ' value='1458345'   \/><label for='answer-id-1458345' id='answer-label-1458345' class=' answer'><span>Use an AWS Step Functions state machine to orchestrate a data pipeline to ingest the data into the S3 data lake.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374689[]' id='answer-id-1458346' class='answer   answerof-374689 ' value='1458346'   \/><label for='answer-id-1458346' id='answer-label-1458346' class=' answer'><span>Ingest the dataset into Amazon DynamoD<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374689[]' id='answer-id-1458347' class='answer   answerof-374689 ' value='1458347'   \/><label for='answer-id-1458347' id='answer-label-1458347' class=' answer'><span>Create an AWS Lambda function to identify and obfuscate the PII in the DynamoDB table and to transform the data. Use the same Lambda function to ingest the data into the S3 data lake.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-22' style=';'><div id='questionWrap-22'  class='   watupro-question-id-374690'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>22. <\/span>A company maintains multiple extract, transform, and load (ETL) workflows that ingest data from the company's operational databases into an Amazon S3 based data lake. The ETL workflows use AWS Glue and Amazon EMR to process data. <br \/>\r<br>The company wants to improve the existing architecture to provide automated orchestration and to require minimal manual effort. <br \/>\r<br>Which solution will meet these requirements with the LEAST operational overhead?<\/div><input type='hidden' name='question_id[]' id='qID_22' value='374690' \/><input type='hidden' id='answerType374690' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374690[]' id='answer-id-1458348' class='answer   answerof-374690 ' value='1458348'   \/><label for='answer-id-1458348' id='answer-label-1458348' class=' answer'><span>AWS Glue workflows<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374690[]' id='answer-id-1458349' class='answer   answerof-374690 ' value='1458349'   \/><label for='answer-id-1458349' id='answer-label-1458349' class=' answer'><span>AWS Step Functions tasks<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374690[]' id='answer-id-1458350' class='answer   answerof-374690 ' value='1458350'   \/><label for='answer-id-1458350' id='answer-label-1458350' class=' answer'><span>AWS Lambda functions<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374690[]' id='answer-id-1458351' class='answer   answerof-374690 ' value='1458351'   \/><label for='answer-id-1458351' id='answer-label-1458351' class=' answer'><span>Amazon Managed Workflows for Apache Airflow (Amazon MWAA) workflows<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-23' style=';'><div id='questionWrap-23'  class='   watupro-question-id-374691'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>23. <\/span>A company currently stores all of its data in Amazon S3 by using the S3 Standard storage class. <br \/>\r<br>A data engineer examined data access patterns to identify trends. During the first 6 months, most data files are accessed several times each day. Between 6 months and 2 years, most data files are accessed once or twice each month. After 2 years, data files are accessed only once or twice each year. <br \/>\r<br>The data engineer needs to use an S3 Lifecycle policy to develop new data storage rules. The new storage solution must continue to provide high availability. <br \/>\r<br>Which solution will meet these requirements in the MOST cost-effective way?<\/div><input type='hidden' name='question_id[]' id='qID_23' value='374691' \/><input type='hidden' id='answerType374691' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374691[]' id='answer-id-1458352' class='answer   answerof-374691 ' value='1458352'   \/><label for='answer-id-1458352' id='answer-label-1458352' class=' answer'><span>Transition objects to S3 One Zone-Infrequent Access (S3 One Zone-IA) after 6 months. Transfer objects to S3 Glacier Flexible Retrieval after 2 years.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374691[]' id='answer-id-1458353' class='answer   answerof-374691 ' value='1458353'   \/><label for='answer-id-1458353' id='answer-label-1458353' class=' answer'><span>Transition objects to S3 Standard-Infrequent Access (S3 Standard-IA) after 6 months. Transfer objects to S3 Glacier Flexible Retrieval after 2 years.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374691[]' id='answer-id-1458354' class='answer   answerof-374691 ' value='1458354'   \/><label for='answer-id-1458354' id='answer-label-1458354' class=' answer'><span>Transition objects to S3 Standard-Infrequent Access (S3 Standard-IA) after 6 months. Transfer objects to S3 Glacier Deep Archive after 2 years.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374691[]' id='answer-id-1458355' class='answer   answerof-374691 ' value='1458355'   \/><label for='answer-id-1458355' id='answer-label-1458355' class=' answer'><span>Transition objects to S3 One Zone-Infrequent Access (S3 One Zone-IA) after 6 months. Transfer objects to S3 Glacier Deep Archive after 2 years.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-24' style=';'><div id='questionWrap-24'  class='   watupro-question-id-374692'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>24. <\/span>A company maintains an Amazon Redshift provisioned cluster that the company uses for extract, transform, and load (ETL) operations to support critical analysis tasks. A sales team within the company maintains a Redshift cluster that the sales team uses for business intelligence (BI) tasks. The sales team recently requested access to the data that is in the ETL Redshift cluster so the team can perform weekly summary analysis tasks. The sales team needs to join data from the ETL cluster with data that is in the sales team's BI cluster. <br \/>\r<br>The company needs a solution that will share the ETL cluster data with the sales team without interrupting the critical analysis tasks. The solution must minimize usage of the computing resources of the ETL cluster. <br \/>\r<br>Which solution will meet these requirements?<\/div><input type='hidden' name='question_id[]' id='qID_24' value='374692' \/><input type='hidden' id='answerType374692' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374692[]' id='answer-id-1458356' class='answer   answerof-374692 ' value='1458356'   \/><label for='answer-id-1458356' id='answer-label-1458356' class=' answer'><span>Set up the sales team Bl cluster as a consumer of the ETL cluster by using Redshift data sharing.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374692[]' id='answer-id-1458357' class='answer   answerof-374692 ' value='1458357'   \/><label for='answer-id-1458357' id='answer-label-1458357' class=' answer'><span>Create materialized views based on the sales team's requirements. Grant the sales team direct access to the ETL cluster.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374692[]' id='answer-id-1458358' class='answer   answerof-374692 ' value='1458358'   \/><label for='answer-id-1458358' id='answer-label-1458358' class=' answer'><span>Create database views based on the sales team's requirements. Grant the sales team direct access to the ETL cluster.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374692[]' id='answer-id-1458359' class='answer   answerof-374692 ' value='1458359'   \/><label for='answer-id-1458359' id='answer-label-1458359' class=' answer'><span>Unload a copy of the data from the ETL cluster to an Amazon S3 bucket every week. Create an Amazon Redshift Spectrum table based on the content of the ETL cluster.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-25' style=';'><div id='questionWrap-25'  class='   watupro-question-id-374693'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>25. <\/span>A data engineer needs to join data from multiple sources to perform a one-time analysis job. The data is stored in Amazon DynamoDB, Amazon RDS, Amazon Redshift, and Amazon S3. <br \/>\r<br>Which solution will meet this requirement MOST cost-effectively?<\/div><input type='hidden' name='question_id[]' id='qID_25' value='374693' \/><input type='hidden' id='answerType374693' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374693[]' id='answer-id-1458360' class='answer   answerof-374693 ' value='1458360'   \/><label for='answer-id-1458360' id='answer-label-1458360' class=' answer'><span>Use an Amazon EMR provisioned cluster to read from all sources. Use Apache Spark to join the data and perform the analysis.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374693[]' id='answer-id-1458361' class='answer   answerof-374693 ' value='1458361'   \/><label for='answer-id-1458361' id='answer-label-1458361' class=' answer'><span>Copy the data from DynamoDB, Amazon RDS, and Amazon Redshift into Amazon S3. Run Amazon Athena queries directly on the S3 files.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374693[]' id='answer-id-1458362' class='answer   answerof-374693 ' value='1458362'   \/><label for='answer-id-1458362' id='answer-label-1458362' class=' answer'><span>Use Amazon Athena Federated Query to join the data from all data sources.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374693[]' id='answer-id-1458363' class='answer   answerof-374693 ' value='1458363'   \/><label for='answer-id-1458363' id='answer-label-1458363' class=' answer'><span>Use Redshift Spectrum to query data from DynamoDB, Amazon RDS, and Amazon S3 directly from Redshift.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-26' style=';'><div id='questionWrap-26'  class='   watupro-question-id-374694'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>26. <\/span>A company is planning to use a provisioned Amazon EMR cluster that runs Apache Spark jobs to perform big data analysis. The company requires high reliability. A big data team must follow best practices for running cost-optimized and long-running workloads on Amazon EMR. The team must find a solution that will maintain the company's current level of performance. <br \/>\r<br>Which combination of resources will meet these requirements MOST cost-effectively? (Choose two.)<\/div><input type='hidden' name='question_id[]' id='qID_26' value='374694' \/><input type='hidden' id='answerType374694' value='checkbox'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374694[]' id='answer-id-1458364' class='answer   answerof-374694 ' value='1458364'   \/><label for='answer-id-1458364' id='answer-label-1458364' class=' answer'><span>Use Hadoop Distributed File System (HDFS) as a persistent data store.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374694[]' id='answer-id-1458365' class='answer   answerof-374694 ' value='1458365'   \/><label for='answer-id-1458365' id='answer-label-1458365' class=' answer'><span>Use Amazon S3 as a persistent data store.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374694[]' id='answer-id-1458366' class='answer   answerof-374694 ' value='1458366'   \/><label for='answer-id-1458366' id='answer-label-1458366' class=' answer'><span>Use x86-based instances for core nodes and task nodes.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374694[]' id='answer-id-1458367' class='answer   answerof-374694 ' value='1458367'   \/><label for='answer-id-1458367' id='answer-label-1458367' class=' answer'><span>Use Graviton instances for core nodes and task nodes.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374694[]' id='answer-id-1458368' class='answer   answerof-374694 ' value='1458368'   \/><label for='answer-id-1458368' id='answer-label-1458368' class=' answer'><span>Use Spot Instances for all primary nodes.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-27' style=';'><div id='questionWrap-27'  class='   watupro-question-id-374695'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>27. <\/span>A company wants to implement real-time analytics capabilities. The company wants to use Amazon Kinesis Data Streams and Amazon Redshift to ingest and process streaming data at the rate of several gigabytes per second. The company wants to derive near real-time insights by using existing business intelligence (BI) and analytics tools. <br \/>\r<br>Which solution will meet these requirements with the LEAST operational overhead?<\/div><input type='hidden' name='question_id[]' id='qID_27' value='374695' \/><input type='hidden' id='answerType374695' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374695[]' id='answer-id-1458369' class='answer   answerof-374695 ' value='1458369'   \/><label for='answer-id-1458369' id='answer-label-1458369' class=' answer'><span>Use Kinesis Data Streams to stage data in Amazon S3. Use the COPY command to load data from Amazon S3 directly into Amazon Redshift to make the data immediately available for real-time analysis.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374695[]' id='answer-id-1458370' class='answer   answerof-374695 ' value='1458370'   \/><label for='answer-id-1458370' id='answer-label-1458370' class=' answer'><span>Access the data from Kinesis Data Streams by using SQL queries. Create materialized views directly on top of the stream. Refresh the materialized views regularly to query the most recent stream data.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374695[]' id='answer-id-1458371' class='answer   answerof-374695 ' value='1458371'   \/><label for='answer-id-1458371' id='answer-label-1458371' class=' answer'><span>Create an external schema in Amazon Redshift to map the data from Kinesis Data Streams to an Amazon Redshift object. Create a materialized view to read data from the stream. Set the materialized view to auto refresh.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374695[]' id='answer-id-1458372' class='answer   answerof-374695 ' value='1458372'   \/><label for='answer-id-1458372' id='answer-label-1458372' class=' answer'><span>Connect Kinesis Data Streams to Amazon Kinesis Data Firehose. Use Kinesis Data Firehose to stage the data in Amazon S3. Use the COPY command to load the data from Amazon S3 to a table in Amazon Redshift.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-28' style=';'><div id='questionWrap-28'  class='   watupro-question-id-374696'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>28. <\/span>A company uses an Amazon QuickSight dashboard to monitor usage of one of the company's applications. The company uses AWS Glue jobs to process data for the dashboard. The company stores the data in a single Amazon S3 bucket. The company adds new data every day. <br \/>\r<br>A data engineer discovers that dashboard queries are becoming slower over time. The data engineer determines that the root cause of the slowing queries is long-running AWS Glue jobs. <br \/>\r<br>Which actions should the data engineer take to improve the performance of the AWS Glue jobs? (Choose two.)<\/div><input type='hidden' name='question_id[]' id='qID_28' value='374696' \/><input type='hidden' id='answerType374696' value='checkbox'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374696[]' id='answer-id-1458373' class='answer   answerof-374696 ' value='1458373'   \/><label for='answer-id-1458373' id='answer-label-1458373' class=' answer'><span>Partition the data that is in the S3 bucket. Organize the data by year, month, and day.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374696[]' id='answer-id-1458374' class='answer   answerof-374696 ' value='1458374'   \/><label for='answer-id-1458374' id='answer-label-1458374' class=' answer'><span>Increase the AWS Glue instance size by scaling up the worker type.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374696[]' id='answer-id-1458375' class='answer   answerof-374696 ' value='1458375'   \/><label for='answer-id-1458375' id='answer-label-1458375' class=' answer'><span>Convert the AWS Glue schema to the DynamicFrame schema class.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374696[]' id='answer-id-1458376' class='answer   answerof-374696 ' value='1458376'   \/><label for='answer-id-1458376' id='answer-label-1458376' class=' answer'><span>Adjust AWS Glue job scheduling frequency so the jobs run half as many times each day.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374696[]' id='answer-id-1458377' class='answer   answerof-374696 ' value='1458377'   \/><label for='answer-id-1458377' id='answer-label-1458377' class=' answer'><span>Modify the 1AM role that grants access to AWS glue to grant access to all S3 features.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-29' style=';'><div id='questionWrap-29'  class='   watupro-question-id-374697'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>29. <\/span>A data engineer needs to use AWS Step Functions to design an orchestration workflow. The workflow must parallel process a large collection of data files and apply a specific transformation to each file. <br \/>\r<br>Which Step Functions state should the data engineer use to meet these requirements?<\/div><input type='hidden' name='question_id[]' id='qID_29' value='374697' \/><input type='hidden' id='answerType374697' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374697[]' id='answer-id-1458378' class='answer   answerof-374697 ' value='1458378'   \/><label for='answer-id-1458378' id='answer-label-1458378' class=' answer'><span>Parallel state<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374697[]' id='answer-id-1458379' class='answer   answerof-374697 ' value='1458379'   \/><label for='answer-id-1458379' id='answer-label-1458379' class=' answer'><span>Choice state<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374697[]' id='answer-id-1458380' class='answer   answerof-374697 ' value='1458380'   \/><label for='answer-id-1458380' id='answer-label-1458380' class=' answer'><span>Map state<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374697[]' id='answer-id-1458381' class='answer   answerof-374697 ' value='1458381'   \/><label for='answer-id-1458381' id='answer-label-1458381' class=' answer'><span>Wait state<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-30' style=';'><div id='questionWrap-30'  class='   watupro-question-id-374698'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>30. <\/span>A company is migrating a legacy application to an Amazon S3 based data lake. A data engineer reviewed data that is associated with the legacy application. The data engineer found that the legacy data contained some duplicate information. <br \/>\r<br>The data engineer must identify and remove duplicate information from the legacy application data. <br \/>\r<br>Which solution will meet these requirements with the LEAST operational overhead?<\/div><input type='hidden' name='question_id[]' id='qID_30' value='374698' \/><input type='hidden' id='answerType374698' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374698[]' id='answer-id-1458382' class='answer   answerof-374698 ' value='1458382'   \/><label for='answer-id-1458382' id='answer-label-1458382' class=' answer'><span>Write a custom extract, transform, and load (ETL) job in Python. Use the DataFramedrop duplicatesf) function by importing the Pandas library to perform data deduplication.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374698[]' id='answer-id-1458383' class='answer   answerof-374698 ' value='1458383'   \/><label for='answer-id-1458383' id='answer-label-1458383' class=' answer'><span>Write an AWS Glue extract, transform, and load (ETL) job. Use the FindMatches machine learning (ML) transform to transform the data to perform data deduplication.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374698[]' id='answer-id-1458384' class='answer   answerof-374698 ' value='1458384'   \/><label for='answer-id-1458384' id='answer-label-1458384' class=' answer'><span>Write a custom extract, transform, and load (ETL) job in Python. Import the Python dedupe library. Use the dedupe library to perform data deduplication.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374698[]' id='answer-id-1458385' class='answer   answerof-374698 ' value='1458385'   \/><label for='answer-id-1458385' id='answer-label-1458385' class=' answer'><span>Write an AWS Glue extract, transform, and load (ETL) job. Import the Python dedupe library. Use the dedupe library to perform data deduplication.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-31' style=';'><div id='questionWrap-31'  class='   watupro-question-id-374699'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>31. <\/span>A company is building an analytics solution. The solution uses Amazon S3 for data lake storage and Amazon Redshift for a data warehouse. The company wants to use Amazon Redshift Spectrum to query the data that is in Amazon S3. <br \/>\r<br>Which actions will provide the FASTEST queries? (Choose two.)<\/div><input type='hidden' name='question_id[]' id='qID_31' value='374699' \/><input type='hidden' id='answerType374699' value='checkbox'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374699[]' id='answer-id-1458386' class='answer   answerof-374699 ' value='1458386'   \/><label for='answer-id-1458386' id='answer-label-1458386' class=' answer'><span>Use gzip compression to compress individual files to sizes that are between 1 GB and 5 G<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374699[]' id='answer-id-1458387' class='answer   answerof-374699 ' value='1458387'   \/><label for='answer-id-1458387' id='answer-label-1458387' class=' answer'><span>Use a columnar storage file format.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374699[]' id='answer-id-1458388' class='answer   answerof-374699 ' value='1458388'   \/><label for='answer-id-1458388' id='answer-label-1458388' class=' answer'><span>Partition the data based on the most common query predicates.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374699[]' id='answer-id-1458389' class='answer   answerof-374699 ' value='1458389'   \/><label for='answer-id-1458389' id='answer-label-1458389' class=' answer'><span>Split the data into files that are less than 10 K<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374699[]' id='answer-id-1458390' class='answer   answerof-374699 ' value='1458390'   \/><label for='answer-id-1458390' id='answer-label-1458390' class=' answer'><span>Use file formats that are not<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-32' style=';'><div id='questionWrap-32'  class='   watupro-question-id-374700'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>32. <\/span>A company uses Amazon RDS to store transactional data. The company runs an RDS DB instance in a private subnet. A developer wrote an AWS Lambda function with default settings to insert, update, or delete data in the DB instance. <br \/>\r<br>The developer needs to give the Lambda function the ability to connect to the DB instance privately without using the public internet. <br \/>\r<br>Which combination of steps will meet this requirement with the LEAST operational overhead? (Choose two.) <br \/>\r<br>A. Turn on the public access setting for the DB instance. <br \/>\r<br>B. Update the security group of the DB instance to allow only Lambda function invocations on the <br \/>\r<br>database port. <br \/>\r<br>C. Configure the Lambda function to run in the same subnet that the DB instance uses. <br \/>\r<br>D. Attach the same security group to the Lambda function and the DB instance. Include a self-referencing rule that allows access through the database port. <br \/>\r<br>E. Update the network ACL of the private subnet to include a self-referencing rule that allows access through the database port.<\/div><input type='hidden' name='question_id[]' id='qID_32' value='374700' \/><input type='hidden' id='answerType374700' value='textarea'><!-- end question-content--><\/div><div class='question-choices '><p><textarea name='answer-374700[]' id='textarea_q_374700' class='watupro-textarea-medium' rows='5' cols='80'><\/textarea>\n<\/p><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-33' style=';'><div id='questionWrap-33'  class='   watupro-question-id-374701'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>33. <\/span>A company has a frontend ReactJS website that uses Amazon API Gateway to invoke REST APIs. The APIs perform the functionality of the website. A data engineer needs to write a Python script that can be occasionally invoked through API Gateway. The code must return results to API Gateway. <br \/>\r<br>Which solution will meet these requirements with the LEAST operational overhead?<\/div><input type='hidden' name='question_id[]' id='qID_33' value='374701' \/><input type='hidden' id='answerType374701' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374701[]' id='answer-id-1458392' class='answer   answerof-374701 ' value='1458392'   \/><label for='answer-id-1458392' id='answer-label-1458392' class=' answer'><span>Deploy a custom Python script on an Amazon Elastic Container Service (Amazon ECS) cluster.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374701[]' id='answer-id-1458393' class='answer   answerof-374701 ' value='1458393'   \/><label for='answer-id-1458393' id='answer-label-1458393' class=' answer'><span>Create an AWS Lambda Python function with provisioned concurrency.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374701[]' id='answer-id-1458394' class='answer   answerof-374701 ' value='1458394'   \/><label for='answer-id-1458394' id='answer-label-1458394' class=' answer'><span>Deploy a custom Python script that can integrate with API Gateway on Amazon Elastic Kubernetes Service (Amazon EKS).<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374701[]' id='answer-id-1458395' class='answer   answerof-374701 ' value='1458395'   \/><label for='answer-id-1458395' id='answer-label-1458395' class=' answer'><span>Create an AWS Lambda function. Ensure that the function is warm by scheduling an Amazon EventBridge rule to invoke the Lambda function every 5 minutes by using mock events.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-34' style=';'><div id='questionWrap-34'  class='   watupro-question-id-374702'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>34. <\/span>A company has a production AWS account that runs company workloads. The company's security team created a security AWS account to store and analyze security logs from the production AWS account. The security logs in the production AWS account are stored in Amazon CloudWatch Logs. The company needs to use Amazon Kinesis Data Streams to deliver the security logs to the security AWS account. <br \/>\r<br>Which solution will meet these requirements?<\/div><input type='hidden' name='question_id[]' id='qID_34' value='374702' \/><input type='hidden' id='answerType374702' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374702[]' id='answer-id-1458396' class='answer   answerof-374702 ' value='1458396'   \/><label for='answer-id-1458396' id='answer-label-1458396' class=' answer'><span>Create a destination data stream in the production AWS account. In the security AWS account, create an IAM role that has cross-account permissions to Kinesis Data Streams in the production AWS account.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374702[]' id='answer-id-1458397' class='answer   answerof-374702 ' value='1458397'   \/><label for='answer-id-1458397' id='answer-label-1458397' class=' answer'><span>Create a destination data stream in the security AWS account. Create an IAM role and a trust policy to grant CloudWatch Logs the permission to put data into the stream. Create a subscription filter in the security AWS account.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374702[]' id='answer-id-1458398' class='answer   answerof-374702 ' value='1458398'   \/><label for='answer-id-1458398' id='answer-label-1458398' class=' answer'><span>Create a destination data stream in the production AWS account. In the production AWS account, create an IAM role that has cross-account permissions to Kinesis Data Streams in the security AWS account.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374702[]' id='answer-id-1458399' class='answer   answerof-374702 ' value='1458399'   \/><label for='answer-id-1458399' id='answer-label-1458399' class=' answer'><span>Create a destination data stream in the security AWS account. Create an IAM role and a trust policy to grant CloudWatch Logs the permission to put data into the stream. Create a subscription filter in the production AWS account.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-35' style=';'><div id='questionWrap-35'  class='   watupro-question-id-374703'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>35. <\/span>A company uses Amazon S3 to store semi-structured data in a transactional data lake. Some of the data files are small, but other data files are tens of terabytes. <br \/>\r<br>A data engineer must perform a change data capture (CDC) operation to identify changed data from the data source. The data source sends a full snapshot as a JSON file every day and ingests the changed data into the data lake. <br \/>\r<br>Which solution will capture the changed data MOST cost-effectively?<\/div><input type='hidden' name='question_id[]' id='qID_35' value='374703' \/><input type='hidden' id='answerType374703' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374703[]' id='answer-id-1458400' class='answer   answerof-374703 ' value='1458400'   \/><label for='answer-id-1458400' id='answer-label-1458400' class=' answer'><span>Create an AWS Lambda function to identify the changes between the previous data and the current data. Configure the Lambda function to ingest the changes into the data lake.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374703[]' id='answer-id-1458401' class='answer   answerof-374703 ' value='1458401'   \/><label for='answer-id-1458401' id='answer-label-1458401' class=' answer'><span>Ingest the data into Amazon RDS for MySQ<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374703[]' id='answer-id-1458402' class='answer   answerof-374703 ' value='1458402'   \/><label for='answer-id-1458402' id='answer-label-1458402' class=' answer'><span>Use AWS Database Migration Service (AWS DMS) to write the changed data to the data lake.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374703[]' id='answer-id-1458403' class='answer   answerof-374703 ' value='1458403'   \/><label for='answer-id-1458403' id='answer-label-1458403' class=' answer'><span>Use an open source data lake format to merge the data source with the S3 data lake to insert the new data and update the existing data.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374703[]' id='answer-id-1458404' class='answer   answerof-374703 ' value='1458404'   \/><label for='answer-id-1458404' id='answer-label-1458404' class=' answer'><span>Ingest the data into an Amazon Aurora MySQL DB instance that runs Aurora Serverless. Use AWS Database Migration Service (AWS DMS) to write the changed data to the data lake.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-36' style=';'><div id='questionWrap-36'  class='   watupro-question-id-374704'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>36. <\/span>A data engineer runs Amazon Athena queries on data that is in an Amazon S3 bucket. The Athena queries use AWS Glue Data Catalog as a metadata table. <br \/>\r<br>The data engineer notices that the Athena query plans are experiencing a performance bottleneck. The data engineer determines that the cause of the performance bottleneck is the large number of partitions that are in the S3 bucket. The data engineer must resolve the performance bottleneck and reduce Athena query planning time. <br \/>\r<br>Which solutions will meet these requirements? (Choose two.)<\/div><input type='hidden' name='question_id[]' id='qID_36' value='374704' \/><input type='hidden' id='answerType374704' value='checkbox'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374704[]' id='answer-id-1458405' class='answer   answerof-374704 ' value='1458405'   \/><label for='answer-id-1458405' id='answer-label-1458405' class=' answer'><span>Create an AWS Glue partition index. Enable partition filtering.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374704[]' id='answer-id-1458406' class='answer   answerof-374704 ' value='1458406'   \/><label for='answer-id-1458406' id='answer-label-1458406' class=' answer'><span>Bucket the data based on a column that the data have in common in a WHERE clause of the user query<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374704[]' id='answer-id-1458407' class='answer   answerof-374704 ' value='1458407'   \/><label for='answer-id-1458407' id='answer-label-1458407' class=' answer'><span>Use Athena partition projection based on the S3 bucket prefix.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374704[]' id='answer-id-1458408' class='answer   answerof-374704 ' value='1458408'   \/><label for='answer-id-1458408' id='answer-label-1458408' class=' answer'><span>Transform the data that is in the S3 bucket to Apache Parquet format.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374704[]' id='answer-id-1458409' class='answer   answerof-374704 ' value='1458409'   \/><label for='answer-id-1458409' id='answer-label-1458409' class=' answer'><span>Use the Amazon EMR S3DistCP utility to combine smaller objects in the S3 bucket into larger objects.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-37' style=';'><div id='questionWrap-37'  class='   watupro-question-id-374705'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>37. <\/span>A data engineer must manage the ingestion of real-time streaming data into AWS. The data engineer wants to perform real-time analytics on the incoming streaming data by using time-based aggregations over a window of up to 30 minutes. The data engineer needs a solution that is highly fault tolerant. <br \/>\r<br>Which solution will meet these requirements with the LEAST operational overhead?<\/div><input type='hidden' name='question_id[]' id='qID_37' value='374705' \/><input type='hidden' id='answerType374705' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374705[]' id='answer-id-1458410' class='answer   answerof-374705 ' value='1458410'   \/><label for='answer-id-1458410' id='answer-label-1458410' class=' answer'><span>Use an AWS Lambda function that includes both the business and the analytics logic to perform time-based aggregations over a window of up to 30 minutes for the data in Amazon Kinesis Data Streams.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374705[]' id='answer-id-1458411' class='answer   answerof-374705 ' value='1458411'   \/><label for='answer-id-1458411' id='answer-label-1458411' class=' answer'><span>Use Amazon Managed Service for Apache Flink (previously known as Amazon Kinesis Data Analytics) to analyze the data that might occasionally contain duplicates by using multiple types of aggregations.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374705[]' id='answer-id-1458412' class='answer   answerof-374705 ' value='1458412'   \/><label for='answer-id-1458412' id='answer-label-1458412' class=' answer'><span>Use an AWS Lambda function that includes both the business and the analytics logic to perform aggregations for a tumbling window of up to 30 minutes, based on the event timestamp.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374705[]' id='answer-id-1458413' class='answer   answerof-374705 ' value='1458413'   \/><label for='answer-id-1458413' id='answer-label-1458413' class=' answer'><span>Use Amazon Managed Service for Apache Flink (previously known as Amazon Kinesis Data Analytics) to analyze the data by using multiple types of aggregations to perform time-based analytics over a window of up to 30 minutes.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-38' style=';'><div id='questionWrap-38'  class='   watupro-question-id-374706'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>38. <\/span>A company is planning to upgrade its Amazon Elastic Block Store (Amazon EBS) General Purpose SSD storage from gp2 to gp3. The company wants to prevent any interruptions in its Amazon EC2 instances that will cause data loss during the migration to the upgraded storage. <br \/>\r<br>Which solution will meet these requirements with the LEAST operational overhead?<\/div><input type='hidden' name='question_id[]' id='qID_38' value='374706' \/><input type='hidden' id='answerType374706' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374706[]' id='answer-id-1458414' class='answer   answerof-374706 ' value='1458414'   \/><label for='answer-id-1458414' id='answer-label-1458414' class=' answer'><span>Create snapshots of the gp2 volumes. Create new gp3 volumes from the snapshots. Attach the new gp3 volumes to the EC2 instances.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374706[]' id='answer-id-1458415' class='answer   answerof-374706 ' value='1458415'   \/><label for='answer-id-1458415' id='answer-label-1458415' class=' answer'><span>Create new gp3 volumes. Gradually transfer the data to the new gp3 volumes. When the transfer is complete, mount the new gp3 volumes to the EC2 instances to replace the gp2 volumes.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374706[]' id='answer-id-1458416' class='answer   answerof-374706 ' value='1458416'   \/><label for='answer-id-1458416' id='answer-label-1458416' class=' answer'><span>Change the volume type of the existing gp2 volumes to gp3. Enter new values for volume size, IOPS, and throughput.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374706[]' id='answer-id-1458417' class='answer   answerof-374706 ' value='1458417'   \/><label for='answer-id-1458417' id='answer-label-1458417' class=' answer'><span>Use AWS DataSync to create new gp3 volumes. Transfer the data from the original gp2 volumes to the new gp3 volumes.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-39' style=';'><div id='questionWrap-39'  class='   watupro-question-id-374707'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>39. <\/span>A company is migrating its database servers from Amazon EC2 instances that run Microsoft SQL Server to Amazon RDS for Microsoft SQL Server DB instances. The company's analytics team must export large data elements every day until the migration is complete. The data elements are the result of SQL joins across multiple tables. The data must be in Apache Parquet format. The analytics team must store the data in Amazon S3. <br \/>\r<br>Which solution will meet these requirements in the MOST operationally efficient way?<\/div><input type='hidden' name='question_id[]' id='qID_39' value='374707' \/><input type='hidden' id='answerType374707' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374707[]' id='answer-id-1458418' class='answer   answerof-374707 ' value='1458418'   \/><label for='answer-id-1458418' id='answer-label-1458418' class=' answer'><span>Create a view in the EC2 instance-based SQL Server databases that contains the required data elements. Create an AWS Glue job that selects the data directly from the view and transfers the data in Parquet format to an S3 bucket. Schedule the AWS Glue job to run every day.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374707[]' id='answer-id-1458419' class='answer   answerof-374707 ' value='1458419'   \/><label for='answer-id-1458419' id='answer-label-1458419' class=' answer'><span>Schedule SQL Server Agent to run a daily SQL query that selects the desired data elements from the EC2 instance-based SQL Server databases. Configure the query to direct the output .csv objects to an S3 bucket. Create an S3 event that invokes an AWS Lambda function to transform the output format from .csv to Parquet.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374707[]' id='answer-id-1458420' class='answer   answerof-374707 ' value='1458420'   \/><label for='answer-id-1458420' id='answer-label-1458420' class=' answer'><span>Use a SQL query to create a view in the EC2 instance-based SQL Server databases that contains the required data elements. Create and run an AWS Glue crawler to read the view. Create an AWS Glue job that retrieves the data and transfers the data in Parquet format to an S3 bucket. Schedule the AWS Glue job to run every day.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374707[]' id='answer-id-1458421' class='answer   answerof-374707 ' value='1458421'   \/><label for='answer-id-1458421' id='answer-label-1458421' class=' answer'><span>Create an AWS Lambda function that queries the EC2 instance-based databases by using Java Database Connectivity (JDBC). Configure the Lambda function to retrieve the required data, transform the data into Parquet format, and transfer the data into an S3 bucket. Use Amazon EventBridge to schedule the Lambda function to run every day.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-40' style=';'><div id='questionWrap-40'  class='   watupro-question-id-374708'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>40. <\/span>A data engineering team is using an Amazon Redshift data warehouse for operational reporting. The team wants to prevent performance issues that might result from long- running queries. A data engineer must choose a system table in Amazon Redshift to record anomalies when a query optimizer identifies conditions that might indicate performance issues. <br \/>\r<br>Which table views should the data engineer use to meet this requirement?<\/div><input type='hidden' name='question_id[]' id='qID_40' value='374708' \/><input type='hidden' id='answerType374708' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374708[]' id='answer-id-1458422' class='answer   answerof-374708 ' value='1458422'   \/><label for='answer-id-1458422' id='answer-label-1458422' class=' answer'><span>STL USAGE CONTROL<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374708[]' id='answer-id-1458423' class='answer   answerof-374708 ' value='1458423'   \/><label for='answer-id-1458423' id='answer-label-1458423' class=' answer'><span>STL ALERT EVENT LOG<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374708[]' id='answer-id-1458424' class='answer   answerof-374708 ' value='1458424'   \/><label for='answer-id-1458424' id='answer-label-1458424' class=' answer'><span>STL QUERY METRICS<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374708[]' id='answer-id-1458425' class='answer   answerof-374708 ' value='1458425'   \/><label for='answer-id-1458425' id='answer-label-1458425' class=' answer'><span>STL PLAN INFO<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-41' style=';'><div id='questionWrap-41'  class='   watupro-question-id-374709'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>41. <\/span>A data engineer must ingest a source of structured data that is in .csv format into an Amazon S3 data lake. The .csv files contain 15 columns. Data analysts need to run Amazon Athena queries on one or two columns of the dataset. The data analysts rarely query the entire file. <br \/>\r<br>Which solution will meet these requirements MOST cost-effectively?<\/div><input type='hidden' name='question_id[]' id='qID_41' value='374709' \/><input type='hidden' id='answerType374709' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374709[]' id='answer-id-1458426' class='answer   answerof-374709 ' value='1458426'   \/><label for='answer-id-1458426' id='answer-label-1458426' class=' answer'><span>Use an AWS Glue PySpark job to ingest the source data into the data lake in .csv format.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374709[]' id='answer-id-1458427' class='answer   answerof-374709 ' value='1458427'   \/><label for='answer-id-1458427' id='answer-label-1458427' class=' answer'><span>Create an AWS Glue extract, transform, and load (ETL) job to read from the .csv structured data source. Configure the job to ingest the data into the data lake in JSON format.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374709[]' id='answer-id-1458428' class='answer   answerof-374709 ' value='1458428'   \/><label for='answer-id-1458428' id='answer-label-1458428' class=' answer'><span>Use an AWS Glue PySpark job to ingest the source data into the data lake in Apache Avro format.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374709[]' id='answer-id-1458429' class='answer   answerof-374709 ' value='1458429'   \/><label for='answer-id-1458429' id='answer-label-1458429' class=' answer'><span>Create an AWS Glue extract, transform, and load (ETL) job to read from the .csv structured data source. Configure the job to write the data into the data lake in Apache Parquet format.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-42' style=';'><div id='questionWrap-42'  class='   watupro-question-id-374710'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>42. <\/span>A company has five offices in different AWS Regions. Each office has its own human resources (HR) department that uses a unique IAM role. The company stores employee records in a data lake that is based on Amazon S3 storage. <br \/>\r<br>A data engineering team needs to limit access to the records. Each HR department should be able to access records for only employees who are within the HR department's Region. <br \/>\r<br>Which combination of steps should the data engineering team take to meet this requirement with the LEAST operational overhead? (Choose two.) <br \/>\r<br>A. Use data filters for each Region to register the S3 paths as data locations. <br \/>\r<br>B. Register the S3 path as an AWS Lake Formation location. <br \/>\r<br>C. Modify the IAM roles of the HR departments to add a data filter for each department's Region. <br \/>\r<br>D. Enable fine-grained access control in AWS Lake Formation. Add a data filter for each Region. <br \/>\r<br>E. Create a separate S3 bucket for each Region. Configure an IAM policy to allow S3 access. Restrict access based on Region.<\/div><input type='hidden' name='question_id[]' id='qID_42' value='374710' \/><input type='hidden' id='answerType374710' value='textarea'><!-- end question-content--><\/div><div class='question-choices '><p><textarea name='answer-374710[]' id='textarea_q_374710' class='watupro-textarea-medium' rows='5' cols='80'><\/textarea>\n<\/p><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-43' style=';'><div id='questionWrap-43'  class='   watupro-question-id-374711'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>43. <\/span>A company uses AWS Step Functions to orchestrate a data pipeline. The pipeline consists of Amazon EMR jobs that ingest data from data sources and store the data in an Amazon S3 bucket. The pipeline also includes EMR jobs that load the data to Amazon Redshift. <br \/>\r<br>The company's cloud infrastructure team manually built a Step Functions state machine. The cloud infrastructure team launched an EMR cluster into a VPC to support the EMR jobs. However, the deployed Step Functions state machine is not able to run the EMR jobs. <br \/>\r<br>Which combination of steps should the company take to identify the reason the Step Functions state machine is not able to run the EMR jobs? (Choose two.)<\/div><input type='hidden' name='question_id[]' id='qID_43' value='374711' \/><input type='hidden' id='answerType374711' value='checkbox'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374711[]' id='answer-id-1458431' class='answer   answerof-374711 ' value='1458431'   \/><label for='answer-id-1458431' id='answer-label-1458431' class=' answer'><span>Use AWS CloudFormation to automate the Step Functions state machine deployment. Create a step to pause the state machine during the EMR jobs that fail. Configure the step to wait for a human user to send approval through an email message. Include details of the EMR task in the email message for further analysis.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374711[]' id='answer-id-1458432' class='answer   answerof-374711 ' value='1458432'   \/><label for='answer-id-1458432' id='answer-label-1458432' class=' answer'><span>Verify that the Step Functions state machine code has all IAM permissions that are necessary to create and run the EMR jobs. Verify that the Step Functions state machine code also includes IAM permissions to access the Amazon S3 buckets that the EMR jobs use. Use Access Analyzer for S3 to check the S3 access properties.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374711[]' id='answer-id-1458433' class='answer   answerof-374711 ' value='1458433'   \/><label for='answer-id-1458433' id='answer-label-1458433' class=' answer'><span>Check for entries in Amazon CloudWatch for the newly created EMR cluster. Change the AWS Step Functions state machine code to use Amazon EMR on EK<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374711[]' id='answer-id-1458434' class='answer   answerof-374711 ' value='1458434'   \/><label for='answer-id-1458434' id='answer-label-1458434' class=' answer'><span>Change the IAM access policies and the security group configuration for the Step Functions state machine code to reflect inclusion of Amazon Elastic Kubernetes Service (Amazon EKS).<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374711[]' id='answer-id-1458435' class='answer   answerof-374711 ' value='1458435'   \/><label for='answer-id-1458435' id='answer-label-1458435' class=' answer'><span>Query the flow logs for the VP<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374711[]' id='answer-id-1458436' class='answer   answerof-374711 ' value='1458436'   \/><label for='answer-id-1458436' id='answer-label-1458436' class=' answer'><span>Determine whether the traffic that originates from the EMR cluster can successfully reach the data providers. Determine whether any security group that might be attached to the Amazon EMR cluster allows connections to the data source servers on the informed ports.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='checkbox' name='answer-374711[]' id='answer-id-1458437' class='answer   answerof-374711 ' value='1458437'   \/><label for='answer-id-1458437' id='answer-label-1458437' class=' answer'><span>Check the retry scenarios that the company configured for the EMR jobs. Increase the number of seconds in the interval between each EMR task. Validate that each fallback state has the appropriate catch for each decision state. Configure an Amazon Simple Notification Service (Amazon SNS) topic to store the error messages.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-44' style=';'><div id='questionWrap-44'  class='   watupro-question-id-374712'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>44. <\/span>A company is developing an application that runs on Amazon EC2 instances. Currently, the data that the application generates is temporary. However, the company needs to persist the data, even if the EC2 instances are terminated. <br \/>\r<br>A data engineer must launch new EC2 instances from an Amazon Machine Image (AMI) and configure the instances to preserve the data. <br \/>\r<br>Which solution will meet this requirement?<\/div><input type='hidden' name='question_id[]' id='qID_44' value='374712' \/><input type='hidden' id='answerType374712' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374712[]' id='answer-id-1458438' class='answer   answerof-374712 ' value='1458438'   \/><label for='answer-id-1458438' id='answer-label-1458438' class=' answer'><span>Launch new EC2 instances by using an AMI that is backed by an EC2 instance store volume that contains the application data. Apply the default settings to the EC2 instances.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374712[]' id='answer-id-1458439' class='answer   answerof-374712 ' value='1458439'   \/><label for='answer-id-1458439' id='answer-label-1458439' class=' answer'><span>Launch new EC2 instances by using an AMI that is backed by a root Amazon Elastic Block Store (Amazon EBS) volume that contains the application data. Apply the default settings to the EC2 instances.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374712[]' id='answer-id-1458440' class='answer   answerof-374712 ' value='1458440'   \/><label for='answer-id-1458440' id='answer-label-1458440' class=' answer'><span>Launch new EC2 instances by using an AMI that is backed by an EC2 instance store volume. Attach an Amazon Elastic Block Store (Amazon EBS) volume to contain the application data. Apply the default settings to the EC2 instances.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374712[]' id='answer-id-1458441' class='answer   answerof-374712 ' value='1458441'   \/><label for='answer-id-1458441' id='answer-label-1458441' class=' answer'><span>Launch new EC2 instances by using an AMI that is backed by an Amazon Elastic Block Store (Amazon EBS) volume. Attach an additional EC2 instance store volume to contain the application data. Apply the default settings to the EC2 instances.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-45' style=';'><div id='questionWrap-45'  class='   watupro-question-id-374713'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>45. <\/span>A company uses Amazon Athena to run SQL queries for extract, transform, and load (ETL) tasks by using Create Table As Select (CTAS). The company must use Apache Spark instead of SQL to generate analytics. <br \/>\r<br>Which solution will give the company the ability to use Spark to access Athena?<\/div><input type='hidden' name='question_id[]' id='qID_45' value='374713' \/><input type='hidden' id='answerType374713' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374713[]' id='answer-id-1458442' class='answer   answerof-374713 ' value='1458442'   \/><label for='answer-id-1458442' id='answer-label-1458442' class=' answer'><span>Athena query settings<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374713[]' id='answer-id-1458443' class='answer   answerof-374713 ' value='1458443'   \/><label for='answer-id-1458443' id='answer-label-1458443' class=' answer'><span>Athena workgroup<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374713[]' id='answer-id-1458444' class='answer   answerof-374713 ' value='1458444'   \/><label for='answer-id-1458444' id='answer-label-1458444' class=' answer'><span>Athena data source<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374713[]' id='answer-id-1458445' class='answer   answerof-374713 ' value='1458445'   \/><label for='answer-id-1458445' id='answer-label-1458445' class=' answer'><span>Athena query editor<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-46' style=';'><div id='questionWrap-46'  class='   watupro-question-id-374714'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>46. <\/span>A company needs to partition the Amazon S3 storage that the company uses for a data lake. <br \/>\r<br>The partitioning will use a path of the S3 object keys in the following format: s3:\/\/bucket\/prefix\/year=2023\/month=01\/day=01. <br \/>\r<br>A data engineer must ensure that the AWS Glue Data Catalog synchronizes with the S3 storage when the company adds new partitions to the bucket. <br \/>\r<br>Which solution will meet these requirements with the LEAST latency?<\/div><input type='hidden' name='question_id[]' id='qID_46' value='374714' \/><input type='hidden' id='answerType374714' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374714[]' id='answer-id-1458446' class='answer   answerof-374714 ' value='1458446'   \/><label for='answer-id-1458446' id='answer-label-1458446' class=' answer'><span>Schedule an AWS Glue crawler to run every morning.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374714[]' id='answer-id-1458447' class='answer   answerof-374714 ' value='1458447'   \/><label for='answer-id-1458447' id='answer-label-1458447' class=' answer'><span>Manually run the AWS Glue CreatePartition API twice each day.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374714[]' id='answer-id-1458448' class='answer   answerof-374714 ' value='1458448'   \/><label for='answer-id-1458448' id='answer-label-1458448' class=' answer'><span>Use code that writes data to Amazon S3 to invoke the Boto3 AWS Glue create partition API call.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374714[]' id='answer-id-1458449' class='answer   answerof-374714 ' value='1458449'   \/><label for='answer-id-1458449' id='answer-label-1458449' class=' answer'><span>Run the MSCK REPAIR TABLE command from the AWS Glue console.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-47' style=';'><div id='questionWrap-47'  class='   watupro-question-id-374715'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>47. <\/span>A media company uses software as a service (SaaS) applications to gather data by using third-party tools. The company needs to store the data in an Amazon S3 bucket. The company will use Amazon Redshift to perform analytics based on the data. <br \/>\r<br>Which AWS service or feature will meet these requirements with the LEAST operational overhead?<\/div><input type='hidden' name='question_id[]' id='qID_47' value='374715' \/><input type='hidden' id='answerType374715' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374715[]' id='answer-id-1458450' class='answer   answerof-374715 ' value='1458450'   \/><label for='answer-id-1458450' id='answer-label-1458450' class=' answer'><span>Amazon Managed Streaming for Apache Kafka (Amazon MSK)<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374715[]' id='answer-id-1458451' class='answer   answerof-374715 ' value='1458451'   \/><label for='answer-id-1458451' id='answer-label-1458451' class=' answer'><span>Amazon AppFlow<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374715[]' id='answer-id-1458452' class='answer   answerof-374715 ' value='1458452'   \/><label for='answer-id-1458452' id='answer-label-1458452' class=' answer'><span>AWS Glue Data Catalog<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374715[]' id='answer-id-1458453' class='answer   answerof-374715 ' value='1458453'   \/><label for='answer-id-1458453' id='answer-label-1458453' class=' answer'><span>Amazon Kinesis<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-48' style=';'><div id='questionWrap-48'  class='   watupro-question-id-374716'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>48. <\/span>A data engineer is using Amazon Athena to analyze sales data that is in Amazon S3. The data engineer writes a query to retrieve sales amounts for 2023 for several products from a table named sales_data. However, the query does not return results for all of the products that are in the sales_data table. <br \/>\r<br>The data engineer needs to troubleshoot the query to resolve the issue. <br \/>\r<br>The data engineer's original query is as follows: <br \/>\r<br>SELECT product_name, sum(sales_amount) <br \/>\r<br>FROM sales_data <br \/>\r<br>WHERE year = 2023 <br \/>\r<br>GROUP BY product_name <br \/>\r<br>How should the data engineer modify the Athena query to meet these requirements?<\/div><input type='hidden' name='question_id[]' id='qID_48' value='374716' \/><input type='hidden' id='answerType374716' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374716[]' id='answer-id-1458454' class='answer   answerof-374716 ' value='1458454'   \/><label for='answer-id-1458454' id='answer-label-1458454' class=' answer'><span>Replace sum(sales amount) with count(*J for the aggregation.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374716[]' id='answer-id-1458455' class='answer   answerof-374716 ' value='1458455'   \/><label for='answer-id-1458455' id='answer-label-1458455' class=' answer'><span>Change WHERE year = 2023 to WHERE extractlyear FROM sales data) = 2023.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374716[]' id='answer-id-1458456' class='answer   answerof-374716 ' value='1458456'   \/><label for='answer-id-1458456' id='answer-label-1458456' class=' answer'><span>Add HAVING sumfsales amount) &gt; 0 after the GROUP BY clause.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374716[]' id='answer-id-1458457' class='answer   answerof-374716 ' value='1458457'   \/><label for='answer-id-1458457' id='answer-label-1458457' class=' answer'><span>Remove the GROUP BY clause<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-49' style=';'><div id='questionWrap-49'  class='   watupro-question-id-374717'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>49. <\/span>A data engineer has a one-time task to read data from objects that are in Apache Parquet format in an Amazon S3 bucket. The data engineer needs to query only one column of the data. <br \/>\r<br>Which solution will meet these requirements with the LEAST operational overhead?<\/div><input type='hidden' name='question_id[]' id='qID_49' value='374717' \/><input type='hidden' id='answerType374717' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374717[]' id='answer-id-1458458' class='answer   answerof-374717 ' value='1458458'   \/><label for='answer-id-1458458' id='answer-label-1458458' class=' answer'><span>Confiqure an AWS Lambda function to load data from the S3 bucket into a pandas dataframe-Write a SQL SELECT statement on the dataframe to query the required column.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374717[]' id='answer-id-1458459' class='answer   answerof-374717 ' value='1458459'   \/><label for='answer-id-1458459' id='answer-label-1458459' class=' answer'><span>Use S3 Select to write a SQL SELECT statement to retrieve the required column from the S3 objects.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374717[]' id='answer-id-1458460' class='answer   answerof-374717 ' value='1458460'   \/><label for='answer-id-1458460' id='answer-label-1458460' class=' answer'><span>Prepare an AWS Glue DataBrew project to consume the S3 objects and to query the required column.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374717[]' id='answer-id-1458461' class='answer   answerof-374717 ' value='1458461'   \/><label for='answer-id-1458461' id='answer-label-1458461' class=' answer'><span>Run an AWS Glue crawler on the S3 objects. Use a SQL SELECT statement in Amazon Athena to query the required column.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div class='watu-question ' id='question-50' style=';'><div id='questionWrap-50'  class='   watupro-question-id-374718'>\n\t\t\t<div class='question-content'><div><span class='watupro_num'>50. <\/span>A company uses Amazon Redshift for its data warehouse. The company must automate refresh schedules for Amazon Redshift materialized views. <br \/>\r<br>Which solution will meet this requirement with the LEAST effort?<\/div><input type='hidden' name='question_id[]' id='qID_50' value='374718' \/><input type='hidden' id='answerType374718' value='radio'><!-- end question-content--><\/div><div class='question-choices watupro-choices-columns '><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374718[]' id='answer-id-1458462' class='answer   answerof-374718 ' value='1458462'   \/><label for='answer-id-1458462' id='answer-label-1458462' class=' answer'><span>Use Apache Airflow to refresh the materialized views.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374718[]' id='answer-id-1458463' class='answer   answerof-374718 ' value='1458463'   \/><label for='answer-id-1458463' id='answer-label-1458463' class=' answer'><span>Use an AWS Lambda user-defined function (UDF) within Amazon Redshift to refresh the materialized views.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374718[]' id='answer-id-1458464' class='answer   answerof-374718 ' value='1458464'   \/><label for='answer-id-1458464' id='answer-label-1458464' class=' answer'><span>Use the query editor v2 in Amazon Redshift to refresh the materialized views.<\/span><\/label><\/div><div class='watupro-question-choice  ' dir='auto' ><input type='radio' name='answer-374718[]' id='answer-id-1458465' class='answer   answerof-374718 ' value='1458465'   \/><label for='answer-id-1458465' id='answer-label-1458465' class=' answer'><span>Use an AWS Glue workflow to refresh the materialized views.<\/span><\/label><\/div><!-- end question-choices--><\/div><!-- end questionWrap--><\/div><\/div><div style='display:none' id='question-51'>\n\t<div class='question-content'>\n\t\t<img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/www.dumpsbase.com\/freedumps\/wp-content\/plugins\/watupro\/img\/loading.gif\" width=\"16\" height=\"16\" alt=\"Loading...\" title=\"Loading...\" \/>&nbsp;Loading...\t<\/div>\n<\/div>\n\n<br \/>\n\t\n\t\t\t<div class=\"watupro_buttons flex \" id=\"watuPROButtons9379\" >\n\t\t  <div id=\"prev-question\" style=\"display:none;\"><input type=\"button\" value=\"&lt; Previous\" onclick=\"WatuPRO.nextQuestion(event, 'previous');\"\/><\/div>\t\t  \t\t  \t\t   \n\t\t   \t  \t\t<div><input type=\"button\" name=\"action\" class=\"watupro-submit-button\" onclick=\"WatuPRO.submitResult(event)\" id=\"action-button\" value=\"View Results\"  \/>\n\t\t<\/div>\n\t\t<\/div>\n\t\t\n\t<input type=\"hidden\" name=\"quiz_id\" value=\"9379\" id=\"watuPROExamID\"\/>\n\t<input type=\"hidden\" name=\"start_time\" id=\"startTime\" value=\"2026-05-05 18:35:16\" \/>\n\t<input type=\"hidden\" name=\"start_timestamp\" id=\"startTimeStamp\" value=\"1778006116\" \/>\n\t<input type=\"hidden\" name=\"question_ids\" value=\"\" \/>\n\t<input type=\"hidden\" name=\"watupro_questions\" value=\"374669:1458257,1458258,1458259,1458260 | 374670:1458261,1458262,1458263,1458264 | 374671:1458265,1458266,1458267,1458268 | 374672:1458269,1458270,1458271,1458272,1458273 | 374673:1458274,1458275,1458276,1458277 | 374674:1458278,1458279,1458280,1458281 | 374675:1458282,1458283,1458284,1458285 | 374676:1458286,1458287,1458288,1458289 | 374677:1458290,1458291,1458292,1458293 | 374678:1458294,1458295,1458296,1458297 | 374679:1458298,1458299,1458300,1458301 | 374680:1458302 | 374681:1458303,1458304,1458305,1458306 | 374682:1458307,1458308,1458309,1458310 | 374683:1458311,1458312,1458313,1458314 | 374684:1458315,1458316,1458317,1458318 | 374685:1458319,1458320,1458321,1458322,1458323 | 374686:1458324,1458325,1458326,1458327 | 374687:1458328,1458329,1458330,1458331,1458332 | 374688:1458333,1458334,1458335,1458336 | 374689:1458337,1458338,1458339,1458340,1458341,1458342,1458343,1458344,1458345,1458346,1458347 | 374690:1458348,1458349,1458350,1458351 | 374691:1458352,1458353,1458354,1458355 | 374692:1458356,1458357,1458358,1458359 | 374693:1458360,1458361,1458362,1458363 | 374694:1458364,1458365,1458366,1458367,1458368 | 374695:1458369,1458370,1458371,1458372 | 374696:1458373,1458374,1458375,1458376,1458377 | 374697:1458378,1458379,1458380,1458381 | 374698:1458382,1458383,1458384,1458385 | 374699:1458386,1458387,1458388,1458389,1458390 | 374700:1458391 | 374701:1458392,1458393,1458394,1458395 | 374702:1458396,1458397,1458398,1458399 | 374703:1458400,1458401,1458402,1458403,1458404 | 374704:1458405,1458406,1458407,1458408,1458409 | 374705:1458410,1458411,1458412,1458413 | 374706:1458414,1458415,1458416,1458417 | 374707:1458418,1458419,1458420,1458421 | 374708:1458422,1458423,1458424,1458425 | 374709:1458426,1458427,1458428,1458429 | 374710:1458430 | 374711:1458431,1458432,1458433,1458434,1458435,1458436,1458437 | 374712:1458438,1458439,1458440,1458441 | 374713:1458442,1458443,1458444,1458445 | 374714:1458446,1458447,1458448,1458449 | 374715:1458450,1458451,1458452,1458453 | 374716:1458454,1458455,1458456,1458457 | 374717:1458458,1458459,1458460,1458461 | 374718:1458462,1458463,1458464,1458465\" \/>\n\t<input type=\"hidden\" name=\"no_ajax\" value=\"0\">\t\t\t<\/form>\n\t<p>&nbsp;<\/p>\n<\/div>\n\n<script type=\"text\/javascript\">\n\/\/jQuery(document).ready(function(){\ndocument.addEventListener(\"DOMContentLoaded\", function(event) { \t\nvar question_ids = \"374669,374670,374671,374672,374673,374674,374675,374676,374677,374678,374679,374680,374681,374682,374683,374684,374685,374686,374687,374688,374689,374690,374691,374692,374693,374694,374695,374696,374697,374698,374699,374700,374701,374702,374703,374704,374705,374706,374707,374708,374709,374710,374711,374712,374713,374714,374715,374716,374717,374718\";\nWatuPROSettings[9379] = {};\nWatuPRO.qArr = question_ids.split(',');\nWatuPRO.exam_id = 9379;\t    \nWatuPRO.post_id = 96045;\nWatuPRO.store_progress = 0;\nWatuPRO.curCatPage = 1;\nWatuPRO.requiredIDs=\"0\".split(\",\");\nWatuPRO.hAppID = \"0.62199000 1778006116\";\nvar url = \"https:\/\/www.dumpsbase.com\/freedumps\/wp-content\/plugins\/watupro\/show_exam.php\";\nWatuPRO.examMode = 1;\nWatuPRO.siteURL=\"https:\/\/www.dumpsbase.com\/freedumps\/wp-admin\/admin-ajax.php\";\nWatuPRO.emailIsNotRequired = 0;\nWatuPROIntel.init(9379);\nWatuPRO.inCategoryPages=1;});    \t \n<\/script>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The AWS Certified Data Engineer &#8211; Associate certification demonstrates your ability to design data models, manage data lifecycles, and ensure data quality. This valuable Amazon Web Services (AWS) certification validates your expertise in core data-related AWS services, including data ingestion, transformation, and pipeline orchestration with programming concepts. Acquiring the latest study materials is a great [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[175,18249],"tags":[18250,18251],"class_list":["post-96045","post","type-post","status-publish","format-standard","hentry","category-amazon","category-data-engineer-associate","tag-amazon-dea-c01-dumps","tag-aws-certified-data-engineer-associate"],"_links":{"self":[{"href":"https:\/\/www.dumpsbase.com\/freedumps\/wp-json\/wp\/v2\/posts\/96045","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.dumpsbase.com\/freedumps\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.dumpsbase.com\/freedumps\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.dumpsbase.com\/freedumps\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.dumpsbase.com\/freedumps\/wp-json\/wp\/v2\/comments?post=96045"}],"version-history":[{"count":1,"href":"https:\/\/www.dumpsbase.com\/freedumps\/wp-json\/wp\/v2\/posts\/96045\/revisions"}],"predecessor-version":[{"id":96048,"href":"https:\/\/www.dumpsbase.com\/freedumps\/wp-json\/wp\/v2\/posts\/96045\/revisions\/96048"}],"wp:attachment":[{"href":"https:\/\/www.dumpsbase.com\/freedumps\/wp-json\/wp\/v2\/media?parent=96045"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.dumpsbase.com\/freedumps\/wp-json\/wp\/v2\/categories?post=96045"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.dumpsbase.com\/freedumps\/wp-json\/wp\/v2\/tags?post=96045"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}