From the course: AWS Certified Data Analytics – Specialty (DAS-C01) Cert Prep: 1 Collection

Unlock this course with a free trial

Join today to access over 23,200 courses taught by industry experts.

Transform data in transit

Transform data in transit

- [Presenter] Transforming data in transit on the AWS platform has a great example in AWS Glue. Let's take a look at this diagram from the beginning to the end. First, you would trigger, via timer, a Glue operation. Glue is a serverless ETL system. What that means is that you can point it to multiple data sources, so for example, if you had a big file in S3 that was maybe a CSV file that was millions of rows, then you also had an external PostgreSQL database, you could tie those two together, create an ETL, extract transform load operation, and then transform that data and put it into another bucket in S3, right? So you've got this great reusable operation that is constantly making new versions of the data, right? 'Cause this external database could be updated by some vendor that you're dealing with. And what's great is that this trigger that runs periodically will transform it and put it into a new bucket. Now…

Contents