AWS Glue – A Fully Managed ETL Service
AWS Glue is a fully managed ETL service that makes it easy to move data between your data stores. It simplifies and automates the difficult, time-consuming tasks of data discovery, conversion, mapping, and job scheduling. Its easy-to-use console guides you through the process of moving your data: it helps you understand your data sources, prepare the data for analytics, and load it reliably from sources to destinations. In this talk, we provide an overview of AWS Glue and its differentiating features. We first describe the data catalog for discovering and organizing your data sets. We then show how AWS Glue makes job authoring easy through automatic script generation and by allowing users to work with their own developer tools. Finally, we cover its fully managed, serverless job orchestration and execution, which relieves users of configuration and resource management.
Sr. Software Manager, Amazon Web Services
Mehul currently leads the development of AWS Glue and, before that, managed the delivery of key features in Amazon Redshift. Prior to Amazon, his career spanned both research and industry. From 2011 to 2014, he was co-founder and CEO of Amiato, a startup that offered a cloud-based ETL service. From 2004 to 2011, he was a principal scientist at HP Labs, where his expertise spanned distributed systems and data management. He has published in top-tier conferences and journals and has been granted over 20 patents. His research has won best-paper and test-of-time awards. He received his MEng in 1997 and his BS in Computer Science and Physics in 1996, all from MIT.