Join us for a Serverless ETL Meetup at the AWS NYC office in Herald Square, on February 21, from 4:00 to 6:00 PM EST. We'll cover three different topics:
Talk: Leveraging Athena Spark to go from 2 hours to 2 min.
In this talk I will give an overview of Spark & Athena Spark, followed by a demo of a long-running ETL job that I recently optimized from 2 hours down to 2 minutes. I will also cover how to make Athena Spark jobs production-ready using Boto3, the AWS SDK for Python.
Speaker: Elliott Cordo
Elliott is an expert in cloud-native data engineering, strategy & architecture, with a passion for helping organizations drive value from data. He has more than two decades of experience implementing cutting-edge, data-driven, cloud-native applications, and he helps organizations understand the true potential in their data by working as a leader, architect, & builder. He is the founder of Data Futures, a consultancy focused on cloud-native data modernization, & an AWS Data Hero.
Talk: Glue Job Automation
Like many organizations, David Yurman uses Glue to stage data from various sources. Instead of building hundreds of Glue jobs by hand, the team has built a configuration-based method for creating them. Parthiv will share a module they have built to automate staging job creation via YAML.
Speaker: Parthiv Khaund
Parthiv Khaund is a Lead Data Engineer at David Yurman, contributing to the Data Management & Engineering team. His primary focus is on developing & enhancing data pipelines and ensuring seamless integration with the central data warehouse, which supports advanced data analysis & decision-making across the organization. Prior to David Yurman, Parthiv honed his skills in the insurance sector.
Talk: Unifying data with AWS Glue
AWS Glue is a serverless ETL (extract, transform, load) tool that allows you to easily onboard & transform data via a graphical interface, notebook, or Python code. The underlying engine is Spark, which makes it both flexible (thanks to community contributions) & scalable. At Rightway, we leverage AWS Glue in various places to help us build data products rapidly. In this presentation, we will share how AWS Glue allowed us to combine multiple data stores to create dynamic reports, showcasing its flexibility & scalability through practical applications & recent successes.
Speaker: Marsha Ghose
Marsha is a data analyst at Rightway Healthcare, where she combines her talents & enthusiasm to drive innovation in the healthtech industry. Her experience is rooted in the healthtech domain, especially in pharmacy-related technologies, as demonstrated through her work with industry giants like CVS & the innovative startup Capsule. Marsha's diverse background, which encompasses software engineering & experience as a board-certified pharmacy technician, allows her to merge technical expertise with practical healthcare insights.