About This Project
This page is a live demonstration of cloud engineering and data pipeline design. An automated ETL pipeline—built with Python on AWS Lambda—authenticates with the Spotify Web API, extracts my listening data (top artists, albums, recently played tracks, genre breakdowns, and new releases), transforms the raw JSON into structured datasets, and loads the results into an S3 bucket. The pipeline runs on a scheduled EventBridge trigger, so the data you see here is always current without any manual intervention.
The entire infrastructure is provisioned and managed through modular Terraform: S3 for storage, CloudFront with TLS for content delivery, Route 53 for DNS, API Gateway for the backend, and IAM policies scoped to least-privilege access. Every component—from the Lambda execution role to the CloudFront cache behavior—is defined as code, version-controlled, and reproducible.
Tech stack: Python · AWS Lambda · S3 · CloudFront · EventBridge · Secrets Manager · Terraform · Spotify Web API
Want to see your Spotify data? Connect your account →