
Featured Projects:
A selection of data engineering and analytics work
Airflights
An end-to-end ETL pipeline built with Python, Serpapi, and Delta Lake that ingests raw Google Flights data, transforms it through a medallion architecture, and produces aggregated flight intelligence for trip planning.
| Bronze Layer | SILVER Layer | GOLD Layer |
|---|---|---|
| Extracting and storing raw flight data to a Delta table in Databricks. | This layer processes raw flight data from Bronze tables into cleaned, curated Silver tables. |
IBM Stocks
| Bronze Layer | Silver Layer | Dashboard – Tableau |
|---|---|---|
| Build an incremental data ingestion pipeline for IBM stock datasets using Databricks LakeHouse architecture. | Implemented a Medallion ETL architecture (Bronze, Silver, Gold) for clean stock data. | Built interactive dashboards in Tableau to visualize IBM stock performance and market trends. |
IT Survey
| BRONZE Layer | SILVER Layer | Analysis – EDA |
|---|---|---|
| Applied Medallion architecture to process and transform raw IT survey data. | Conducted in-depth Exploratory Data Analysis (EDA) on IT surveys. |
Medallion ETL – Dashboard – EDA
| Crypto – Medallion ETL | Uber Drive | Cat Breed |
|---|---|---|
| Processed cryptocurrency market data through a Medallion ETL pipeline. | Analyzed Uber ride data to improve operational efficiency metrics. DASHBOARD |
