All Projects
Machine LearningCase Study·84% Match

Taxi Data Prediction Dashboard — Snowflake + Snowpark

Taxi demand analytics platform using Snowflake as both the data warehouse and compute engine. Snowpark Python UDFs run ML transformations directly in Snowflake. Jupyter notebooks (notebooks/) handle EDA; SQL scripts (sqls/) define transformations; Python src/ manages the pipeline. Streamlit Cloud hosts the interactive dashboard.

SnowflakeSnowparkMLflowPythonStreamlit
LiveSnowflakeWarehouse + ComputeSnowparkIn-DB PythonMLflowTrackingStreamlitDashboard
Snowflake
Warehouse + Compute
Snowpark
In-DB Python
MLflow
Tracking
Streamlit
Dashboard
🔴 The Problem

ETL pipelines and ML transformations ran outside the warehouse, duplicating data movement

No single source of truth for both analytics and ML feature engineering

The Solution

Snowpark Python UDFs run ML transformations directly inside Snowflake — no data egress

Jupyter notebooks handle EDA; SQL scripts define transformations; Streamlit Cloud hosts dashboard

📈 Impact & Results

Zero data movement between warehouse and ML layer

End-to-end pipeline in one platform: ingest → transform → model → visualise

Streamlit dashboard connected live to Snowflake for real-time analytics

Full Tech Stack
SnowflakeSnowparkMLflowPythonStreamlit

More Projects

Interested in working together?
Let's build something impactful.