Production50K+Records/day89%Accuracy120msP99 latency48hRetrain cycle★ Featured
🔴 The Problem
▸
Job seekers lacked real-time salary benchmarks
▸
Existing tools used stale data and simple regression models
✅ The Solution
▸
Kafka streams live salary data at 50K records/day
▸
Spark processes features at scale; MLflow tracks experiments
📈 Impact & Results
▸
89% accuracy within $5K range on holdout set
▸
Retraining cycle: 2 weeks → 48 hours
▸
Sub-120ms prediction latency at p99
Full Tech Stack
PythonSnowflakePySparkKafkaAirflowRedisFlaskReactDockerPostgreSQLAWS EC2Firebase
More Projects
Interested in working together?
Let's build something impactful.