Experience
Senior Data Engineer
LTIMindtree - Pune, India
May 2023 – Mar 2025
- Developed and optimized near real-time data pipelines to seamlessly migrate on-premises traditional databases to a centralized BigQuery platform, facilitating efficient analytical workloads and insights. Processed approximately 10 million records.
- Designed and implemented an event-driven pipeline to process and load complex, inconsistent data from multiple Excel sheets into GCP, reducing processing time from a week to under 10 minutes with parallel processing of 30 GB.
- Established orchestration using Google Workflow to ensure smooth execution and management of data pipelines.
Data Engineer
AI Adventures - Pune, India
Aug 2021 – Apr 2023
- Led migration of course data from Moodle to MySQL, enabling business intelligence views that drove a 20% increase in revenue.
- Expanded insights into sales, student retention, and curriculum performance, improving data-informed decisions across the organization.
Data Engineer
STL - Pune, India
Jun 2018 – May 2021
- Built near real-time data pipelines to migrate on-prem databases (Oracle, SAP and Salesforce) to BigQuery, enabling efficient analytics and processing approximately 1 TB of data.
- Provided Tableau reports and data-driven insights to support business functions, including scrap detection and DOE analysis, driving process improvements and informed decision making.
- Collaborated with cross-functional teams to ensure effective data usage, driving successful operational outcomes, and achieve business objectives.
Education
Projects
Vital Watch
Real-time health monitoring system that tracks vital signs and provides early warning alerts for critical health conditions.
Tech Stack: Python, Kafka (Confluent), KSQL, Twilio, ElevenLabs
AI Governance
Chrome extension + local Ollama backend to detect and redact sensitive data in LLM prompts before submission.
Tech Stack: JavaScript, Chrome Extension, OpenAI, Ollama
Smart Study Companion
AI-powered learning assistant that adapts to each student’s pace, tone, and skill level with interactive conversations.
Tech Stack: Python, LangChain
Blogs
Deep Learning’s Blind Spot: The Table Problem
Exploring why deep learning models struggle with tabular data and when traditional ML methods still outperform neural networks.
Catching Risk Early: A Recall-First Lung Cancer Classifier
Building a lung cancer classifier using CRISP-DM methodology and Scikit-learn, prioritizing recall to minimize missed diagnoses.
What I Learned Today
Daily learnings and notes from my journey in AI/ML, data science, and software engineering. Each note captures a concept, technique, or insight from my studies.






