Building enterprise-scale data platforms, real-time pipelines, and SIEM detection systems at TCS. Passionate about clean data, intelligent systems, and research that matters.
End-to-end real-time and batch Lakehouse for a multi-location restaurant chain. Medallion architecture (Bronze/Silver/Gold) on Delta Lake, with CDC ingestion from Azure SQL, real-time streaming via Event Hubs, a Galaxy schema for KPIs, and AI/BI dashboards with sentiment analysis powered by Databricks Mosaic AI.
Cloud pipeline processing 10M+ records with automated ELT, star schema modeling, Streamlit dashboards, and a data quality framework that reduced inconsistencies by 95%. Dynamic tables and aggregated views improved query performance by 80%.
Symptom-based disease diagnosis using LLMs, RAG, Neo4j Knowledge Graphs, and Milvus VectorDB. Achieved 97% evaluation performance with significantly reduced hallucination rates.
Looking for opportunities and collaborations in AI/ML · Data Engineering · Cloud (AWS, Azure) · Innovation & Research. Feel free to reach out through any channel below.