Passionate AI/ML Engineer specializing in neural architecture design, transformer models, and production ML systems. Expert in developing machine learning models, building scalable data pipelines, and deploying AI solutions that learn and adapt.
Currently pursuing B.Tech in Data Science and AI at IIT Bhilai, where I serve as Coordinator of the DSAI Club.
My mission: reducing loss functions in both code and real-world problems.
B.Tech. in Data Science and AI
Aug 2023 – May 2027 (Expected)
Relevant Coursework: Data Structures & Algorithms, ML, NLP, Deep Learning, DBMS, Statistics, OS
Deep Mutual Learning Library
Production-ready PyTorch library for Deep Mutual Learning (DML) and collaborative neural network training. Implements state-of-the-art techniques where multiple neural networks train together and learn from each other's predictions, achieving 2-5% higher accuracy compared to independent training.
pip install pytorch-dml
from pydml import DMLTrainer
from torchvision import models
models = [models.resnet18(), models.resnet18()]
trainer = DMLTrainer(models, device='cuda')
trainer.fit(train_loader, val_loader, epochs=100)
Stack: GPT-4 • LangGraph • CRAG • ChromaDB • FastAPI • Streamlit
Built a production-grade multi-agent medical RAG system using GPT-4, LangGraph, and CRAG with hybrid BM25+vector retrieval, reducing hallucinations to <10% (RAGAS) and improving retrieval accuracy by 35%.
Git Repository →Stack: Ensemble ML • SHAP • Streamlit • MLflow
Engineered v2 of Kaggle Student Performance Enhancer with a modular ML pipeline; optimized ensemble models (R² 0.989), integrated SHAP explainability, deployed Streamlit app, and enabled MLflow tracking with versioned models.
Git Repository →Innovation: Memory-Efficient KV Cache (60-80% reduction)
Implemented block-based attention mechanism with Copy-on-Write for beam search. Optimized inference pipeline achieving 2x throughput on large language models.
Git Repository →Course: Machine Learning (CS 550)
Designed a modular experiment framework combining PPLM steering, a lightweight RLHF proxy (reward-model + supervised fine-tune), and hybrid inference-time control for generating context-aware poetry. Built reproducible pipelines for training, evaluation, and batch sweeps with aggregated metrics such as reward, perplexity, and distinctness.
View Repository →Course: Artificial Intelligence (CSL304)
Built a full Bomberman game in Python with AI-driven bot agents capable of strategic movement, bomb placement, and real-time decision-making. Implemented A* pathfinding, state-based behavior (search, chase, evade), bomb safety simulation, and intelligent evasion scoring to create adaptive, challenging opponents.
View Repository →Led workshops, study groups, and hands-on ML sessions; coordinated events and mentoring to promote applied ML, Kaggle competitions, and research-oriented projects.
Led a team to build a geographic intelligence app that predicts optimal hospital placement by analyzing proximity to existing healthcare facilities, demographics, and regional infrastructure data.