Passionate AI/ML Engineer specializing in neural architecture design, transformer models, and production ML systems. Expert in developing machine learning models, building scalable data pipelines, and deploying AI solutions that learn and adapt.
Currently pursuing B.Tech in Data Science and AI at IIT Bhilai, where I serve as Coordinator of the DSAI Club.
My mission: reducing loss functions in both code and real-world problems.
B.Tech. in Data Science and AI
Aug 2023 – May 2027 (Expected)
Relevant Coursework: Data Structures & Algorithms, ML, NLP, DBMS, Statistics, Probability, OS
Deep Mutual Learning Library
Production-ready PyTorch library for Deep Mutual Learning (DML) and collaborative neural network training. Implements state-of-the-art techniques where multiple neural networks train together and learn from each other's predictions, achieving 2-5% higher accuracy compared to independent training.
pip install pytorch-dml
from pydml import DMLTrainer
from torchvision import models
models = [models.resnet18(), models.resnet18()]
trainer = DMLTrainer(models, device='cuda')
trainer.fit(train_loader, val_loader, epochs=100)
Architecture: XGBoost + Linear Regression Ensemble (R² = 0.989)
Built predictive models for academic risk assessment with Tableau dashboards. Achieved 20% earlier intervention through feature engineering and hyperparameter optimization.
Git Repository →Innovation: Memory-Efficient KV Cache (60-80% reduction)
Implemented block-based attention mechanism with Copy-on-Write for beam search. Optimized inference pipeline achieving 2x throughput on large language models.
Git Repository →Stack: LangChain + FAISS + GPT (93% relevance)
Deployed context-aware chatbot using Retrieval-Augmented Generation. Integrated semantic search with vector embeddings, containerized with Docker for scalability.
Git Repository →Course: Machine Learning (CS 550)
Designed a modular experiment framework combining PPLM steering, a lightweight RLHF proxy (reward-model + supervised fine-tune), and hybrid inference-time control for generating context-aware poetry. Built reproducible pipelines for training, evaluation, and batch sweeps with aggregated metrics such as reward, perplexity, and distinctness.
View Repository →Course: Artificial Intelligence (CSL304)
Built a full Bomberman game in Python with AI-driven bot agents capable of strategic movement, bomb placement, and real-time decision-making. Implemented A* pathfinding, state-based behavior (search, chase, evade), bomb safety simulation, and intelligent evasion scoring to create adaptive, challenging opponents. Includes customizable gameplay settings and full gameplay demo video.
view Repository →