MS Computer Science @ UT Dallas (3.8 GPA) | Software Engineering Fellow @ Princeton Research Computing | NeurIPS 2025 Under Review
Building ML systems and backend infrastructure. Princeton HPC fellowship (CUDA/C++), NeurIPS 2025 submission on on-device LLM quantization, two published papers. Open-source contributor to Celery, IPython, and Apache Airflow.
- ATQ-LLM — Adaptive Ternary Quantization for LLM compression (~16x on GPT-2). NeurIPS 2025 (Under Review).
- MusicSourceClassifier — AI vs. human music detector. 99.9% accuracy on 900 samples. CNN + FAISS similarity search.
- GPS Fleet Telemetry — Event-driven pipeline with anomaly detection. 43-test suite.
- DoctorBot-AI — Full-stack AI chatbot. FastAPI + Mistral + Three.js. 18 async tests.
- N-Body Optimization — CUDA/OpenMP/MIC. 10.4x speedup. Princeton fellowship.
- Video Understanding — Multimodal pipeline. Qwen2-VL + BLIP + Flan-T5.
- Adaptive Ternary Quantization for On-Device LLM Compression — NeurIPS 2025 (Under Review)
- Quantum Enhanced Recommendation AI Chatbot — IGI Global, 2025
- Ensemble ML for Multi-Disease Detection — EasyChair, 2023
Python · C/C++ · CUDA · TypeScript · PyTorch · Transformers · FAISS · FastAPI · React · Docker · Kubernetes · AWS · PostgreSQL

