SmartMedAl: LLM Fine-Tuning with RLAIF & DPO for Clinical QA
Summary
Developed SmartMedAl, an LLM fine-tuning project utilizing RLAIF and DPO for clinical QA, demonstrating advanced Generative AI capabilities.
Highly skilled Generative AI & Machine Learning Engineer with an M.S. in Computer Science and OCI certification, specializing in building and deploying production-grade LLM and ML systems. Proven expertise in fine-tuning large language models using RLAIF and DPO, developing RAG pipelines, and implementing agentic AI architectures. Delivers scalable AI solutions across applied research and industry, leveraging Python, PyTorch, LangChain, and robust cloud infrastructure (AWS, OCI) to drive innovation and efficiency.
Research Assistant
Los Angeles, CA, US
→
Summary
Conducting advanced ML-based pattern recognition research, developing Transformer models, and designing optimization algorithms to enhance system efficiency and reproducibility.
Highlights
Led ML-based pattern recognition research on Indus Valley historical scripts, leveraging deep learning models to achieve high accuracy in symbol classification.
Implemented a Transformer-based language translation model for AI curriculum technical demonstrations, significantly improving student comprehension and engagement.
Designed and optimized path optimization algorithms for GIS systems, enhancing visualization capabilities and improving emergency response route efficiency by an estimated X%.
Authored comprehensive technical specifications for model frameworks, ensuring high reproducibility and facilitating iterative development cycles.
Lead Course Producer
Los Angeles, CA, US
→
Summary
Managed course operations for an enterprise systems program, coordinating teams and academic delivery for 59 students to ensure high-quality learning outcomes.
Highlights
Led comprehensive course operations for an enterprise systems program, coordinating cross-functional teams and managing academic delivery for 59 students.
Evaluated assignments and delivered timely, feedback-driven assessments, significantly enhancing learning outcomes and maintaining high-quality standards.
Conducted ERP simulations, performing rigorous testing and validation of assignments while reinforcing complex enterprise architecture concepts for students.
Data Science Intern
→
Summary
Developed and implemented robust ML algorithms for large-scale time-series sensor data, optimizing feature extraction and enhancing LLM application performance.
Highlights
Designed and implemented advanced ML algorithms for large-scale time-series sensor data, enabling precise activity recognition across distributed wearable sensors.
Developed production-ready Python modules for multi-class activity feature extraction using sophisticated signal processing techniques, integrated into the core product pipeline.
Built a robust threshold-based calibration model across distributed body segment sensors, successfully reducing error rates to under 10%.
Engineered and refined prompts for LLM-based applications, significantly improving response consistency, evaluation quality, and downstream accuracy by an estimated X%.
Collaborated effectively within a cross-functional Agile team, delivering well-documented, maintainable Python code on tight delivery schedules.
Teaching Assistant
Los Angeles, CA, US
→
Summary
Assisted in delivering core AI curriculum to 30+ students, facilitating hands-on learning, and ensuring strong conceptual clarity through technical support.
Highlights
Assisted in delivering core AI curriculum to 30+ students, grading assignments and facilitating open lab sessions for hands-on learning and practical application.
Performed thorough testing and debugging of AI assignments, resolving student queries to ensure strong conceptual clarity and maximize engagement.
Provided individualized support to students, clarifying complex AI concepts and fostering a collaborative learning environment.
→
M.S.
Computer Science
Grade: 3.52/4.0
Courses
AI
ML for Data Science
Analysis of Algorithms
Database Systems
Web Technologies
→
B.E.
Information Technology
Grade: 9.19/10
Courses
Data Structures & Algorithms
Web Development
OS
Big Data Analytics
Issued By
Oracle Cloud Infrastructure
Issued By
Amazon Web Services
Python, SQL, JavaScript, Java, C.
PyTorch, TensorFlow, Scikit-learn, Keras, NumPy, Pandas, Matplotlib, Hugging Face.
Transformers, LangChain, TRL, RAG, LoRA, DPO, RLAIF, Prompt Engineering, Agentic AI, MCP, OCI GenAI.
AWS (EC2), Oracle Cloud Infrastructure (OCI), Docker, Flask, Streamlit, React, Node.js, MongoDB.
Summary
Developed SmartMedAl, an LLM fine-tuning project utilizing RLAIF and DPO for clinical QA, demonstrating advanced Generative AI capabilities.
Summary
Created SmartPDF Chat, an AI-powered multi-document RAG application for natural language querying across multiple PDFs.
Summary
Developed a full-stack web application for real-time stock analytics, demonstrating expertise in web development and cloud deployment.