Pratham Singla - AI/ML Researcher & Engineer | IIT Roorkee

About

AI/ML researcher and engineer with a 9.195/10 CGPA from IIT Roorkee, specializing in Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), and Computer Vision. Experienced in developing and evaluating AI systems through multiple research internships, publications in NeurIPS and AAAI workshops, and independent projects. Seeking to apply this technical and research background to AI development.

Work Experience

Research Intern

Lossfunk

Dec 2024 - Present

Remote, India

Developed AI benchmarks and contributed to reinforcement learning frameworks for vision-language models.

Developed a novel benchmark, inspired by ZeroBench, for complex visual reasoning, integrating an automated question-generation pipeline to train and evaluate vision-language models on GPT-level reasoning tasks.
Contributed to a reinforcement learning-based training framework, specifically designed to enhance models' ability to ground reasoning in visual inputs, reducing reliance on textual cues.
Created an evaluation system for assessing visual-language understanding in next-generation AI models.

Gen AI Intern

Omega Intelligence

Mar 2025 - May 2025

Remote, India

Explored and optimized Retrieval-Augmented Generation (RAG) architectures, establishing robust local benchmarking setups for Large Language Models (LLMs).

Explored and analyzed cutting-edge agentic Retrieval-Augmented Generation (RAG) architectures to optimize information retrieval and synthesis for complex queries.
Established a robust local benchmarking setup using tools like Ollama, enabling efficient deployment and performance evaluation of Large Language Models (LLMs).

Research Intern

Boston University

Mar 2025 - May 2025

Boston, MA, US

Investigated self-awareness in Large Language Models (LLMs) and evaluated fine-tuning methods to understand model behavior and generalization.

Analyzed the alignment between internal reasoning and external outputs in LLMs as part of the self-awareness study.
Evaluated the performance of SFT-, DPO-, and GRPO-tuned models on tasks involving bias, risk, and reward hacking, providing insights into model vulnerabilities.
Designed and executed comprehensive experiments to rigorously test the awareness of learned behaviors and generalization of reasoning across diverse domains.

Education

Mechanical Engineering

Indian Institute of Technology Roorkee

9.195/10 CGPA

Aug 2023

Roorkee, Uttarakhand, India

Volunteer

Core Member

ACM, IIT Roorkee

Mar 2024 - Present

Roorkee, Uttarakhand, India

Actively participated in ACM chapter activities, contributing to initiatives that advance computer science education and community engagement at IIT Roorkee.

Collaborated with fellow members to promote computer science and foster a vibrant academic community.
Contributed to the development and dissemination of technical knowledge through participation in group projects and discussions.
Supported the chapter's mission to provide resources and opportunities for students interested in computing.

Head of Projects

Vision and Language Group, IIT Roorkee

Feb 2024 - Present

Roorkee, Uttarakhand, India

Led project planning and execution, fostered an active Deep Learning community, and moderated discussions within the Vision and Language Group at IIT Roorkee.

Orchestrated the overall planning and moderation of research projects, contributing to strategic direction and successful project outcomes.
Organized and moderated paper discussions, enhancing knowledge sharing and collaborative research initiatives within the group.
Helped build an active Deep Learning community on campus, fostering growth and engagement among students.

Projects

DE-VTL: A Retrieval Framework for RAG

Dec 2024 - Feb 2025

Developed an innovative Retrieval-Augmented Generation (RAG) framework focused on enhancing retrieval quality through active learning and efficient fine-tuning.

Evaluating AI Agent Frameworks

Nov 2024 - Dec 2024

Evaluated AI agent frameworks on QA tasks to assess their reasoning capabilities, response accuracy, and efficiency.

AI Mock Interview Chatbot

Sep 2024 - Oct 2024

Engineered a voice-interactive AI interview coach utilizing Gemini LLM and Vapi AI to provide real-time feedback and structured scoring for personalized interview preparation.

Low Light Image Enhancement

Jun 2024 - Jul 2024

Developed a neural network-based solution for enhancing low-light images, achieving significant PSNR improvements through advanced image processing techniques.

Publications

Reflective Self-Awareness and Reasoning Alignment in LLMs

AAAI'26

Jan 2026

Analyzed self-awareness and reasoning alignment in Large Language Models (LLMs) across various scenarios, evaluating the impact of SFT, DPO, and GRPO fine-tuning on model behavior, including bias, risk, and reward hacking.

Adaptive Urban Planning

AAAI'25 AI4UP Workshop

Jan 2025

Developed a multi-agent urban planning framework using LLMs and genetic algorithms for optimization and regional customization, resulting in significant improvements in livability and accessibility.

StegaVision: Enhancing Steganography with Attention Mechanism

AAAI'25 Student Abstract

Jan 2025

Developed a novel image steganography model using a neural network-based approach, integrating attention mechanisms (channel and spatial attention) into an autoencoder architecture.

Give me a hint: Can LLMs take a hint to solve math problems?

NeurIPS'24 Math-AI Workshop

Dec 2024

Conducted a study on LLMs' ability to utilize hints in mathematical problem-solving on the MATH dataset, benchmarking zero-shot, few-shot, chain-of-thought, and adversarial hinting against traditional prompting.

Skills

Large Language Models (LLMs)

LLM Development
LLM Evaluation
RAG
Generative AI
Fine-tuning
Self-Awareness in LLMs
Reasoning Alignment
GPT
Gemini LLM

Machine Learning & Deep Learning

Reinforcement Learning
Neural Networks
Computer Vision
Image Enhancement
Steganography
Active Learning
Genetic Algorithms
Data Preprocessing
Histogram Analysis

AI/ML Research & Development

Benchmarking
Experiment Design
Model Evaluation
Problem Solving
Technical Research
Algorithm Development
AI Ethics
Multi-Agent Systems

Programming & Tools

Python
Ollama
Vapi AI
LoRA
ZeroBench
Code Development
Data Analysis

References

Paras Chopra

Founder, Lossfunk, Wingify