Vardaan Pahuja

I am a Ph.D. student in CSE at The Ohio State University. I am fortunate to be advised by Prof. Yu Su. I graduated with M.Sc. (thesis track) in Computer Science from Université de Montréal (affiliated with MILA). Prior to this, I was working as Software Engineer in IBM India Research Lab., Bangalore. I graduated with Bachelor of Technology (Hons.) from IIT Kharagpur, India, and was awarded the prestigious Institute Silver Medal. I spent three wonderful summers as a research intern at Microsoft Research, Google Brain, and Bosch AI Research.

I'm on the 2026 industry job market and seeking full-time Research Scientist opportunities.
Vardaan Pahuja

News

Research Interests

My research interests lie in LLM agents, multimodal foundation models, and KB reasoning, with a central focus on the role of structure in modern multimodal machine learning. Broadly, I study how leveraging structure, either inherent (e.g., knowledge graphs) or deliberately synthesized (e.g., web interaction trajectories), can make multimodal systems more robust, generalizable, and interpretable. My work integrates visual signals, language, and structured knowledge to enhance representation and reasoning (ACL'21, CIKM'23, CIKM'24) as well as interactive web environments (ACL (Findings)'25). I also work on computer-use agents and LLM interpretability, including the use of sparse autoencoders to analyze salient latent structure in dense image representations (ICLR'26).

Publications

ICLR 2026 paper thumbnail

Automatic Image-Level Morphological Trait Annotation for Organismal Images

Vardaan Pahuja*, Samuel Stevens*, Alyson East, Sydne Record, Yu Su

Explorer paper thumbnail

Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents

Vardaan Pahuja*, Yadong Lu*, Corby Rosset, Boyu Gou, Arindam Mitra, Spencer Whitehead, Yu Su, Ahmed Awadallah

Camera Trap paper thumbnail

Reviving the Context: Camera Trap Species Classification as Link Prediction on Multimodal Knowledge Graphs

Vardaan Pahuja, Weidi Luo, Yu Gu, Cheng-Hao Tu, Hong-You Chen, Tanya Berger-Wolf, Charles Stewart, Song Gao, Wei-Lun Chao, Yu Su

KG-R3 paper thumbnail

A Retrieve-and-Read Framework for Knowledge Graph Link Prediction

Vardaan Pahuja, Boshi Wang, Hugo Latapie, Jayanth Srinivasa, Yu Su

Diversifying Tokenization paper thumbnail

Diversifying Joint Vision-Language Tokenization Learning

Vardaan Pahuja, AJ Piergiovanni, Anelia Angelova

Fine-Tuning paper thumbnail

Fine-Tuning is Fine, if Calibrated

Zheda Mai*, Arpita Chowdhury*, Ping Zhang*, Cheng-Hao Tu, Hong-You Chen, Vardaan Pahuja, Tanya Berger-Wolf, Song Gao, Charles Stewart, Yu Su, Wei-Lun Chao

Holistic Transfer paper thumbnail

Holistic Transfer: Towards Non-Disruptive Fine-Tuning with Partial Target Data

Cheng-Hao Tu, Hong-You Chen, Jike Zhong, Zheda Mai, Vardaan Pahuja, Tanya Berger-Wolf, Song Gao, Charles Stewart, Yu Su, Wei-Lun Chao

KBQA Survey paper thumbnail

Knowledge Base Question Answering: A Semantic Parsing Perspective

Yu Gu, Vardaan Pahuja, Gong Cheng, Yu Su

Systematic Investigation paper thumbnail

A Systematic Investigation of KB-Text Embedding Alignment at Scale

Vardaan Pahuja, Yu Gu, Wenhu Chen, Mehdi Bahrami, Lei Liu, Wei-Peng Chen and Yu Su

Structure NMN paper thumbnail

Structure Learning for Neural Module Networks

Vardaan Pahuja, Jie Fu, Sarath Chandar, Christopher J Pal

CSQA paper thumbnail

Complex Sequential Question Answering: Towards Learning to Converse Over Linked Question Answer Pairs with a Knowledge Graph

Amrita Saha*, Vardaan Pahuja*, Mitesh M. Khapra, Karthik Sankaranarayanan, Sarath Chandar

Joint Correlated Sequence paper thumbnail

Joint Learning of Correlated Sequence Labeling Tasks Using Bidirectional Recurrent Neural Networks

Vardaan Pahuja*, Anirban Laha*, Shachar Mirkin, Vikas Raykar, Lili Kotlerman, Guy Lev

EMBC paper thumbnail

Learning a Probabilistic Boolean Network Model from Biological Pathways and Time-series Expression Data

Vardaan Pahuja, Ritwik Kumar Layek, Pabitra Mitra

SalsaBot paper thumbnail

SalsaBot: Towards a Robust and Generalizable Embodied Agent

Chan Hee Song, Jiaman Wu, Ju-Seung Byeon, Zexin Xu, Vardaan Pahuja, Goonmeet Bajaj, Samuel Stevens, Ziru Chen, Yu Su

VQA CVPR19 paper thumbnail

Learning Sparse Mixture of Experts for Visual Question Answering

Vardaan Pahuja, Jie Fu, Christopher J Pal

VLDB paper thumbnail

Tooling framework for instantiating natural language querying system

Manasa Jammi, Jaydeep Sen, Ashish Mittal, Sagar Verma, Vardaan Pahuja, Rema Ananthanarayanan, Pranay Lohia, Hima Karanam, Diptikalyan Saha, Karthik Sankaranarayanan

Service

Awards

Honorable Mention Award for Poster, OSU CSE Graduate Student Research Poster Exhibition 2024
Institute Silver Medal, IIT Kharagpur – Best academic performance at graduation 2016
Prof. J.C. Ghosh Memorial Prize, IIT Kharagpur – Best academic performance (VI semester) 2015
International Symposium (Microwave and Comm.) 1981 Prize, IIT Kharagpur – Best academic performance (VI semester) 2015
Class of 1970 Alumni (US) Association Prize, IIT Kharagpur – Best academic performance in Institute (IV semester) 2014
IIT Kharagpur Alumni (California Chapter) Award, IIT Kharagpur – Best academic performance in Institute (IV semester) 2014
National Talent Search Examination (NTSE) – Award of scholarship under NTSE 2008