Rohan Pandey
Google Scholar · GitHub · Twitter · Email
I'm currently taking a short break from frontier AI research to solve OCR for Sanskrit. I believe it is critical that the
classical Indian literary canon be immortalized in the training corpus for superintelligence.
At OpenAI, I helped train GPT-5 and future models. I graduated from Carnegie Mellon University in 2023 with an
honors thesis on semantics in multimodal LLMs.
Experience
- OpenAI: Explored model architecture questions spanning Pre-training, RL, and Inference
- Reworkd (YC S23): Built a multimodal web agent generating 5k lines of code weekly
- Microsoft AI: Fine-tuned language models to automate enterprise-scale data annotation
Selected Publications
- gzip Predicts Data-dependent Scaling Laws (arXiv 2024)
- Multimodal Learning Without Multimodal Data: Guarantees and Applications (ICLR 2024)
- Towards Vision-Language Mechanistic Interpretability: a Causal Tracing Tool for BLIP (ICCV 2023 - CLVL)
- Cross-modal Attention Congruence Regularization for Vision-Language Relation Alignment (ACL 2023)
- Syntax-guided Neural Module Distillation to Probe Compositionality in Sentence Embeddings (EACL 2023)
- A Family of Cognitively Realistic Parsing Environments for Deep Reinforcement Learning (NeurIPS 2021 - Deep RL)
Featured Projects
- LlamaGym: Fine-tune LLM agents with online reinforcement learning
- Tarsier: Vision utilities for agents to interact with the web
- veda.dev: Morphology visualizer for Sanskrit literature research & education
Fun Facts
- Run a biweekly Sanskrit reading group in San Francisco. Please reach out if interested!
- Forked ved/acc from e/acc in 2023
- Lived at AGI House SF, a hacker house in Twin Peaks, for a year until September 2024
- Taught a Classical Indian Philosophy course at Carnegie Mellon University
- Conlanging in middle school led me to linguistics, and consequently to NLP & Sanskrit