I'm an AI researcher who graduated with honors in CS from Carnegie Mellon last year. I've won 10+ hackathons and published several papers at ML conferences.
Currently, I'm researching compute-efficient model architectures on OpenAI's Training team. My undergrad thesis explored semantics in vision-language transformers.
Before the LLM era, I fine-tuned language models at Microsoft AI and advised startups deploying NLP for various B2B verticals. In my free time, I enjoy reading Sanskrit literature & learning weird math.
As a Research Resident on the Training team, I develop architectures that get more intelligent with compute.
Our deep learning research is the foundation for the intelligence of models like GPT-4o and o1.
I was the Founding Research Engineer and explored multimodal code generation for extracting web data at scale.
We raised $4M from Paul Graham, General Catalyst, AI Grant (Nat Friedman & Daniel Gross), SV Angel, Y Combinator, and the founders of Reddit, Instacart, & Cruise.
SEO content writers have to research their topics deeply before they can write. Ousia automates that research.
As technical co-founder, I built NLP & LLM solutions to 10x our users' article writing ability. Exited via co-founder buyout.
Vision-Language Models drastically fail to represent & align compositional structure (e.g. "mug in grass" vs "grass in mug").
In my Honors Thesis, we explore various vectorial approaches inspired by linguistic theory to address this problem, with papers at NeurIPS, ACL, EACL, and ICCV.
The AI Platform group at Microsoft builds infrastructure for enterprise-scale machine learning lifecycles on Azure.
I fine-tuned distilled LLMs to aid annotators in natural language data labeling, saving compute & improving speed.
Are large language models just learning co-occurrence statistics, or can they capture compositional relations as encoded by semantic formalisms?
We applied graph algorithms to Abstract Meaning Representation to create a task that probes compositional ability. I presented our work at the 2021 SCS Research Fair.
Vizerto is a digital sales assistant that makes domain-specific knowledge easily available to B2B sellers.
I advised their ML team on novel approaches to information retrieval, graphical knowledge representations, and more.
Our conversational socialbot interacted with thousands of Amazon Alexa users every day, maintaining the top average user rating for 2 months straight against teams from Stanford, USC, and more.
My work on user modeling and entity graphs was included in our paper at EMNLP 2021.
SapientX builds white label intelligent voice assistants for cars, phones, fridges, and stores.
I fine-tuned state-of-the-art models for extractive question answering to give Tele the ability to answer domain-specific user queries from large, unorganized document corpora.
Can deep reinforcement learning model how humans learn to parse syntax trees from experience?
We built a family of cognitively realistic parsing environments to explore how novel neural architectures & RL algorithms could inform psycholinguistic theory. Our work was accepted at NeurIPS 2021 Deep RL workshop.
Wordcab summarizes business meetings using the latest in abstractive neural summarization tech.
I worked with Aleks (the CEO) to build topic-based summarization, a highly demanded but technologically challenging feature.
Intheon builds neural data processing infrastructure used by labs across the world to simplify their brainwave analysis pipelines.
I undertook NSF-funded research to investigate how language models could aid brain-computer interfaces in assisting users.
#1 HN, #2 r/LocalLlama, GitHub Trending, 900+ stars
Fine-tune LLM agents with online reinforcement learning
Won 2nd @ AGI House SF's "Launch an LLM" Hackathon
2D Positional Embeddings for Web Structure Understanding
Helped out with my little sister's first LLM project!
LLMs as Collaboratively Edited Knowledge Bases
Deployed with active users
Morphology visualizer for Sanskrit literature research & education
It's 5:46am. Good morning 🫡
— Rohan Pandey (e/acc) (@khoomeik) March 6, 2024
I spent the last few hours writing a character-count constrained decoding algorithm for Llama2-13B to de-redact Elon's email to Ilya from 2018.
Here's one of the completions it proposed that perfectly matches the length constraints of the redaction. pic.twitter.com/TUWBXXVAln
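For the curious, here's a minimal sketch of how character-count constrained decoding can be wired up as a HuggingFace LogitsProcessor. The checkpoint name, budget value, placeholder prompt, and masking rule are my own illustrative assumptions, not the exact algorithm behind the tweet: the sketch just masks tokens that would overshoot the remaining character budget and forces EOS once it's spent.

```python
# Illustrative sketch (not the exact algorithm from the tweet): constrain
# generation so the completion's character count stays within a redaction's
# length by masking tokens that would overshoot the remaining budget and
# forcing EOS once the budget is used up.
import torch
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    LogitsProcessor,
    LogitsProcessorList,
)

class CharBudgetProcessor(LogitsProcessor):
    def __init__(self, tokenizer, prompt_len_tokens, char_budget):
        self.tok = tokenizer
        self.prompt_len = prompt_len_tokens
        self.budget = char_budget
        # Approximate character length each token contributes when decoded.
        self.tok_char_lens = torch.tensor(
            [len(tokenizer.decode([i])) for i in range(len(tokenizer))]
        )

    def __call__(self, input_ids, scores):
        # Characters generated so far, past the prompt.
        generated = self.tok.decode(
            input_ids[0, self.prompt_len:], skip_special_tokens=True
        )
        remaining = self.budget - len(generated)
        mask = torch.full_like(scores, float("-inf"))
        if remaining <= 0:
            mask[:, self.tok.eos_token_id] = 0.0   # budget spent: only EOS allowed
        else:
            allowed = self.tok_char_lens.to(scores.device) <= remaining
            mask[:, allowed] = 0.0                  # keep tokens that still fit
        return scores + mask

model_name = "meta-llama/Llama-2-13b-hf"            # assumed checkpoint
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name, torch_dtype=torch.float16, device_map="auto"
)

prompt = "Email with a [REDACTED] span, plus surrounding context..."  # placeholder
inputs = tok(prompt, return_tensors="pt").to(model.device)
processor = CharBudgetProcessor(tok, inputs["input_ids"].shape[1], char_budget=120)

out = model.generate(
    **inputs,
    logits_processor=LogitsProcessorList([processor]),
    max_new_tokens=256,
    do_sample=True,
)
print(tok.decode(out[0, inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```

Hitting the length exactly takes a bit more care in practice (e.g. lookahead so the final token lands precisely on the budget), but the masking idea above is the core of it.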
Got $28M? You can soon permanently halt open-source AI progress in California by training a suboptimal model with 10^26 FLOPs.
— Rohan Pandey (e/acc) (@khoomeik) May 26, 2024
Then, all small compute-optimal models with similar performance to your large suboptimal model get covered by SB 1047.
Chinchilla scaling laws tell us: https://t.co/pX3AFfze25 pic.twitter.com/c2TxBOV3G9
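For context, the back-of-the-envelope Chinchilla arithmetic behind that claim looks roughly like this. It's a sketch using the standard C ≈ 6ND approximation and the ~20 tokens-per-parameter rule of thumb; the exact figures are my own illustration, not from the tweet.

```python
# Rough Chinchilla-style arithmetic (illustrative, not from the tweet).
# Training compute: C ≈ 6 * N * D; compute-optimal data: D ≈ 20 * N.
C = 1e26                      # FLOPs threshold discussed for SB 1047
N = (C / (6 * 20)) ** 0.5     # compute-optimal parameter count at that budget
D = 20 * N                    # compute-optimal token count
print(f"N ≈ {N:.1e} params, D ≈ {D:.1e} tokens")
# ≈ 9e11 (~900B) parameters on ≈ 1.8e13 (~18T) tokens. The thread's point:
# a deliberately suboptimal 1e26-FLOP run can land at the same performance
# as much smaller compute-optimal models, pulling them into scope.
```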
anon, have you done your part to decelerate dealflow to decel VCs? https://t.co/7B6BDkpEOf pic.twitter.com/ANblQXzgsa
— Rohan Pandey (e/acc) (@khoomeik) November 15, 2023
quick schizo theory:
— Rohan Pandey (e/acc) (@khoomeik) December 6, 2023
top and bottom quarks were originally called "truth" and "beauty" quarks
"satya" in sanskrit = truth
"sundar" in sanskrit = beauty
the 2 mfs competing in the race to AGI are named after opposing fundamental particles
the simulation is fucking with us pic.twitter.com/5V9t3GSSNg
u ppl r not ready for whats coming
— Rohan Pandey (e/acc) (@khoomeik) October 5, 2023
(we're all in ML research) pic.twitter.com/8CRt3nmEhR
After organizing this hackathon @AGIHouseSF with @kylejohnmorris last weekend, we wrote an assembler that generates RDNA3 to target AMD GPUs for tinygrad (with @realGeorgeHotz's help).
— Rohan Pandey (e/acc) (@khoomeik) July 3, 2024
Here's what I learned going from AI Researcher to GPU Compiler Engineer in one day 🧵⬇️ https://t.co/2HnYApfq06
e/acc Meetup @AGIHouseSF last night was inspiring. Thank you to @BasedBeffJezos, @garrytan, @wagieeacc, @chrisprucha, @NexusVP & everyone else for the pro-homo-techno-capital energy.
— Rohan Pandey (e/acc) (@khoomeik) September 18, 2023
Feeling optimistic af. Details for October meetup & talk recordings out soon.
Back to build 🫡 pic.twitter.com/fMsf92c4vH
Interruptions make conversations feel natural.
— Rohan Pandey (e/acc) (@khoomeik) March 31, 2024
Much work has focused on AI voice assistants that can be interrupted by humans, but systems that know much more than us should be able to interrupt us too.
At @AGIHouseSF's Launchathon today, I'm launching Interrupting Cow 🐮📢 pic.twitter.com/8HgPSZ92Ie
wonder if we'll ever get an indian @natfriedman to sponsor an indus valley script decipherment prize
— Rohan Pandey (e/acc) (@khoomeik) May 18, 2024
even just putting together a dataset of the 6000 inscriptions discovered so far would be a huge win pic.twitter.com/rMMoCvUKhX
So it turns out you can replace backprop with fully parallelizable next token prediction losses at each layer and reach kinda comparable performance?
— Rohan Pandey (e/acc) (@khoomeik) June 11, 2024
Slightly less sample-efficient but 4x faster to train when parallelized. Not sure how well it scales yet tho. https://t.co/qJ3i2qPUz9 pic.twitter.com/xAtJhTwYku
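As a rough illustration of that idea (my own minimal sketch, not the implementation behind the thread): give each transformer layer its own next-token prediction head and detach its input, so no loss backpropagates across layers and the per-layer updates can run independently.

```python
# Minimal sketch (illustrative): per-layer next-token losses with no
# cross-layer backprop. Each block sees a detached copy of the previous
# block's output and trains only its own parameters (plus its own head).
import torch
import torch.nn as nn
import torch.nn.functional as F

VOCAB, DIM, HEADS, LAYERS, MAX_LEN = 32000, 512, 8, 6, 1024

class LocalLossLM(nn.Module):
    def __init__(self):
        super().__init__()
        self.embed = nn.Embedding(VOCAB, DIM)
        self.pos = nn.Embedding(MAX_LEN, DIM)
        self.blocks = nn.ModuleList(
            nn.TransformerEncoderLayer(DIM, HEADS, 4 * DIM, batch_first=True, norm_first=True)
            for _ in range(LAYERS)
        )
        # One prediction head per layer instead of a single final LM head.
        self.heads = nn.ModuleList(nn.Linear(DIM, VOCAB) for _ in range(LAYERS))

    def forward(self, tokens):
        T = tokens.shape[1]
        causal = torch.triu(
            torch.full((T, T), float("-inf"), device=tokens.device), diagonal=1
        )
        h = self.embed(tokens) + self.pos(torch.arange(T, device=tokens.device))
        losses = []
        for i, (block, head) in enumerate(zip(self.blocks, self.heads)):
            # Detach between blocks: layer i's loss never reaches layers < i,
            # which is what makes the per-layer losses independent.
            # (The embeddings still train through layer 0's loss.)
            h_in = h if i == 0 else h.detach()
            h = block(h_in, src_mask=causal)
            logits = head(h)
            losses.append(
                F.cross_entropy(
                    logits[:, :-1].reshape(-1, VOCAB), tokens[:, 1:].reshape(-1)
                )
            )
        return torch.stack(losses)

model = LocalLossLM()
opt = torch.optim.AdamW(model.parameters(), lr=3e-4)
tokens = torch.randint(0, VOCAB, (2, 128))   # stand-in batch of token ids
loss = model(tokens).sum()                    # sum of LAYERS independent losses
loss.backward()                               # gradients stay local to each layer
opt.step()
```

In this sketch the speedup would come from running the layer updates concurrently once each layer's input is available; the sequential loop above only removes the cross-layer gradient flow.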