Vedant Misra

PaLM: Scaling Language Modeling with Pathways

Journal of Machine Learning Research, 2023 • 8,335 citations

Scaled a 540-billion-parameter language model using Google's Pathways system, achieving state-of-the-art few-shot performance across hundreds of language understanding and generation tasks.

Evaluating Large Language Models Trained on Code

arXiv, 2021 • 7,631 citations

Introduced the HumanEval benchmark for evaluating the code generation capabilities of large language models; foundational work for GitHub Copilot and similar tools.

Gemini: A Family of Highly Capable Multimodal Models

arXiv, 2023 • 7,002 citations

Technical report on Gemini, Google DeepMind's multimodal AI model family with state-of-the-art capabilities across text, image, audio, and video understanding.

Gemini 1.5: Unlocking Multimodal Understanding Across Millions of Tokens

arXiv, 2024 • 3,448 citations

Long-context understanding with a context window of up to 10M tokens, enabling new applications in document analysis and reasoning.

PaLM 2 Technical Report

arXiv, 2023 • 2,252 citations

Next generation of PaLM with improved multilingual, reasoning, and coding capabilities.

Beyond the Imitation Game: Quantifying and Extrapolating the Capabilities of Language Models

Transactions on Machine Learning Research, 2023 • 2,212 citations

Comprehensive benchmark (BIG-bench) with over 200 tasks for evaluating language model capabilities beyond simple imitation.

Solving Quantitative Reasoning Problems with Language Models

NeurIPS, 2022 • 1,350 citations

Demonstrated that language models can solve complex mathematical and quantitative reasoning problems when further trained on technical content.

Gemini 2.5: Pushing the Frontier with Advanced Reasoning and Agentic Capabilities

arXiv, 2025 • 1,217 citations

Latest-generation Gemini models with enhanced reasoning, multimodality, and agentic capabilities.

Gemma 3 Technical Report

arXiv, 2025 • 944 citations

Open-weight model family designed for responsible AI development and deployment.

Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets

arXiv, 2022 • 576 citations

Discovered the "grokking" phenomenon, in which neural networks suddenly generalize long after overfitting, with implications for understanding deep learning.

Exploring Length Generalization in Large Language Models

NeurIPS, 2022 • 319 citations

Analysis of how language models generalize to longer sequences than seen during training.

Rational Orbits Around Charged Black Holes

Physical Review D, 2010 • 69 citations

Classification of eccentric timelike orbits in the spacetime of a charged black hole using dynamical systems theory, with applications to gravitational wave astronomy.