Research - Vedant Misra

My research focuses on advancing the capabilities of large language models and on the computational principles of intelligence. I have publications and patents spanning state of the art AI, machine learning, human-computer interaction, and quantitative finance.

PaLM: Scaling Language Modeling with Pathways

Journal of Machine Learning Research, 2023 • 8,335 citations

Scaling language models using Google's Pathways system, achieving state-of-the-art performance across hundreds of language understanding and generation tasks.

Evaluating Large Language Models Trained on Code

arXiv, 2021 • 7,631 citations

Introduced HumanEval benchmark for evaluating code generation capabilities of large language models, foundational work for GitHub Copilot and similar tools.

Gemini: A Family of Highly Capable Multimodal Models

arXiv, 2023 • 7,002 citations

Technical report on Gemini, Google DeepMind's multimodal AI model family with state-of-the-art capabilities across text, image, audio, and video understanding.

Gemini 1.5: Unlocking Multimodal Understanding Across Millions of Tokens

arXiv, 2024 • 3,448 citations

Long-context understanding with up to 10M token context window, enabling new applications in document analysis and reasoning.

PaLM 2 Technical Report

arXiv, 2023 • 2,252 citations

Next generation of PaLM with improved multilingual, reasoning, and coding capabilities.

Beyond the Imitation Game: Quantifying and Extrapolating the Capabilities of Language Models

Transactions on Machine Learning Research, 2023 • 2,212 citations

Comprehensive benchmark (BIG-bench) with over 200 tasks for evaluating language model capabilities beyond simple imitation.

Solving Quantitative Reasoning Problems with Language Models

NeurIPS, 2022 • 1,350 citations

Demonstrated how language models can solve complex mathematical and quantitative reasoning problems through improved training approaches.

Gemini 2.5: Pushing the Frontier with Advanced Reasoning and Agentic Capabilities

arXiv, 2025 • 1,217 citations

Latest generation Gemini model with enhanced reasoning, multimodality, and agentic capabilities.

Gemma 3 Technical Report

arXiv, 2025 • 944 citations

Open-source language model family designed for responsible AI development and deployment.

Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets

arXiv, 2022 • 576 citations

Discovered the "grokking" phenomenon where neural networks suddenly generalize long after overfitting, with implications for understanding deep learning.

Exploring Length Generalization in Large Language Models

NeurIPS, 2022 • 319 citations

Analysis of how language models generalize to longer sequences than seen during training.

Rational Orbits Around Charged Black Holes

Physical Review D, 2010 • 69 citations

Classification of eccentric timelike orbits in charged black hole spacetime using dynamical systems theory, with applications to gravitational wave astronomy.

Methods and Systems for Automated Generation of Personalized Messages

US Patent 11,321,736, 2022 • 163 citations

Patent for systems and methods to automatically generate personalized content for sales and marketing communications using machine learning.

Method and Apparatus for Dynamic Information Visualization

US Patent 8,683,389, 2014 • 142 citations

Patent for methods and systems for dynamic visualization of complex information and data patterns.

Evidence of Market Manipulation in the Financial Crisis

arXiv, 2011 • 21 citations

Statistical analysis revealing evidence of market manipulation ("bear raids") at the beginning of the 2007 financial crisis.

Artificial Intelligence and Complex Statistical Modeling in Glaucoma Diagnosis

Current Opinion in Ophthalmology, 2021 • 19 citations

Application of AI and statistical methods to improve diagnosis and treatment of glaucoma.

Black Box Attacks on Transformer Language Models

ICLR 2019 Workshop on Debugging ML • 13 citations

Research on adversarial attacks against transformer-based language models.

Vulnerability Analysis of High Dimensional Complex Systems

Symposium on Self-Stabilizing Systems, 2010 • 8 citations

Analysis of vulnerability patterns in complex high-dimensional systems using network theory and statistical methods.

Regulation of Short Selling: The Uptick Rule and Market Stability

SEC Report, 2010 • 3 citations

Analysis of short selling regulations and their impact on market stability, presented to the SEC.