
PaLM: Scaling Language Modeling with Pathways
Scaling language models using Google's Pathways system, achieving state-of-the-art performance across hundreds of language understanding and generation tasks.
My research focuses on advancing the capabilities of large language models and on the computational principles of intelligence. I have publications and patents spanning state-of-the-art AI, machine learning, human-computer interaction, and quantitative finance.

Introduced HumanEval benchmark for evaluating code generation capabilities of large language models, foundational work for GitHub Copilot and similar tools.

Technical report on Gemini, Google DeepMind's multimodal AI model family with state-of-the-art capabilities across text, image, audio, and video understanding.

Long-context understanding with a context window of up to 10M tokens, enabling new applications in document analysis and reasoning.

Next generation of PaLM with improved multilingual, reasoning, and coding capabilities.

Comprehensive benchmark (BIG-bench) with over 200 tasks for evaluating language model capabilities beyond simple imitation.

Demonstrated how language models can solve complex mathematical and quantitative reasoning problems through improved training approaches.

Latest-generation Gemini model with enhanced reasoning, multimodality, and agentic capabilities.

Open-source language model family designed for responsible AI development and deployment.

Discovered the "grokking" phenomenon where neural networks suddenly generalize long after overfitting, with implications for understanding deep learning.

Analysis of how language models generalize to longer sequences than seen during training.

Classification of eccentric timelike orbits in charged black hole spacetime using dynamical systems theory, with applications to gravitational wave astronomy.

Patent for systems and methods to automatically generate personalized content for sales and marketing communications using machine learning.

Patent for methods and systems for dynamic visualization of complex information and data patterns.

Statistical analysis revealing evidence of market manipulation ("bear raids") at the beginning of the 2007 financial crisis.

Application of AI and statistical methods to improve diagnosis and treatment of glaucoma.

Research on adversarial attacks against transformer-based language models.

Analysis of vulnerability patterns in complex high-dimensional systems using network theory and statistical methods.

Analysis of short selling regulations and their impact on market stability, presented to the SEC.