Research engineer and compute strategist who co-created JAX, pioneered the QRNN architecture, and serves as Head of Compute at Anthropic.
Profile
| Nationality | American |
| Current Institution(s) | Anthropic (Head of Compute) |
| Research Areas | Deep Learning Systems, Large Language Models, ML Frameworks, TPU/Accelerator Efficiency |
| Education | BA, Linguistics, Stanford University |
| X / Twitter | @jekbradbury |
| GitHub | jekbradbury |
| Google Scholar | James Bradbury — 112,000+ citations |
Overview
James Bradbury is an American research engineer best known as a co-creator of JAX, Google’s high-performance numerical computing library that has become a foundational tool for large-scale deep learning research. Prior to JAX, he made significant early contributions to NLP with the Quasi-Recurrent Neural Network (QRNN), and he created torchtext, the canonical text-processing companion to PyTorch. After spending several years at Google Brain and Google DeepMind working on TPU infrastructure and efficient LLM inference at scale, he joined Anthropic in early 2023 as Head of Compute, where he is responsible for securing and effectively deploying the accelerator resources that underpin Claude model training.
Early Life & Education
Bradbury studied linguistics at Stanford University, an unusual background for an ML engineer that later informed his early focus on natural language processing. Before entering deep learning research full-time, he spent time at Caixin, the Chinese financial and investigative media organization, suggesting an early interest in language and information work that predated his ML career. His formal publication record begins around 2015–2016 at MetaMind, though his open-source activity on GitHub dates to at least 2014.
Career
Prior to his ML career, Bradbury did an internship at Caixin (财新传媒), the Beijing-based financial and investigative media organization.
Stanford Linguistics Department (dates unconfirmed)
Bradbury is listed on his X profile as having been affiliated with the Stanford Linguistics department, consistent with his undergraduate and possible early research background in computational linguistics.
MetaMind (pre-2016)
Bradbury joined MetaMind, the deep learning startup founded by Richard Socher, where he worked as a researcher on NLP systems. During this period he co-authored papers on neural machine translation and began developing the ideas that would become the QRNN. MetaMind was acquired by Salesforce in April 2016.
Salesforce Research (2016–2018)
Following the MetaMind acquisition, Bradbury became a research scientist at Salesforce Research. There he published the QRNN paper (ICLR 2017), demonstrating a sequence model up to 16× faster than LSTMs at equivalent accuracy by replacing recurrent layers with parallel convolutions and a minimal pooling step. He also created torchtext (pytorch/text), the first widely-adopted text-data toolkit for PyTorch, and the matchbox library for writing minibatch-efficient PyTorch code. In 2016 he co-represented MetaMind/Salesforce at the WMT machine translation shared task with Richard Socher.
Google Brain / Google DeepMind (2018–2023)
In September 2018, Bradbury joined Google Brain to work at the intersection of ML and programming languages. His most significant contribution during this period was as a core designer and early author of JAX, the composable transformations library built on top of XLA that enables JIT compilation, automatic differentiation, and vectorization of NumPy-style Python programs for GPU and TPU. He also contributed to Flux.jl, the Julia-based ML framework. Later in his Google tenure, as part of the JAX Engagements Team within Google DeepMind, he focused on enabling large-scale LLM users on TPU v4, co-authoring the 2022 PaLM inference efficiency paper demonstrating low-latency, high-throughput serving of 8B–540B parameter models.
Anthropic (2023–present)
Bradbury joined Anthropic in early 2023 as Head of Compute. In this role he is responsible for ensuring the company has the accelerator resources required to pursue its mission and that those resources are used effectively and efficiently across training and inference workloads. He has spoken publicly about compute strategy at the PyTorch Conference 2024, where he participated in a keynote panel on scaling and benchmarking.
Key Contributions
- JAX — Co-created the composable ML framework that underpins much of modern large-scale deep learning research; the jax-ml/jax repository has over 35,000 GitHub stars and Bradbury is one of its original named authors alongside Roy Frostig, Matthew James Johnson, and Chris Leary.
- Quasi-Recurrent Neural Networks (QRNN) — Introduced at ICLR 2017 (arXiv:1611.01576), this architecture alternates parallel convolutional layers with a minimal recurrent pooling step, achieving up to 16× speedup over cuDNN LSTMs at equivalent accuracy on language modeling, sentiment analysis, and neural machine translation.
- torchtext (pytorch/text) — Created the foundational NLP data-loading and preprocessing library for PyTorch; the repository has over 3,600 stars and was the standard text toolkit for the PyTorch ecosystem for several years.
- matchbox (salesforce/matchbox) — Open-sourced a PyTorch abstraction allowing researchers to write code at the level of individual examples and have it run automatically on efficient minibatches, simplifying NLP model prototyping.
- PaLM Inference Efficiency (2022) — Co-authored the paper and accompanying open-source code demonstrating practical high-throughput inference of models from 8B to 540B parameters on TPU v4, shaping how the industry thinks about LLM serving economics.
- Flux.jl contributions — Active contributor to the Julia ML ecosystem’s primary framework, reflecting his consistent interest in bringing ML to new programming paradigms.
Awards & Recognition
- Google Scholar citation count — Over 112,000 cumulative citations, driven primarily by the JAX reference and the QRNN paper, placing him among the most highly cited ML systems researchers of his cohort.
- PyTorch Conference 2024 keynote panelist — Invited to the scaling and benchmarking panel alongside researchers from Meta, together.ai, and UC Berkeley, recognizing his standing in the ML infrastructure community.
Key Relationships
- Richard Socher — Doctoral advisor and MetaMind founder; Bradbury worked under Socher at both MetaMind and Salesforce Research, co-authoring the QRNN paper and WMT 2016 system.
- Roy Frostig, Matthew James Johnson, Chris Leary — Co-creators of JAX at Google Brain; the four form the core of JAX’s original authorship team.
- Reiner Pope — Co-author on the PaLM inference paper at Google; Pope is now at MatX and Bradbury is listed as a co-author on his Google Scholar profile.
- Stephen Merity, Caiming Xiong — Co-authors on the QRNN paper at Salesforce Research, both prominent NLP researchers.
- Dario Amodei / Anthropic leadership — Current organizational context; Bradbury joined Anthropic shortly after the Google Cloud partnership announcement in early 2023.
- Soumith Chintala — Shared stage at PyTorch Conference 2024; Chintala co-founded PyTorch and Bradbury created torchtext as a core PyTorch ecosystem library.
Personal Style
Bradbury’s career traces an unusual arc from linguistics to ML systems to compute strategy, and his technical work consistently prioritizes making powerful machinery accessible: torchtext simplified NLP pipelines, JAX brought XLA’s power to Python researchers, and matchbox abstracted minibatch complexity away from model code. On social media he is terse and technically precise, more likely to share benchmark numbers or paper links than extended commentary. His move from individual contributor roles in research engineering to the strategic compute function at Anthropic reflects a broader pattern among senior ML infrastructure figures who increasingly see accelerator procurement and utilization as load-bearing decisions for frontier AI development.
References
- X / Twitter profile: @jekbradbury
- GitHub profile: jekbradbury
- Google Scholar: James Bradbury
- NVIDIA Blog author page
- LF Events — PyTorch Conference 2024 keynote speaker bio
- arXiv: Quasi-Recurrent Neural Networks (1611.01576)
- JAX GitHub repository
- pytorch/text GitHub repository
- PaLM inference paper (arXiv:2211.05102)