British-Canadian cognitive scientist and Nobel laureate; the central figure of the connectionist tradition, architect of the backpropagation renaissance, and the most influential teacher in the history of deep learning.
Basic Information / Profile
| Field | Details |
|---|---|
| Full Name | Geoffrey Everest Hinton |
| Born | December 6, 1947, Wimbledon, London, England, UK |
| Nationality | British-Canadian |
| Current Institution | University of Toronto (University Professor Emeritus); Vector Institute (Chief Scientific Advisor) |
| Research Fields | Artificial neural networks, machine learning, cognitive science, deep learning, AI safety |
| PhD Advisor | Christopher Longuet-Higgins |
| PhD Thesis | Relaxation and its Role in Vision (University of Edinburgh, 1978) |
| Personal Website | cs.toronto.edu/~hinton |
| X / Twitter | @geoffreyhinton |
| Google Scholar | scholar.google.com/citations?user=JicYPdAAAAAJ |
| Nobel Prize | nobelprize.org/prizes/physics/2024/hinton/facts |
Overview
Geoffrey Hinton is a British-Canadian computer scientist and cognitive psychologist who spent five decades championing neural networks as the right approach to machine intelligence at a time when the academic mainstream had largely abandoned them. His 1986 Nature paper popularising backpropagation for multi-layer networks, his development of Boltzmann machines and deep belief networks, and his supervision of AlexNet — the convolutional network that effectively restarted the field in 2012 — form the technical spine of modern deep learning. Through his group at the University of Toronto he trained or mentored a disproportionate share of the researchers who now lead AI at major laboratories, including Yann LeCun, Ilya Sutskever, Alex Krizhevsky, Ruslan Salakhutdinov, and Peter Dayan. He shared the 2018 Turing Award with Yoshua Bengio and Yann LeCun, and in 2024 he and John Hopfield were awarded the Nobel Prize in Physics “for foundational discoveries and inventions that enable machine learning with artificial neural networks.” In May 2023, Hinton resigned from Google Brain to speak freely about what he regards as the existential and near-term risks of the technology he helped create — a position he has stated with increasing urgency since.
Early Life & Education
Family and Intellectual Inheritance
Hinton was born on December 6, 1947, in Wimbledon. His family tree runs through a remarkable seam of British intellectual history: he is the great-great-grandson of the mathematician and logician George Boole, whose work on Boolean algebra became a foundation of computer science, and of Mary Everest Boole, the mathematician and educator. His middle name, Everest, comes from his great-great-granduncle George Everest, the Surveyor General of India after whom the mountain is named. His father was the entomologist H. E. Hinton; his first cousin once removed was Joan Hinton, one of two female physicists on the Manhattan Project. Hinton has noted that his back injury at age 19 — which has made sitting painful for his entire adult life — shaped his working habits, and he has dealt with depression throughout his career.
King’s College, Cambridge — BA (1967–1970)
Hinton matriculated at King’s College, Cambridge in 1967, cycling through natural sciences, history of art, and philosophy before settling on experimental psychology, graduating with a Bachelor of Arts in 1970. After completing his degree he spent a year apprenticing as a carpenter before returning to academia — a biographical pause he has cited as evidence that his intellectual path was far from predetermined.
University of Edinburgh — PhD (1972–1978)
Hinton began his doctorate at Edinburgh in 1972 under Christopher Longuet-Higgins, a cognitive scientist who had recently converted from neural network research to the symbolic AI approach then dominant in the field. Working within this unsympathetic environment, Hinton wrote a thesis on Relaxation and its Role in Vision, exploring how constraint-satisfaction processes in visual perception might be modelled — early groundwork for the energy-based models he would develop later. The degree was awarded in 1978. Having difficulty securing research funding in Britain, he moved to the United States.
Career
UC San Diego and Carnegie Mellon — Postdoc and Faculty (1978–1987)
Hinton was a postdoctoral researcher at UC San Diego from 1978, where he joined David Rumelhart and Ronald Williams. Their collaboration produced the 1986 Nature paper “Learning Representations by Back-propagating Errors,” which demonstrated that multi-layer networks could learn useful internal representations via gradient descent through the chain rule — the result that reignited serious interest in neural networks after a decade of stagnation. Hinton has credited Rumelhart with the core idea; the paper’s importance lay in its accessible presentation and compelling experiments at a moment when the field needed a proof of concept. From 1982 to 1987 he was faculty at Carnegie Mellon, where he joined the Parallel Distributed Processing (PDP) group that included Terry Sejnowski, Francis Crick, David Rumelhart, and James McClelland — a connectionist research community that explicitly positioned itself against the symbolic AI mainstream during the AI winter. With Ackley and Sejnowski, he co-invented Boltzmann machines in 1985, an energy-based stochastic recurrent network whose learning algorithm drew on ideas from statistical physics.
University of Toronto — Professor (1987–present)
Hinton moved to the University of Toronto in 1987 as a CIFAR Fellow — Canada had actively recruited him, in part to avoid the growing military funding of AI research in the United States, which conflicted with his socialist politics. He has been affiliated with Toronto ever since, becoming University Professor and, after his retirement, University Professor Emeritus. Toronto became the world’s most influential node for neural network research for the following three decades, producing or hosting a disproportionate fraction of the researchers who later shaped the field.
His Toronto group’s output across this period was broad and consistent. The wake-sleep algorithm (1995, with Dayan, Frey, and Neal) introduced a generative model trained by alternating recognition and generation passes that anticipated the variational autoencoder framework. In 2006 Hinton and Salakhutdinov published a landmark Science paper demonstrating that deep networks with many hidden layers could be trained effectively using a greedy layer-by-layer pre-training approach with restricted Boltzmann machines — a result that, more than any single paper, triggered the “deep learning” renaissance by showing that depth was achievable in practice. In 2008 he developed t-SNE with Laurens van der Maaten, a dimensionality reduction and visualisation technique that became ubiquitous for inspecting high-dimensional data. In 2012 his students Alex Krizhevsky and Ilya Sutskever, under his supervision, produced AlexNet — a deep convolutional neural network that won the ImageNet Large Scale Visual Recognition Challenge by a margin of nearly 11 percentage points over the previous best, shocking the computer vision community and effectively setting the agenda for the field’s next decade. The paper on Dropout (2014, with Srivastava, Krizhevsky, Sutskever, and Salakhutdinov) formalised a simple but powerful regularisation technique that became standard practice. Knowledge distillation (2015, with Vinyals and Dean) proposed a method for transferring learned knowledge between networks by training on soft probability outputs rather than hard labels — now universally used in model compression.
In 2012, Hinton co-founded DNNresearch Inc. with Krizhevsky and Sutskever to commercialise the technology. Google acquired the company in March 2013 for $44 million — a transaction that effectively signalled to the entire industry that deep learning had arrived.
UCL Gatsby Computational Neuroscience Unit — Founding Director (1998–2001)
Between his Toronto appointments, Hinton served as the founding director of the Gatsby Computational Neuroscience Unit at University College London, funded by the Gatsby Charitable Foundation. The unit became an influential training ground for computational neuroscience and probabilistic machine learning; Peter Dayan, who had been Hinton’s postdoc at Toronto, became co-director and later director.
Google Brain — Distinguished Researcher (2013–2023)
Following the DNNresearch acquisition, Hinton divided his time between the University of Toronto and Google Brain, where he worked as a Distinguished Researcher. During this period he continued fundamental research, introducing capsule networks (2017) as an architectural alternative to standard convolutional networks that aimed to better represent part-whole spatial relationships, and the Forward-Forward algorithm (NeurIPS 2022) as a speculative alternative to backpropagation that replaces the backward pass with a second forward pass on negative data. In 2021 he co-authored a widely cited paper on SimCLR, a contrastive self-supervised learning framework for visual representations.
In May 2023, Hinton publicly announced his resignation from Google, stating he wanted to speak freely about AI risks “without considering how this impacts Google.” In his New York Times interview he said “a part of him now regrets his life’s work.” He stressed he was not criticising Google — which he praised for acting responsibly — but that the structural position of being employed by a major AI developer made candid public statements about the technology’s dangers professionally complicated.
Post-Google — Public AI Safety Advocate (2023–present)
Since leaving Google, Hinton has been one of the most prominent and credible voices raising alarms about near-term and long-term AI risk. He signed the 2023 Center for AI Safety statement that AI extinction risk should be treated as a global priority alongside nuclear war and pandemics. He has warned of risks from deliberate misuse — particularly the use of AI to design novel bioweapons (“It just requires one crazy guy with a grudge”) — from technological unemployment, and from the possibility of AI systems developing misaligned sub-goals through instrumental convergence. By late 2024 he was estimating a 10–20% probability of human extinction from AI within 30 years. He has advocated for government regulation as the only realistic mechanism to force safety investment on competing corporations, and supported California’s SB 1047 AI safety bill alongside Yoshua Bengio, Stuart Russell, and Lawrence Lessig.
Key Contributions
- Backpropagation — canonical popularisation (Nature, 1986, with Rumelhart and Williams) — Demonstrated that gradient descent through the chain rule enables multi-layer networks to learn useful internal representations; the paper that ended the first AI winter for neural networks. Hinton has credited Rumelhart with the core idea; the prior work by Werbos (1974) and Linnainmaa (1970) predated it, but the 1986 paper made it accessible and credible to the community.
- Boltzmann Machines (Cognitive Science, 1985, with Ackley and Sejnowski) — Introduced an energy-based stochastic recurrent network with a biologically motivated learning rule; the Boltzmann machine was explicitly cited in Hinton’s Nobel Prize citation.
- Restricted Boltzmann Machines and Deep Belief Networks (Science, 2006, with Salakhutdinov) — Showed that deep networks could be trained effectively using greedy layer-by-layer pre-training with RBMs; the paper that catalysed the modern deep learning era by proving that depth was tractable.
- Distributed Representations — Argued across multiple papers in the 1980s and 1990s that meaning and knowledge should be encoded in patterns of activation across many units, not in single “grandmother cells”; a foundational theoretical commitment that distinguished connectionism from earlier symbolic approaches.
- AlexNet (NeurIPS 2012, with Alex Krizhevsky and Ilya Sutskever) — A deep convolutional network that won ImageNet 2012 with a top-5 error rate of 15.3%, nearly 11 points ahead of the next best entry, igniting the deployment of deep learning across industry.
- Dropout (JMLR, 2014, with Srivastava, Krizhevsky, Sutskever, and Salakhutdinov) — A regularisation method that randomly deactivates units during training, preventing co-adaptation and functioning as an implicit ensemble; became standard practice in neural network training.
- Knowledge Distillation (2015, with Vinyals and Dean) — Introduced training a compact student network on soft probability outputs of a large teacher network, enabling model compression without hard-label retraining; now universally used in efficient inference and deployment.
- t-SNE (JMLR, 2008, with Laurens van der Maaten) — A non-linear dimensionality reduction method that preserves local cluster structure for visualisation of high-dimensional data; became the standard tool for inspecting embedding spaces.
- Wake-Sleep Algorithm (Science, 1995, with Dayan, Frey, and Neal) — A learning algorithm for hierarchical generative models that alternates recognition (“wake”) and generation (“sleep”) phases; anticipated the structure of later variational inference methods.
- Capsule Networks (2017) — A speculative architecture replacing scalar activations with vectors (“capsules”) to encode part-whole spatial relationships, aiming to overcome CNNs’ insensitivity to spatial configuration; has generated substantial follow-on work though not yet displaced CNNs in production.
- SimCLR / Contrastive Learning (ICML 2020, with Chen, Kornblith, and Norouzi) — A contrastive self-supervised learning framework for visual representations that became a widely adopted baseline for pre-training visual encoders.
- Forward-Forward Algorithm (NeurIPS 2022) — A speculative backpropagation alternative using two forward passes (on positive and negative data) rather than a backward pass; relevant to “mortal computation” settings such as analogue hardware.
- Academic lineage — PhD students include Richard Zemel, Brendan Frey, Radford Neal, Yee Whye Teh, Ruslan Salakhutdinov, Ilya Sutskever, Alex Krizhevsky, and Peter Brown; postdoctoral researchers include Yann LeCun, Peter Dayan, Max Welling, Alex Graves, and Zoubin Ghahramani — a cohort that has collectively shaped virtually every major AI laboratory active today.
Awards & Recognition
- Rumelhart Prize (2001) — First recipient of the prize for contributions to the theoretical foundations of human cognition.
- IJCAI Award for Research Excellence (2005)
- Herzberg Canada Gold Medal for Science and Engineering (2011)
- ACM Turing Award (2018, jointly with Yoshua Bengio and Yann LeCun) — “For conceptual and engineering breakthroughs that have made deep neural networks a critical component of computing.”
- Companion of the Order of Canada (2018)
- Dickson Prize in Science, Carnegie Mellon University (2021)
- Princess of Asturias Award for Technical and Scientific Research (2022, jointly with Bengio, LeCun, and Hassabis)
- Nobel Prize in Physics (2024, jointly with John Hopfield) — “For foundational discoveries and inventions that enable machine learning with artificial neural networks.” The Boltzmann machine was explicitly cited.
- VinFuture Grand Prize (2024, jointly with Bengio, LeCun, Jensen Huang, and Fei-Fei Li)
- Queen Elizabeth Prize for Engineering (2025, jointly with Bengio, Bill Dally, Hopfield, LeCun, Jensen Huang, and Fei-Fei Li)
- Sandford Fleming Medal (2025) — Royal Canadian Institute for Science, for excellence in science communication.
- Fellow of the Royal Society (FRS) (1998)
- Fellow of the Royal Society of Canada (1996)
- Fellow of AAAI (1990)
- International Member, US National Academy of Engineering (2016)
- International Member, US National Academy of Sciences (2023)
Key Relationships
- David Rumelhart — Intellectual collaborator at UC San Diego and co-author of the 1986 backpropagation paper; Hinton has consistently credited Rumelhart with the core idea and regards the paper as Rumelhart’s invention.
- Terrence Sejnowski — Carnegie Mellon colleague and co-inventor of Boltzmann machines; part of the PDP group that maintained connectionism through the AI winter.
- Yoshua Bengio — CIFAR co-member and long-time collaborator; co-Turing Award winner; Bengio’s Montréal group and Hinton’s Toronto group formed the two pillars of Canadian deep learning research.
- Yann LeCun — Postdoctoral researcher in Hinton’s group at Toronto (1987–1988); co-Turing Award winner; the postdoc year was the beginning of LeCun’s work on convolutional networks.
- Ilya Sutskever — PhD student; co-inventor of AlexNet; Hinton has said he is proud that Sutskever participated in the 2023 OpenAI board decision to remove Sam Altman, citing AI safety concerns as shared motivation.
- Alex Krizhevsky — PhD student; principal implementer of AlexNet; the three co-founded DNNresearch, which Google acquired in 2013.
- Peter Dayan — Postdoctoral researcher; became co-founder and later director of the Gatsby Unit at UCL, and a leading figure in computational neuroscience; Hinton’s influence on Dayan’s research direction was formative.
- John Hopfield — Nobel co-laureate; Hopfield’s 1982 paper on associative memory in Hopfield networks was a direct predecessor to Boltzmann machines and is the other body of work cited in the Nobel citation.
Personal Style
Hinton is known for a combination of deep stubbornness — holding connectionist positions through two AI winters when the consensus was against him — and intellectual restlessness: he has regularly proposed new learning frameworks (capsule networks, Forward-Forward, mortal computation) that challenge the consensus he helped establish, often noting that he suspects backpropagation is not how the brain actually works and that understanding the brain’s actual learning algorithm remains an open problem. His public voice since 2023 has been notably different from his scientific one: blunt, repetitive for deliberate emphasis, and willing to assign probability estimates to catastrophic outcomes. He has described the situation as roughly analogous to a brilliant physicist who helped build the atomic bomb and then spent the rest of his career warning about it. He moved from the United States to Canada in part out of political disillusionment with Reagan-era politics and military AI funding — a detail that reveals a consistency between his scientific convictions and his wider values that has been present since early in his career.
References
- Wikipedia: Geoffrey Hinton
- Nobel Prize facts page
- University of Toronto profile
- Personal website: cs.toronto.edu/~hinton
- Google Scholar profile
- X / Twitter: @geoffreyhinton
- Digg AI profile
- Cade Metz, “‘The Godfather of A.I.’ Leaves Google and Warns of Danger Ahead,” The New York Times, May 1, 2023
- Joshua Rothman, “Why the Godfather of A.I. Fears What He’s Built,” The New Yorker, November 20, 2023
- Rumelhart, Hinton, Williams, “Learning Representations by Back-propagating Errors,” Nature, 1986
- Hinton, Salakhutdinov, “Reducing the Dimensionality of Data with Neural Networks,” Science, 2006
- Krizhevsky, Sutskever, Hinton, “ImageNet Classification with Deep Convolutional Neural Networks,” NeurIPS 2012