In 1951, Marvin Minsky, then a student at Harvard, borrowed observations from animal behavior to try to design an intelligent machine. Drawing on the work of physiologist Ivan Pavlov, who famously used dogs to show how animals learn through punishments and rewards, Minsky created a computer that could continuously learn through similar reinforcement to solve a virtual maze.
At the time, neuroscientists had yet to figure out the mechanisms within the brain that allow animals to learn in this way. But Minsky was still able to loosely mimic the behavior, thereby advancing artificial intelligence. Several decades later, as reinforcement learning continued to mature, it in turn helped the field of neuroscience discover those mechanisms, feeding into a virtuous cycle of advancement between the two fields.
In a paper published in Nature today, DeepMind, Alphabets AI subsidiary, has once again used lessons from reinforcement learning to propose a new theory about the reward mechanisms within our brains. The hypothesis, supported by initial experimental findings, could not only improve our understanding of mental health and motivation. It could also validate the current direction of AI research toward building more human-like general intelligence.
Sign up for The Algorithm artificial intelligence, demystified
At a high level, reinforcement learning follows the insight derived from Pavlovs dogs: its possible to teach an agent to master complex, novel tasks through only positive and negative feedback. An algorithm begins learning an assigned task by randomly predicting which action might earn it a reward. It then takes the action, observes the real reward, and adjusts its prediction based on the margin of error. Over millions or even billions of trials, the algorithms prediction errors converge to zero, at which point it knows precisely which actions to take to maximize its reward and so complete its task.
It turns out the brains reward system works in much the same waya discovery made in the 1990s, inspired by reinforcement-learning algorithms. When a human or animal is about to perform an action, its dopamine neurons make a prediction about the expected reward. Once the actual reward is received, they then fire off an amount of dopamine that corresponds to the prediction error. A better reward than expected triggers a strong dopamine release, while a worse reward than expected suppresses the chemicals production. The dopamine, in other words, serves as a correction signal, telling the neurons to adjust their predictions until they converge to reality. The phenomenon, known as reward prediction error, works much like a reinforcement-learning algorithm.
DeepMinds new paper builds on the tight connection between these natural and artificial learning mechanisms. In 2017, its researchers introduced an improved reinforcement-learning algorithm that has since unlocked increasingly impressive performance on various tasks. They now believe this new method could offer an even more precise explanation of how dopamine neurons work in the brain.
Specifically, the improved algorithm changes the way it predicts rewards. Whereas the old approach estimated rewards as a single numbermeant to equal the average expected outcomethe new approach represents them more accurately as a distribution. (Think for a moment about a slot machine: you can either win or lose following some distribution. But in no instance would you ever receive the average expected outcome.)
The modification lends itself to a new hypothesis: Do dopamine neurons also predict rewards in the same distributional way?
To test this theory, DeepMind partnered with a group at Harvard to observe dopamine neuron behavior in mice. They set the mice on a task and rewarded them based on the roll of dice, measuring the firing patterns of their dopamine neurons throughout. They found that every neuron released different amounts of dopamine, meaning they had all predicted different outcomes. While some were too optimistic, predicting higher rewards than actually received, others were more pessimistic, lowballing the reality. When the researchers mapped out the distribution of those predictions, it closely followed the distribution of the actual rewards. This data offers compelling evidence that the brain indeed uses distributional reward predictions to strengthen its learning algorithm.
DeepMind
This is a nice extension to the notion of dopamine coding of reward prediction error, wrote Wolfram Schultz, a pioneer in dopamine neuron behavior who wasnt involved in the study, in an email. It is amazing how this very simple dopamine response predictably follows intuitive patterns of basic biological learning processes that are now becoming a component of AI.
The study has implications for both AI and neuroscience. First, it validates distributional reinforcement learning as a promising path to more advanced AI capabilities. If the brain is using it, its probably a good idea, said Matt Botvinick, DeepMinds director of neuroscience research and one of the lead authors on the paper, during a press briefing. It tells us that this is a computational technique that can scale in real-world situations. Its going to fit well with other computational processes.
Second, it could offer an important update to one of the canonical theories in neuroscience about reward systems in the brain, which in turn could improve our understanding of everything from motivation to mental health. What might it mean, for example, to have pessimistic and optimistic dopamine neurons? If the brain selectively listened to only one or the other, could it lead to chemical imbalances and induce depression?
Fundamentally, by further decoding processes in the brain, the results also shed light on what creates human intelligence. It gives us a new perspective on what's going on in our brains during everyday life, Botvinick said.
Read the original post:
An algorithm that learns through rewards may show how our brain does too - MIT Technology Review
- Elusive Cures: Why Neuroscience Hasnt Solved Brain Disordersand How We Can Change That, an excerpt - The Transmitter - June 10th, 2025 [June 10th, 2025]
- Nanowire Retinal Implant Restores Vision and Sees Infrared - Neuroscience News - June 10th, 2025 [June 10th, 2025]
- KLOTHO NEUROSCIENCE, INC. ANNOUNCES AN APPROACH TO INCREASE LONGEVITY AND HEALTHY LIFE SPAN - REPLACE A SILENCED GENE CALLED ALPHA-KLOTHO... - June 10th, 2025 [June 10th, 2025]
- Obeying Orders Lowers Moral Responsibility Perception in the Brain - Neuroscience News - June 10th, 2025 [June 10th, 2025]
- Family Time and Parental Bonding Linked to Better Sleep in Preteens - Neuroscience News - June 10th, 2025 [June 10th, 2025]
- Study Links Gut Bacteria to MS Risk and Reveals Key Triggers - Neuroscience News - June 10th, 2025 [June 10th, 2025]
- Alto Neuroscience Announces Acquisition of Novel Dopamine Agonist Combination Product Candidate, Adding Late-Stage Readout in Treatment Resistant... - June 10th, 2025 [June 10th, 2025]
- Sleep-Wake Perception Intact in Many With Insomnia - Neuroscience News - June 10th, 2025 [June 10th, 2025]
- Cannabis Use Among U.S. Seniors Has Surged 46% in Just Two Years - Neuroscience News - June 10th, 2025 [June 10th, 2025]
- Anoki Integrates With Magnite While Seedtag Adds Neuroscience To Find Emotional Connections - TVREV - June 10th, 2025 [June 10th, 2025]
- Neuroscience: Knowing People's Names Makes You Empathize With Them Better. (By the Way, My Name Is Bill) - Inc.com - June 1st, 2025 [June 1st, 2025]
- Kindness Sparks Cooperation by Boosting Social Connectedness - Neuroscience News - June 1st, 2025 [June 1st, 2025]
- Neuroscience and Genetics of ADHD and Neurodevelopment - Neuroscience News - June 1st, 2025 [June 1st, 2025]
- The Neuroscience of Cancer - Harvard Medicine Magazine - June 1st, 2025 [June 1st, 2025]
- Singing to Infants Boosts Mood and Bonding - Neuroscience News - June 1st, 2025 [June 1st, 2025]
- Neuroscience: Go Swimming and Your Brain Will Thank You - Inc.com - June 1st, 2025 [June 1st, 2025]
- Blood Fat Links Found Between Heart Risk and Alzheimers - Neuroscience News - June 1st, 2025 [June 1st, 2025]
- Tiny Brain Cell Cluster Found to Drive Obesity and Overeating - Neuroscience News - June 1st, 2025 [June 1st, 2025]
- New Neuroscience Shows Why Its So Important to Read Aloud to Your Kids - Inc.com - June 1st, 2025 [June 1st, 2025]
- Cats Can Recognize Their Owners by Smell Alone - Neuroscience News - June 1st, 2025 [June 1st, 2025]
- St. Lukes Center for Neuroscience Helps Those with Same Illness as Billy Joel - TAPinto - June 1st, 2025 [June 1st, 2025]
- These triplets who graduated from Georgia Tech with neuroscience degrees head to medical school - 11Alive.com - June 1st, 2025 [June 1st, 2025]
- Gabe Newell co-founded a neuroscience company in 2019 and its first brain chip is expected to ship later this year - PC Gamer - June 1st, 2025 [June 1st, 2025]
- Next-Gen Painkiller Blocks Pain Without the High - Neuroscience News - May 21st, 2025 [May 21st, 2025]
- Inflammation Triggers Repetitive Behaviors in ASD and OCD - Neuroscience News - May 21st, 2025 [May 21st, 2025]
- Astrocytes Take Center Stage in Brain Function and Behavior - Neuroscience News - May 21st, 2025 [May 21st, 2025]
- Setting the SCENE for Neuroscience Breakthroughs - Mellon College of Science - Carnegie Mellon University - May 21st, 2025 [May 21st, 2025]
- Long COVID Brain Fog Linked to Inflammation and Stress Markers - Neuroscience News - May 21st, 2025 [May 21st, 2025]
- Warren Buffett Says Youre Too Focused on the Negative. Heres the Neuroscience Showing Hes Right - Inc.com - May 21st, 2025 [May 21st, 2025]
- Reading Fiction Boosts Empathy and Fights Loneliness - Neuroscience News - May 21st, 2025 [May 21st, 2025]
- Astrocytes, Not Neurons, Drive Brains Attention and Alertness - Neuroscience News - May 21st, 2025 [May 21st, 2025]
- Mapping Young Minds: The Neuroscience Behind Babilou Family Singapore's Revolutionary Education Model - PR Newswire - May 21st, 2025 [May 21st, 2025]
- Loneliness Linked to 24% Higher Risk of Hearing Loss - Neuroscience News - May 21st, 2025 [May 21st, 2025]
- Eureka Moments Double Memory by Rewiring the Brain - Neuroscience News - May 21st, 2025 [May 21st, 2025]
- Scientists use brain activity to predict StarCraft II skill in fascinating new neuroscience research - psypost.org - May 21st, 2025 [May 21st, 2025]
- Stress of Long Work Hours May Physically Alter the Brain - Neuroscience News - May 21st, 2025 [May 21st, 2025]
- The Neuroscience of Dopamine: How to Triumph Over Constant Wanting - Next Big Idea Club - May 12th, 2025 [May 12th, 2025]
- Verbal Abuse in Childhood Rewires the Developing Brain - Neuroscience News - May 12th, 2025 [May 12th, 2025]
- Heavy Social Media Use Linked to Believing and Spreading Fake News - Neuroscience News - May 12th, 2025 [May 12th, 2025]
- Brain Cells That Predict What Comes Next, Even When Its New - Neuroscience News - May 12th, 2025 [May 12th, 2025]
- The Temperature | Better happiness through neuroscience - The Colorado Sun - May 12th, 2025 [May 12th, 2025]
- Genes Strongly Influence When Babies Take Their First Steps - Neuroscience News - May 12th, 2025 [May 12th, 2025]
- Using Music to Detect Concussion in Kids - Neuroscience News - May 12th, 2025 [May 12th, 2025]
- Boosting Klotho Protein Slows Aging and Enhances Health - Neuroscience News - May 12th, 2025 [May 12th, 2025]
- Eye Movements Set the Speed Limit for What You Can See - Neuroscience News - May 12th, 2025 [May 12th, 2025]
- Seeing Is Believing: How We Judge AI as Creative or Not - Neuroscience News - May 12th, 2025 [May 12th, 2025]
- Exercise Boosts Stem Cell Therapy for Parkinsons - Neuroscience News - May 12th, 2025 [May 12th, 2025]
- Aspen Neuroscience Announces 6-Month ASPIRO Phase 1/2a Clinical Trial Results of Personalized Cell Therapy for Parkinson's Disease - BioSpace - May 12th, 2025 [May 12th, 2025]
- Sheffield Lab: Understanding the neuroscience of memories - University of Chicago News - April 27th, 2025 [April 27th, 2025]
- Prenatal Stress Leaves Lasting Molecular Imprints on Babies - Neuroscience News - April 27th, 2025 [April 27th, 2025]
- Dean Buonomano explores the concept of time in neuroscience and physics - The Transmitter - April 27th, 2025 [April 27th, 2025]
- Psychedelics May Reset Brain-Immune Link Driving Fear and Anxiety - Neuroscience News - April 27th, 2025 [April 27th, 2025]
- Infant Social Skills Thrive Despite Hardship - Neuroscience News - April 27th, 2025 [April 27th, 2025]
- From Cologne to Country Roads: One scientist's interdisciplinary journey to build bridges (and robotic insects) between neuroscience and engineering -... - April 27th, 2025 [April 27th, 2025]
- Eyes Reveal Intentions Faster Than We Think - Neuroscience News - April 27th, 2025 [April 27th, 2025]
- Immune Resilience Identified as Key to Healthy Aging and Longevity - Neuroscience News - April 27th, 2025 [April 27th, 2025]
- Energy Starvation Triggers Dangerous Glutamate Surges in the Brain - Neuroscience News - April 27th, 2025 [April 27th, 2025]
- WVU Rockefeller Neuroscience Institute first in U.S. to successfully test innovative brain-computer interface technology to decode speech and language... - April 27th, 2025 [April 27th, 2025]
- Microglia Reprogrammed to Deliver Precision Alzheimers Therapies - Neuroscience News - April 27th, 2025 [April 27th, 2025]
- Neuroscience Says Music Is an Emotion Regulation Machine. Heres What to Play for Happiness, Productivity, or Deep Thinking - Inc.com - April 19th, 2025 [April 19th, 2025]
- Early Maternal Affection Shapes Key Personality Traits for Life - Neuroscience News - April 19th, 2025 [April 19th, 2025]
- Elons new neuroscience major highlighted by Greensboro News & Record - Elon University - April 19th, 2025 [April 19th, 2025]
- Brain Blast event at St. Lawrence University teaches local students neuroscience - North Country Now - April 19th, 2025 [April 19th, 2025]
- AI Reveals What Keeps People Committed to Exercise - Neuroscience News - April 19th, 2025 [April 19th, 2025]
- The "Holy Grail" of Neuroscience? Researchers Create Stunningly Accurate Digital Twin of the Brain - The Debrief - April 19th, 2025 [April 19th, 2025]
- Annenberg School Vice Dean Emily Falk publishes book on the neuroscience of decision-making - The Daily Pennsylvanian - April 19th, 2025 [April 19th, 2025]
- Music-Induced Chills Trigger Natural Opioids in the Brain - Neuroscience News - April 19th, 2025 [April 19th, 2025]
- What We Value: The Neuroscience of Choice and Change - think.kera.org - April 19th, 2025 [April 19th, 2025]
- Kile takes top neuroscience post at Sutter Health as system pushes to align care, expand trials - The Business Journals - April 19th, 2025 [April 19th, 2025]
- A Grain of Brain, 523 Million Synapses, and the Most Complicated Neuroscience Experiment Ever Attempted - SciTechDaily - April 19th, 2025 [April 19th, 2025]
- Mild Brain Stimulation Alters Decision-Making Speed and Flexibility - Neuroscience News - April 19th, 2025 [April 19th, 2025]
- Cannabis studies were informing fundamental neuroscience in the 1970s - Nature - April 10th, 2025 [April 10th, 2025]
- To make a meaningful contribution to neuroscience, fMRI must break out of its silo - The Transmitter - April 10th, 2025 [April 10th, 2025]
- Steve Jobss Unexpected Secret to Being More Creative (Backed by Neuroscience) - Inc.com - April 10th, 2025 [April 10th, 2025]
- Challenging Decades of Neuroscience: Brain Cells Are More Plastic Than Previously Thought - SciTechDaily - April 10th, 2025 [April 10th, 2025]
- Q&A: Lundbecks head of R&D on letting biology speak in neuroscience - Endpoints News - April 10th, 2025 [April 10th, 2025]
- Why it's hard to study the neuroscience of psychedelics : Short Wave - NPR - April 10th, 2025 [April 10th, 2025]
- Fear Sync: How Males and Females Respond to Stress Together - Neuroscience News - April 10th, 2025 [April 10th, 2025]
- Chemotherapy Disrupts Brain Connectivity - Neuroscience News - April 10th, 2025 [April 10th, 2025]
- Newly awarded NIH grants for neuroscience lag 77 percent behind previous nine-year average - The Transmitter - April 10th, 2025 [April 10th, 2025]