Close Menu
    Trending
    • Taylor Swift’s Brutal Wedding Invite Snub To Karlie Kloss Revealed
    • Malaysia slams ‘deafening silence’ over scrapped Norway missile deal
    • France arrests hundreds of rioters nationwide as PSG win Champions League | Football News
    • WCF winners, losers: Victor Wembanyama, SGA rise to the occasion, Chet Holmgren goes missing in Game 7 flop
    • Your workforce doesn’t need more AI. It needs play
    • How big can a galaxy get?
    • The Real Reason Russia Would Invade Europe
    • Jennifer Lopez’s 18-Year-Old Child Unveils New Name, Pronouns
    Benjamin Franklin Institute
    Sunday, May 31
    • Home
    • Politics
    • Business
    • Science
    • Technology
    • Arts & Entertainment
    • International
    Benjamin Franklin Institute
    Home»Science»How an intern helped build the AI that shook the world
    Science

    How an intern helped build the AI that shook the world

    Team_Benjamin Franklin InstituteBy Team_Benjamin Franklin InstituteMarch 7, 2026No Comments8 Mins Read
    Share Facebook Twitter Pinterest Copy Link LinkedIn Tumblr Email VKontakte Telegram
    Share
    Facebook Twitter Pinterest Email Copy Link


    AlphaGo’s victory braodcast on TV

    Im Hun-jung/Yonhap/AP Photo via Getty Images

    In March 2016, Google DeepMind’s artificial intelligence system AlphaGo shocked the world. In a stunning five-match series of Go, the ancient Chinese board game, the AI beat the world’s best player, Lee Sedol – a moment that was televised in front of millions and hailed by many as a historic moment in the development of artificial intelligence.

    Chris Maddison, now a professor of artificial intelligence at the University of Toronto, was then a master’s student and helped get the project off the ground. It all began when Ilya Sutskever, who later went on to found OpenAI, got in touch…

    Alex Wilkins: How did the idea for AlphaGo first come about?

    Chris Maddison: Ilya [Sutskever] gave me the following argument for why we should be working on Go. He said, Chris, do you think when an expert player looks at the Go board, they can pick the best move in half a second? If you think they can, then that means that you can learn a pretty good policy to pick the best move using a neural net.

    The reason is that half a second is about the time it takes for your visual cortex to do one forward pass [a round of processing], and we already knew from ImageNET [an important AI image-recognition competition] that we’re pretty good at approximating things that only take one forward pass of your visual cortex.

    I bought that argument, so I decided to join [Google Brain] as an intern in the summer of 2014.

    How did AlphaGo develop from there?

    When I joined, there was another little team at DeepMind that I was going to work with, which was Aja Huang and David Silver, that had started working on Go. It was basically my charge to start building the neural networks. It was a dream.

    There were a bunch of different approaches that we tried, and a lot of the initial things we tried failed. Eventually, I just got frustrated and tried the dumbest, simplest thing, which was to try to predict the next move that an expert would make in a given board position, training a neural network on a big corpus of expert games. And that turned out to be the approach that really got us off the ground.

    By the end of the summer, we hosted a little match with DeepMind’s Thore Graepel, who considered himself a decent Go player, and my networks beat him. DeepMind then started to be convinced that this was going to be a real thing and started putting resources towards it and building a big team around it.

    How difficult of a challenge was it seen beating Lee Sedol?

    I remember in the summer of 2014, we practically had Lee Sedol’s portrait on our desk next to us. I’m not a Go player, but Aja [Huang] is. Every time I would build a new network, it would get a little bit better, and I would turn to Aja and I’d say, OK, we’re a little bit better, how close are we to Lee Sedol? And Aja would turn to me and say, Chris, you don’t understand. Lee Sedol is one stone from God.

    You left the AlphaGo team before the big event. Why?

    David [Silver] said we’d like to keep you on and really drive this project to the next level, and, in retrospect, this was maybe one of the stupider decisions I made, I turned him down. I said I think I need to focus on my PhD, I’m an academic at heart. I went back to my PhD and loosely consulted with the project from that point on. I’m a little proud to say it took them a while to beat my neural networks. But then, ultimately, the artefact that played Lee Sedol was the product of a big engineering effort and a big team.

    What was the atmosphere like in Seoul when AlphaGo won?

    Being there in Seoul at that moment was hard to express. It was emotional. It was intense. There was a sense of anxiety. You go in confident, but you never know. It’s like a sports game. Statistically speaking, you’re the better player, but you never know how it’s going to shake out. I remember being in the hotel where we played the matches and looking out the window. We were at a high-enough level that you could look out onto one of the major city intersections. I realised there was a big screen, sort of like Times Square, that was showing our match. And then I looked along the sidewalks, and people were just lined up standing looking at the screen. I had heard numbers like hundreds of millions of people in China watched the first game, but I remember that moment as like, oh God, we’ve really stopped East Asia in its tracks.

    How important has AlphaGo been for AI more generally?

    A lot has changed on a surface level about the world of large language models (LLMs), they are now quite different in some ways from AlphaGo, but actually there’s an underlying technological thread that really hasn’t changed.

    So the first part of the algorithm is to train a neural network to predict the next move. Today’s LLMs begin with what we call pretraining to predict the next word, from a big corpus of human text found largely on the internet.

    For the second step in AlphaGo, we took the information from that human corpus that was compressed into these neural networks, and we refined it using reinforcement learning, to align the behaviour of the system towards the goal of winning games.

    When you learn to predict an expert’s next move, they are trying to win, but that’s not the only thing that explains the next move. Perhaps they don’t understand what the best move is, perhaps they made a mistake, so you need to align the overall system with your true goal, which in the case of AlphaGo was winning.

    In large language models, it’s the same after pretraining. The networks are not aligned with how we want to use them, and so we do a series of reinforcement learning steps that align the networks with our goals.

    In some ways, not much has changed.

    Does it tell us anything about where we can expect AIs to succeed?

    It has consequences in terms of what we choose to focus on. If you’re worried about making progress on important problems, the key bottlenecks that you should be worried about are do you have enough data to do pretraining, and do you have reward signals to do post-training. If you don’t have those ingredients, there’s no amount of clever – you know, this algorithm versus that algorithm – that’s going to get you off the ground.

    Did you feel any sympathy for Lee Sedol?

    Lee Sedol had been this idol over the summer of 2014, this unachievable milestone. To then suddenly be there in person, watching the matches, his stress, his anxiety, his realisation that this was a much worthier opponent than maybe he had thought going in, that was very stressful. You don’t want to put someone in that position. When he lost the match, he apologised to humanity, and said, “This is my failing, not yours.” That was tragic.

    There is also a custom in Go to review the match with your opponent. Someone wins or loses, but you review the match at the end, unwind the game and explore variations with each other. Lee Sedol couldn’t do that because AlphaGo wasn’t human, so instead he had his friends come in and review the match, but it’s just not the same. There felt something heartbreaking about that.

    But I didn’t appreciate all the man-versus-machine narratives around the match, because a team of people built AlphaGo. That was the effort of a tribe building an artefact that could achieve excellence in a human game. It was ultimately the artefact that all our blood, sweat and tears went into.

    Do you think there is still a place for humans in the world as AI accomplishes more human thinking work?

    We are learning more about the game of Go, and if we think that game is beautiful, which we do, and AIs can teach us more about that beauty, there’s a lot of inherent good in that as well. There’s a difference between goals and purposes. The goal of the game of Go is to win, but that’s not its only purpose – one purpose is to have fun. Board games are not destroyed by the presence of AI; chess is a thriving industry. We still appreciate the intrigue and the human achievement of that sport.

    Topics:



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Telegram Copy Link

    Related Posts

    Science

    How big can a galaxy get?

    May 31, 2026
    Science

    Why high-bandwidth memory is a bottleneck for AI chips

    May 31, 2026
    Science

    NASA’s Hubble captures gorgeous new photo of a spiral galaxy as it wanders through the Virgo Cluster

    May 30, 2026
    Science

    New protein-folding AI vastly expands on Alphafold’s efforts

    May 30, 2026
    Science

    Mirror life: Scientists clash over threat of lab-engineered bacteria

    May 30, 2026
    Science

    The best new science-fiction books of June 2026 include novels from Adrian Tchaikovsky and M. John Harrison

    May 30, 2026
    Editors Picks

    Three myths about the Russia economic war | Russia-Ukraine war

    February 24, 2026

    The Push to Make Semiconductors in Space Just Took a Serious Leap Forward

    January 4, 2026

    Cubans React to Arrival of Russian Oil Tanker

    March 31, 2026

    Rob Gronkowski Hilariously Recovers From Near Wipeout On NYE

    January 2, 2026

    AI Shifts Expectations for Entry Level Jobs

    December 25, 2025
    About Us
    About Us

    Welcome to Benjamin Franklin Institute, your premier destination for insightful, engaging, and diverse Political News and Opinions.

    The Benjamin Franklin Institute supports free speech, the U.S. Constitution and political candidates and organizations that promote and protect both of these important features of the American Experiment.

    We are passionate about delivering high-quality, accurate, and engaging content that resonates with our readers. Sign up for our text alerts and email newsletter to stay informed.

    Latest Posts

    Taylor Swift’s Brutal Wedding Invite Snub To Karlie Kloss Revealed

    May 31, 2026

    Malaysia slams ‘deafening silence’ over scrapped Norway missile deal

    May 31, 2026

    France arrests hundreds of rioters nationwide as PSG win Champions League | Football News

    May 31, 2026

    Subscribe for Updates

    Stay informed by signing up for our free news alerts.

    Paid for by the Benjamin Franklin Institute. Not authorized by any candidate or candidate’s committee.
    • Privacy Policy
    • About us
    • Contact us

    Type above and press Enter to search. Press Esc to cancel.