An AI Program Is Beating Poker Champions for the First Time

By Katherine Arcement, The Washington Post | Updated: 25 January 2017 13:30 IST

The night before his newest poker competition was set to begin, Carnegie Mellon's Tuomas Sandholm and his PhD student Noam Brown sat down to play a little No Limit Texas Hold'em against the main competition: the artificial intelligence program they designed called "Libratus."

"I was totally wrecked," Sandholm told The Washington Post. The machine destroyed him. But he is not a serious poker player, so that's not such a big achievement.

For the past 13 days, however, Libratus has been facing off against four world-champion poker players in a Pittsburgh casino. If it beats them the way it beat Sandholm, it will be an enormous breakthrough.

So far, after 67,000 hands, Libratus has won $701,242 worth of chips after starting from a balance of zero. That means, of course, that the champions have lost that same amount, $701,242. (They're not playing with real money but rather for a lump-sum prize of $200,000 that will be divided at the end of the tournament.)

There are 53,000 hands left to play and if this trend continues, it will be the first time that AI has beaten humans at poker.

That would be a huge achievement. Poker is not like other games, such as chess, where AI has emerged victorious thanks to advanced algorithms. Poker is much harder for AI. As the MIT Technology Review explained:

"Poker requires reasoning and intelligence that has proven difficult for machines to imitate. It is fundamentally different from checkers, chess, or Go, because an opponent's hand remains hidden from view during play. In games of 'imperfect information,' it is enormously complicated to figure out the ideal strategy given every possible approach your opponent may be taking. And no-limit Texas Hold'em is especially challenging because an opponent could essentially bet any amount."

"Libratus has had the lead since the outset," Sandholm says.

On Monday, at the tail end of Day 13, the four poker players, Jimmy Chou, Dong Kim, Jason Les, and Daniel McAulay, sat in the dim blue light of computer screens in Pittsburgh's Rivers Casino, playing virtual hands of cards against a virtual opponent.

For Sandholm, a computer scientist with a 126-page C.V., this is the culmination of twelve years of research. Starting in 2004 at Carnegie Mellon University, Sandholm began studying abstract algorithms for sequential imperfect information games. A "perfect information" game is one like chess, where both players see the whole board and are in a good position to anticipate the opponent's next possible move. An "imperfect information" game is one in which players, on their turn, do not know all the information in the game, such as the other person's cards.

Poker is an "imperfect information" game because players hide their hands, limiting the capacity of the opponent to calculate what their next move should be, thus allowing players to bluff.

The uses of the exercise go far beyond poker. Warfare and cyberwarfare are both areas in which the same kind of strategic reasoning could be useful.

Sandholm settled on No Limit Texas Hold'em poker as a model that could be extrapolated to real-life "imperfect" situations like cyber-security or military strategy. He wanted a general purpose algorithm that would excel in strategic reasoning.

In the course of his research, time after time, his algorithms failed against humans in the game. Even as late as May 2015, when Sandholm organized a similar poker competition at Rivers Casino pitting AI program "Claudico" against four champion poker players, Claudico lost by $732,713 in chips.

"Where a human might place a bet worth half or three-quarters of the pot, Claudico would sometimes bet a miserly 10 percent or an over-the-top 1,000 percent," Carnegie Mellon explained in a 2015 news release. As Doug Polk, a player against the program, explained at the time to CMU, "Betting $19,000 to win a $700 pot just isn't something that a person would do."

However, Sandholm's team did win the Annual Computer Poker Competition against other AI research teams twice in a row.

"Different research builds on results," he explains. None of the teams had succeeded - until Libratus.

Now, in the current competition in Pittsburgh, "AI is making moves humans would never make. AI is a Martian playing poker," says Sandholm. Libratus, concocting a strategy based on its knowledge of the rules of No Limit Texas Hold'em and the moves you can make in the game, began beating even the two champion players who had played Sandholm's prior AI program, Claudico.

It went like this:

27,000 hands in, Libratus had a $50,513 lead.

67,000 hands in, Libratus had grown that lead nearly fourteenfold, to $701,242 in chips.
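
For readers who want to check the arithmetic behind those two checkpoints, here is a minimal sketch in Python. The chip totals and hand counts are the ones reported in this article. The variable names are illustrative, and the per-hand average is a simple division that is not an official tournament statistic.

```python
import math

# Chip leads reported in the article at two checkpoints.
lead_at_27k_hands = 50_513
lead_at_67k_hands = 701_242
hands_played = 67_000

# How many times larger the later lead is than the earlier one.
growth = lead_at_67k_hands / lead_at_27k_hands

# The same growth expressed as a number of successive doublings.
doublings = math.log2(growth)

# Average chips won per hand over the whole match so far
# (an illustrative figure, not one quoted by the organizers).
avg_per_hand = lead_at_67k_hands / hands_played

print(f"growth factor: {growth:.1f}x")
print(f"equivalent doublings: {doublings:.1f}")
print(f"average chips won per hand: ${avg_per_hand:.2f}")
```

The growth factor comes out just under 14, which corresponds to fewer than four doublings.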

The challenge for Libratus was that while the AI program remained constant, the human players were constantly studying, learning, and able to improve. They also had extra motivation to win: prize money and social pressure. On Day 9, a man said to Les, "Hey, you're letting us down!"

Right now the AI is in first place. Sandholm has begun receiving, as he describes, "a lot of nice emails" from other AI researchers about Libratus's success. Meanwhile, the human poker players are streaming their games on Twitch and live-tweeting their results: "Humans end up winning $93k for the day. #BrainsVsAI" Les tweeted on January 23rd.

The competition lasts seven more days, unless they add on an extra day to account for the human poker players' relative lack of speed. Sandholm won't be popping any champagne yet, but by the end of the month that may no longer hold true.

© 2017 The Washington Post

 
