China's DeepSeek and Qwen AI Beat US Rivals In Crypto Trading Contest (yahoo.com) 31

Posted by BeauHD on Tuesday October 28, 2025 @06:30PM from the would-you-look-at-that dept.

hackingbear shares a report from Crypto News: Two Chinese artificial intelligence (AI) models, DeepSeek V3.1 and Alibaba's Qwen3-Max, have taken a commanding lead over their US counterparts in a live real-world real-money cryptocurrency trading competition, posting triple-digit gains in less than two weeks. According to Alpha Arena, a real-market trading challenge launched by US research firm Nof1, DeepSeek's Chat V3.1 turned an initial $10,000 into $22,900 by Monday, a 126% increase since trading began on October 18, while Qwen 3 Max followed closely with a 108% return.

In stark contrast, US models lagged far behind. OpenAI's GPT-5 posted the worst performance, losing nearly 60% of its portfolio, while Google DeepMind's Gemini 2.5 Pro showed a similar 57% decline. xAI's Grok 4 and Anthropic's Claude 4.5 Sonnet fared slightly better, returning 14% and 23% respectively. "Our goal with Alpha Arena is to make benchmarks more like the real world -- and markets are perfect for this," Nof1 said on its website.

China's DeepSeek and Qwen AI Beat US Rivals In Crypto Trading Contest

This discussion has been archived. No new comments can be posted.

Load All Comments

Search 31 Comments Log In/Create an Account

Comments Filter:

Well of course (Score:2, Flamebait)

by RitchCraft ( 6454710 ) writes:

We all know the Chinese would never cheat.
- - Re: (Score:2)
    
    by sabbede ( 2678435 ) writes:
    
    Or we could just say the true thing - that the CCP cannot be trusted. I wouldn't be shocked if they manipulated the markets in order to guarantee an outcome. I recall there being quite a lot of crypto held in China, though that may no longer be true.
Sounds like a very poorly disguised advertisement (Score:5, Insightful)

by ffkom ( 3519199 ) writes: on Tuesday October 28, 2025 @06:49PM (#65757084)

... for scammy crypto-things sold to gullible people. But even if that "contest" had any other background, the results would hardly be reproducible at any future point in time. You could just as well judge LLMs by their ability to predict dice rolls.

- Re: (Score:2)
  
  by Nrrqshrr ( 1879148 ) writes:
  
  Exactly. The stock market is so disconnected from reality that it's basically a casino. The cryptocurrencies market is even worse.
  I would rather see them pit these AIs in something actually measurable, but then we would all realize that these are just tools and there's no inherent understanding behind their decisions.
  - Re: (Score:2)
    
    by ffkom ( 3519199 ) writes:
    
    That LLM poker tournament [pokerbattle.ai] is probably more entertaining, and unlike the crypto scam does not pretend to be about anything but gambling.
  - Re: (Score:1)
    
    by retchdog ( 1319261 ) writes:
    
    ackshually... casinos are much more tightly regulated.
Still waiting (Score:2)

by Waffle Iron ( 339739 ) writes:

I'll know that AI has achieved a major milestone when one of them eventually says:
"Crypto is a strange game. The only winning move is not to play."
- - Re: (Score:2)
    
    by Waffle Iron ( 339739 ) writes:
    
    Are they smart enough to know to pull those gains out before the tulip market crashes?
    - Re: (Score:2)
      
      by ArchieBunker ( 132337 ) writes:
      
      I’ve been hearing slashdot talk about tulips for a solid decade now. No sign of collapse in sight.
      - Re: (Score:2)
        
        by Waffle Iron ( 339739 ) writes:
        
        It was demonstrated long ago that most all trading markets are chaotic. By definition, you won't see a sign of collapse in sight.
        Unlike useful markets, (but similar to 17th century tulips), crypto has no intrinsic value. Therefore, there is no bottom to a potential crash.
        
        Re: (Score:2)
        
        by DamnOregonian ( 963763 ) writes:
        
        Correction:
        
        There is potentially no bottom to a crash.
        In practice, however, there is. And it's been demonstrated.
        
        Of course, the next one could be different.
    - Re: (Score:2)
      
      by DamnOregonian ( 963763 ) writes:
      
      A crash could be decades out, or not at all.
      The market works on investor confidence, and there are a lot of believers.
      
      As for wise enough to pull money out, wellllll...
      The Chinese LLMs are ahead, so they've certainly proven to be smart enough to pull their money out to profit in the short term, while one set of US LLMs seems to be performing just about as wall as HODL, and 2 of them seem to be pulling out low and buying high, or something nonsensical.
      
      Frankly, the 2 Chinese bots continued performance in
Random Walk (Score:2)

by quantaman ( 517394 ) writes:

Ignoring the whole fact that we don't even know how they set up these bots to trade, it looks pretty damn random.
- Re: (Score:2)
  
  by DamnOregonian ( 963763 ) writes:
  
  It's all above board. Pretty standard LLM harness. Go read about it. The runners of the competition aren't invested in an outcome, it's just a game.
  
  While the game operates fairly, I think that might actually be the wrong way to go about this.
  Every LLM I've ever worked with in a harness (agentically) has had quirks for how you need to "talk" to it to get it to perform best.
  It's very possible that there is an unintentional bias in harness performance for the Chinese LLMs.
  - Re: (Score:2)
    
    by DamnOregonian ( 963763 ) writes:
    
    Also, no. Not random at all.
    
    You have a couple bots with significant gains, a couple of bots that have basically followed the market, and a couple of bots that have vastly underperformed the market.
    
    The middle group are performing at basically the level of a random coin flip.
    The top group are outperforming a coin flip, and the bottom group are underperforming a coin flip.
    What's important is that they're consistently doing so, over time.
    
    The existence of the upper and lower groups indicates that the und
Fire Idea (Score:3)

by dohzer ( 867770 ) writes: on Tuesday October 28, 2025 @08:10PM (#65757300)

When cryptocurrency isn't wasting enough energy, you can always throw AI at it.

- Re: (Score:2)
  
  by DamnOregonian ( 963763 ) writes:
  
  Ya, that part is pretty funny.
  This might be the most wasteful endeavor ever undertaken, beyond a Republican energy plan.
need to know risk (Score:2)

by buddyglass ( 925859 ) writes:

Kinda need to know what level of risk the Chinese AIs took on. Getting big gains by making risky bets and getting lucky = meh.
- Re: (Score:2)
  
  by DamnOregonian ( 963763 ) writes:
  
  Luck is random. The results are clearly not random.
  
  Rather, if risk plays a factor, it means the market on average favors risk takers.
  i.e., they will come out ahead.
Some extremely expensive programs playing dice (Score:3)

by doragasu ( 2717547 ) writes: on Wednesday October 29, 2025 @02:40AM (#65757812)

You would be surprised some get better guesses than other.

- Re: (Score:2)
  
  by DamnOregonian ( 963763 ) writes:
  
  If they were playing dice, you'd be right. However, they're not.

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

China's DeepSeek and Qwen AI Beat US Rivals In Crypto Trading Contest (yahoo.com) 31

China's DeepSeek and Qwen AI Beat US Rivals In Crypto Trading Contest More Login

China's DeepSeek and Qwen AI Beat US Rivals In Crypto Trading Contest

Well of course (Score:2, Flamebait)

Re: (Score:2)

Sounds like a very poorly disguised advertisement (Score:5, Insightful)

Re: (Score:2)

Re: (Score:2)

Re: (Score:1)

Still waiting (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Random Walk (Score:2)

Re: (Score:2)

Re: (Score:2)

Fire Idea (Score:3)

Re: (Score:2)

need to know risk (Score:2)

Re: (Score:2)

Some extremely expensive programs playing dice (Score:3)

Re: (Score:2)

Related Links Top of the: day, week, month.

Slashdot Top Deals

Slashdot