Google's New Hurricane Model Was Breathtakingly Good This Season (arstechnica.com) 43
An anonymous reader quotes a report from Ars Technica: Although Google DeepMind's Weather Lab only started releasing cyclone track forecasts in June, the company's AI forecasting service performed exceptionally well. By contrast, the Global Forecast System model, which is operated by the US National Weather Service, based on traditional physics, and run on powerful supercomputers, performed abysmally. The official data comparing forecast model performance will not be published by the National Hurricane Center for a few months. However, Brian McNoldy, a senior researcher at the University of Miami, has already done some preliminary number crunching.
The results are stunning: A little help in reading the graphic is in order. This chart sums up the track forecast accuracy for all 13 named storms in the Atlantic Basin this season, measuring the mean position error at various hours in the forecast, from 0 to 120 hours (five days). On this chart, the lower a line is, the better a model has performed. The dotted black line shows the average forecast error for official forecasts from the 2022 to 2024 seasons. What jumps out is that the United States' premier global model, the GFS (denoted here as AVNI), is by far the worst-performing model. Meanwhile, at the bottom of the chart, in maroon, is the Google DeepMind model (GDMI), performing the best at nearly all forecast hours.
The difference in errors between the US GFS model and Google's DeepMind is remarkable. At five days, the Google forecast had an error of 165 nautical miles compared to 360 nautical miles for the GFS model, more than twice as bad. This is the kind of error that causes forecasters to completely disregard one model in favor of another. But there's more. Google's model was so good that it regularly beat the official forecast from the National Hurricane Center (OFCL), which is produced by human experts looking at a broad array of model data. The AI-based model also beat highly regarded "consensus models," including the TVCN and HCCA products. For more information on various models and their designations, see here.
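The metric behind that chart, mean track position error at each forecast lead time, can be sketched in a few lines. This is an illustrative reimplementation, not the NHC's verification code; the function names and data layout here are invented for the example.

```python
# Sketch of the chart's metric: mean great-circle distance, in nautical
# miles, between forecast and observed storm positions at each lead time.
import math

def great_circle_nm(p1, p2):
    """Great-circle distance between two (lat, lon) points, in nautical miles."""
    lat1, lon1, lat2, lon2 = map(math.radians, (*p1, *p2))
    a = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * 3440.065 * math.asin(math.sqrt(a))  # Earth radius ~= 3440.065 nmi

def mean_track_error(forecasts, observed):
    """forecasts/observed: {lead_hour: [(lat, lon), ...]}, aligned by storm fix.
    Returns {lead_hour: mean position error in nautical miles}."""
    return {
        h: sum(great_circle_nm(f, o) for f, o in zip(forecasts[h], observed[h]))
           / len(forecasts[h])
        for h in forecasts
    }
```

A model whose 120-hour line sits at 165 nmi on the chart is simply one whose `mean_track_error` at `h=120`, accumulated over the season's verifiable forecasts, comes out to 165.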
Re: (Score:2)
Why do you say that? TV forecasts are averages over a region. That's by design. Do you really want the TV weather presenter to rattle off rain and temperature and humidity numbers for every block in every city in the whole state you live in? No, they just give indicative data. It's not going to apply to you. It's a guideline.
If you want to know if you're going to be rained on, go get a radar image map with rain data updated every minute, and work out if the clouds will cross your street at the exact moment when you will be standing on it.
Re:If it is half as good as weathernews, count me (Score:4, Informative)
Do you really want the TV weather presenter to rattle off rain and temperature and humidity numbers for every block in every city in the whole state you live in?
That's why I use the Met Office app or website in the UK. It's really nice to have very granular forecasting of rain.
If you want to know if you're going to be rained on, go get a radar image map with rain data updated every minute, and work out if the clouds will cross your street at the exact moment when you will be standing on it.
Or use the maps from forecasters who do that for you.
Re: (Score:2)
Re: (Score:2)
The forecasters don't know where you'll be when it matters.
The good ones produce maps, which are basically like rain radar maps extrapolated forwards in time.
WayDumberThanDirt (Score:1)
WayDumberThanDirt told me Trump was banning hurricanes and increasing tariffs 20% on any countries that reported on them.
Stop The Science !
Re: (Score:1)
retard.
Re: (Score:2)
Trouble is that this is believable.
Breathtakingly bad (Score:1)
The fact that the linked graph requires a paragraph-long explanation indicates that it is breathtakingly bad.
Re: (Score:3)
The second sentence explains it perfectly fine to a layperson, and anyone used to interpreting plots has no real problem even without the supplied explanation. The only points of confusion for someone unfamiliar with the different models are the labels.
Re: (Score:2)
Ars Technica Clickbait or Useful? (Score:5, Informative)
Re: (Score:1)
Agreed. I came here to say the same thing. Hurricane tracks can be fairly predictable but there are some really wild ones that have a mind of their own. I don't think any of the models could predict those mentioned in the Weather Channel's "Strangest Hurricanes" collection. https://weather.com/storms/hur... [weather.com]
Re: (Score:2)
The site seems to be down, but I would hope any academic evaluating this model would have tested it with historic data as well.
Re: (Score:3)
Good on Google (seriously), however (Score:5, Insightful)
They really should've compared it to the European ECMWF. The US's GFS model has fallen way behind the ECMWF in all sorts of ways, and the US government doesn't seem inclined to provide adequate funding to remedy that.
Note that this is not intended as an anti-Trump post; this has been a longer-term issue over multiple administrations, both Democratic and Republican.
Re: (Score:1)
Google probably just pillaged from ECMWF
Re:Good on Google (seriously), however (Score:5, Funny)
Maybe they taught DeepMind how to look up the ECMWF forecasts / predictions!
Re: (Score:2)
They may not do tracks for Atlantic cyclones or something.
Re: (Score:1)
Note that this is not intended as an anti-Trump post
To be fair, the Trump administration did cut the budget of NOAA and the NWS. [science.org]
Re: (Score:2)
This early model comparison does not include the “gold standard” traditional, physics-based model produced by the European Centre for Medium-Range Weather Forecasts. However, the ECMWF model typically does not do better on hurricane track forecasts than the hurricane center or consensus models, which weigh several different model outputs. So it is unlikely to be superior to Google’s DeepMind.
Re: Good on Google (seriously), however (Score:2)
So, does physics predict less well than AI?
Re: (Score:2)
They really should've compared it to the European ECMWF. The US's GFS model has fallen way behind the ECMWF in all sorts of ways, and the US government doesn't seem inclined to provide adequate funding to remedy that.
Note that this is not intended as an anti-Trump post; this has been a longer-term issue over multiple administrations, both Democratic and Republican.
Yeah, the US model is well known to be worse than the European models overall, but sometimes better in select scenarios. Typically I hear forecasters mixing predictions from the various models based on circumstance.
But even ignoring that... I'd expect that a trained model like Googles would dominate a relatively tepid hurricane season. What will be interesting is if it does a better job on an extremely atypical, high impact (landfall) storm system, and can replicate that repeatedly. In general in modeling (
Model improvement has been amazing (Score:4, Interesting)
Anyone bashing the GFS is free to improve it - the code is on github, Fortran required. Replicating the data assimilation network is an exercise for the reader.
Re: Model improvement has been amazing (Score:4, Informative)
Weather prediction has actually gotten quite good. But your weatherman is not going to give you the real weather prediction because it's confusing and highly probabilistic and because of set expectations.
For example, if the data says the chance of rain is 50% the weatherman will tell you it's 75%. Because they've learned viewers interpret 50% as "we have no idea, it's a coin toss, your guess is as good as mine". And because they've learned to err on the side of rain so the viewers don't get angry when they plan a beach day.
Additional complexities might be things like 34% chance of rain if temperatures hit the expected high of 67, but if the wind turns and the temp drops, the chance of rain climbs to 43%. If it rains, the temperature tomorrow should be adjusted down 1.5 degrees for the first six hours, then reach previous expected temps around 2pm.
Too much information.
Re: (Score:2)
Re: (Score:2)
First you have to learn about the various models available and what their limitations are. TropicalTidbits.com has many different charts under "Forecast Models" and windy.com allows you to choose from a few standard models and does a good job visualizing the data. To go deeper you can use a GRIB viewer app like LuckGrib (MacOS/iOS only - there are other viewers out there) which can download subsets of model data from its own servers. Or you can download parts of the models yourself from NOAA's NOMADS repository.
Re: (Score:2)
I'm no hurricane expert (Score:1)
But a season that mostly consists of fish storms seems like not the most challenging task for a prediction model. Who cares what the margin for error is when we're talking hundreds of miles of open ocean?
The real test will be when we have another season with storms that actually end up heading towards the east coast.
Re: I'm no hurricane expert (Score:2)
Do container ships carrying your Amazon purchases care?
Re: (Score:2)
Do container ships carrying your Amazon purchases care?
It's right there in TFS - this was about the Atlantic hurricane season. You might want to look on a map to see where China is. Just sayin'.
Re: I'm no hurricane expert (Score:2)
"Atlantic hurricane season (June 1 to November 30) affects shipping routes by causing delays, rerouting, and increased costs due to port closures, road and rail damage, and dangerous storm conditions. [...]
Most affected areas
U.S. Atlantic and Gulf Coasts: States like Florida, Louisiana, Texas, North and South Carolina, and Georgia are particularly vulnerable."
Humans not so bad (Score:1)
"Google's model was so good that it regularly beat the official forecast from the National Hurricane Center (OFCL), which is produced by human experts looking at a broad array of model data."
This tells me humans do a decent job. Given that performance, I doubt the improvement is worth all that comes with GDMI (which may indirectly be contributing to hurricanes itself).
I Think It's Fantastic (Score:2)
I think the current predictions seem to be remarkably accurate 5-7 days out. It has been a noticeable and dramatic improvement in the last ~5 years.
If Google's models turn out to be even more accurate, that is simply fantastic!
Average track position (Score:2)
Re: (Score:2)
Instead of just the average track error (the dotted black line), I'd be interested in the error of the average track position. In other words, get the track position at each timestamp for all models, average that, then determine the error.
You're describing the consensus models, and they are better than any individual model, at least thus far.
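As a rough illustration of what a consensus track is: average the member models' forecast positions at each lead time, then score that averaged track. The real TVCN and HCCA products apply member selection, weighting, and bias correction that this sketch, with invented names, omits.

```python
# Minimal sketch of a consensus track position: the arithmetic mean of
# several models' forecast (lat, lon) points at one lead time.
def consensus_position(model_positions):
    """model_positions: [(lat, lon), ...], one forecast point per model.
    Returns the unweighted mean position (real consensus products weight
    and bias-correct their members; this sketch does not)."""
    lats = [p[0] for p in model_positions]
    lons = [p[1] for p in model_positions]
    return (sum(lats) / len(lats), sum(lons) / len(lons))
```

Because individual models' track errors tend to scatter in different directions, this averaged position is usually closer to the truth than most members, which is why the consensus lines sit low on the chart.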
No matter how good, (Score:2)
Unsure (Score:2)
Ideal use case for AI (Score:2)
AI is, at its core, a sophisticated pattern recognition system. It's really good at digesting tons of inputs and spotting patterns. Hurricane prediction is exactly the kind of thing these AI models *should* be good at, given the right kinds of input data, and enough of it.