Please create an account to participate in the Slashdot moderation system

AI Slashes Google's Code Migration Time By Half (theregister.com) 74

Posted by msmash on Thursday January 16, 2025 @02:51PM from the ground-breaking-efficiency dept.

Google has cut code migration time in half by deploying AI tools to assist with large-scale software updates, according to a new research paper from the company's engineers. The tech giant used large language models to help convert 32-bit IDs to 64-bit across its 500-million-line codebase, upgrade testing libraries, and replace time-handling frameworks. While 80% of code changes were AI-generated, human engineers still needed to verify and sometimes correct the AI's output. In one project, the system helped migrate 5,359 files and modify 149,000 lines of code in three months.

This discussion has been archived. No new comments can be posted.

AI Slashes Google's Code Migration Time By Half

Load All Comments

Search 74 Comments Log In/Create an Account

Comments Filter:

The research looks extremely weak and thin. BS. (Score:4, Insightful)

by Seven Spirals ( 4924941 ) writes: on Thursday January 16, 2025 @03:01PM (#65094159)

I simply call bullshit. This is an AI vendor that says AI's juice is worth the squeeze. Talk about an unwelcome and untrustworthy source for "research", and from "don't be evil.... PSYCH!" Google, even.

Share
twitter facebook
- Re:The research looks extremely weak and thin. BS. (Score:5, Insightful)
  
  by Junta ( 36770 ) writes: on Thursday January 16, 2025 @03:09PM (#65094183)
  
  Eh, the cited tasks seem to be credibly within the reach of LLM, very tedious, very obvious tasks. The sort of scope that even non-AI approaches often handle to be fair, but anyone who has played with a migration tool and LLMs I think could believe there's a lot of low hanging fruit in code migrations that don't really need human attention but suck with traditional transition tools.
  Of course, this is generally a self-inflicted problem from chosing more fickle ecosystems, but those fickle ecosystems have a lot of mindshare (python and javascript are highly likely to cause you to do big migrations, C and Golang are comparitively less likely to inflict stupid changes for less reason).
  
  Parent Share
  twitter facebook
  - Re: (Score:1)
    
    by Seven Spirals ( 4924941 ) writes:
    
    Python and Ruby change their runtimes and their package formats constantly and it's forever broken if you aren't on the bleeding edge. Other systems like Javascript + NPM or Lua + Rocks do a lot better, but still suck hind tit for the most part. For me, C is the go-to language and have a lot less features (no network package manager) but a lot more survivability and reliability (the shit will actually work without throwing Python tracebacks for the first half hour I fight with it).
    - Re: (Score:2)
      
      by Junta ( 36770 ) writes:
      
      Biggest concern I have with C (and Go) is that when the Java or Python attempt is throwing tracebacks like crazy and the C or Go is going just fine, the C or Go *should* be reporting errors like crazy. Lazy programmers not checking the return code/errno result in a program that seems pretty happy even as it compounds failure upon failure. Go has panic/recover, but that is so frowned upon third party code would never do it even if you would have liked it to.
      - Re: (Score:2)
        
        by account_deleted ( 4530225 ) writes:
        
        Comment removed based on user account deletion
        
        Re: The research looks extremely weak and thin. BS (Score:2)
        
        by stripes ( 3681 ) writes:
        
        If you have an assert covering a case you are able to deal with you need to alter the assert, and every layer between the assert and wherever you are able to actually deal with that state. If you throw an error that you later able to deal with you âoejustâ put the handling in at the layer that you want to handle it. Having to alter each call site is a pain, but not unique. Frequently you need to do similar things in systems with Dependncy Injection when you add a new dependncy you have to route
        
        Re: (Score:2)
        
        by account_deleted ( 4530225 ) writes:
        
        Comment removed based on user account deletion
    - Re: The research looks extremely weak and thin. BS (Score:2)
      
      by hjf ( 703092 ) writes:
      
      I run home assistant for my home automation stuff. I run it on a FreeBSD box I have. Home Assistant is on the bleeding edge of python and FreeBSD is the opposite.
      it really is annoying to deal with "modern" programmers that want to "refactor everything, all the time", breaking APIs with no regard.
      - Re: (Score:1)
        
        by Seven Spirals ( 4924941 ) writes:
        
        That's one opinion. In my world (older systems and embedded systems) it's not annoying, it's unacceptable and Python, Perl, and Ruby scripts are considered to be a pile of time wasting garbage until proved (by someone else) that they work. Even then I'm still super skeptical. Seen too many tracebacks for too many supposed shrink-wrapped packages.
        
        Re: (Score:2)
        
        by ByTor-2112 ( 313205 ) writes:
        
        I am not familiar with Ruby, but I use the embedded python interpreter on customer systems that are locked down for SOX complaince. The client's auditors check screenshots of installed applications, file timestamps, object timestamps, etc. I'm able to the pip "hack" for installing packages and I have never had a compatibility issue within a major version of Python, and only a the rare exception when updating. It has always been as simple as copy over and extract. No installation necessary.
        Can you name some
        
        Re: (Score:1)
        
        by Seven Spirals ( 4924941 ) writes:
        
        Tracebacks with a vanilla install of pretty much 90% of anything installed with 'pip'. Complete functional breakdown of anything simple that's supposed to "just work" and no documentation or troubleshooting info after the things shit themselves. Nearly a 100% chance that any pip item with dependencies will fail to install dependencies, etc... The most recent Pyturd that exploded on me was cve-bin-tool.
      - Re: (Score:2)
        
        by Entrope ( 68843 ) writes:
        
        it really is annoying to deal with "modern" programmers that want to "refactor everything, all the time", breaking APIs with no regard.
        Also ones that have a hard-on for massive dependency trees. I wanted to build Jujutsu [github.com] on an Ubuntu 22.04 box, but that Ubuntu only has Rust 1.80 and some dependency in the jj stack already requires Rust 1.81.
        jj is nice because it's very focused on doing its job and being usable, with none of the stereotypical in-your-face "we use Rust" attitude. But its dependency set forces you -- I assume inadvertently -- to the bleeding edge.
      - Re: (Score:2)
        
        by account_deleted ( 4530225 ) writes:
        
        Comment removed based on user account deletion
        
        Re: (Score:2)
        
        by hjf ( 703092 ) writes:
        
        I call it the "new immate" approach. Like in movies, where the new immate has to kill someone to prove they're tough, and not to be messed with.
        I've worked on many projects where the new hire (usually a semi-senior or senior) will take the code and start sending huge PRs for "refactoring". It's 99% shifting things around. But it makes them look impressive.
        Sorry, i'm too old for this shit. For me, impressive is when i see a PR that only has a few lines and it solves a long-standing bug.
    - Re: (Score:3)
      
      by nightflameauto ( 6607976 ) writes:
      
      Python and Ruby change their runtimes and their package formats constantly and it's forever broken if you aren't on the bleeding edge. Other systems like Javascript + NPM or Lua + Rocks do a lot better, but still suck hind tit for the most part. For me, C is the go-to language and have a lot less features (no network package manager) but a lot more survivability and reliability (the shit will actually work without throwing Python tracebacks for the first half hour I fight with it).
      Having been a dairy farmer, the hind tit is the one you want on most cows. Fronts tend to have less milk. Rears tend to output more. While you have to wait for the milk to drop in some nervous milkers, the layout of the entire udder is such that the rear teats have larger "containerization" as it were. The front tend to be slightly smaller / raised above the rear. This educational moment brought to you by hundreds of early mornings and hot afternoons in the milk barns and parlors.
      - Re: (Score:1)
        
        by Seven Spirals ( 4924941 ) writes:
        
        Ha! Thanks for that, I was expecting another flame. That was educational and hilarious.
  - Re: (Score:2)
    
    by swillden ( 191260 ) writes:
    
    Of course, this is generally a self-inflicted problem from chosing more fickle ecosystems, but those fickle ecosystems have a lot of mindshare (python and javascript are highly likely to cause you to do big migrations, C and Golang are comparitively less likely to inflict stupid changes for less reason).
    Google often chooses to do very large migrations, in all of the languages the company uses. Google uses a build-from-head monorepo strategy for almost everything, which has a lot of benefits but it also means that when the core libraries are improved the amount of client code that's impacted is enormous. Not being willing to make regular large-scale migrations would mean that the core libraries are not allowed to improve, which just motivates project teams to write their own variants, or add layers on top,
  - Re: (Score:3)
    
    by account_deleted ( 4530225 ) writes:
    
    Comment removed based on user account deletion
- Re: (Score:1)
  
  by masterz ( 143854 ) writes:
  
  Having used it, I have to say that some of the AI suggested modifications are simply magic. I have typed '// This should' and it fills in exactly what I was thinking. Blocks of code are very often filled in automatically, and correctly.
  - Re: (Score:1)
    
    by narcc ( 412956 ) writes:
    
    I've heard a lot of wild claims, but this is the first time I've seen anyone claim that AI was psychic...
    - Re: (Score:1)
      
      by masterz ( 143854 ) writes:
      
      LLMs just predict the next likely thing. I think humans often do the same, we just don't know it.
      - Re: (Score:2)
        
        by narcc ( 412956 ) writes:
        
        You wrote: "I have typed '// This should' and it fills in exactly what I was thinking"
        I weep for the future...
      - Re: (Score:2)
        
        by account_deleted ( 4530225 ) writes:
        
        Comment removed based on user account deletion
        
        Re: (Score:2)
        
        by ByTor-2112 ( 313205 ) writes:
        
        I just never can understand that kind of magical thinking.
        
        Re: (Score:2)
        
        by Gideon Fubar ( 833343 ) writes:
        
        The trick to understanding people who do this is to understand that they engage that magical thinking by default... but also there are less of them than there appears to be, it's the same ones making the same type of mistakes over and over, and exploiting the people who rush to help them stop hurting themselves.
        I have no suggestions for how to understand the method though.
  - Re: (Score:2)
    
    by whiplashx ( 837931 ) writes:
    
    Yeah gpt-o1 code is really really good, I only have to fix about 10% of it. I keep meeting people who disagree but, it works for me, I have written dozens and dozens of programs in the last two months, probably 5x faster.
- Limited Applicability (Score:2)
  
  by XopherMV ( 575514 ) writes:
  
  What's noteworthy is that this was the same set of changes across multiple repos. The applicability of this solution for other problems is limited. If I know there's a bug in a system, it makes way more sense to me to dig into that code to find the one bug rather than create an LLM in an attempt to find that same bug in all possible repos. That approach generally doesn't make sense.
  - Re: (Score:2)
    
    by account_deleted ( 4530225 ) writes:
    
    Comment removed based on user account deletion
We've heard this song before (Score:2, Troll)

by hyades1 ( 1149581 ) writes:

Translation: We can now screw up twice as bad in half the time.
- Re: (Score:2)
  
  by account_deleted ( 4530225 ) writes:
  
  Comment removed based on user account deletion
That explains it.... (Score:1)

by Kelxin ( 3417093 ) writes:

That's why almost every Google service has gone to shit lately. Yandex gives better results than Google now. Was in a Google meet earlier today that had several problems. Don't get me wrong, I love when developers push code out that was made by someone or something else that they don't understand.
- Re: (Score:2)
  
  by Njovich ( 553857 ) writes:
  
  I was having the same thought, I've had shitty issues lately across a range of Google apps that I use.
But they didn't change the type (Score:5, Interesting)

by laughingskeptic ( 1004414 ) writes: on Thursday January 16, 2025 @03:07PM (#65094177)

Why after acknowledging that the generic typing (int) made finding all of the places needing changing hard ... did they not `typedef int userId` and replace all pertinent int declarations and THEN `typedef long userId`? Instead they used their LLM to help change certain declarations from int to long.

Share
twitter facebook
- Re: (Score:2)
  
  by account_deleted ( 4530225 ) writes:
  
  Comment removed based on user account deletion
- Re: (Score:2)
  
  by Dan667 ( 564390 ) writes:
  
  I would love to see how close you could achieve the same results with a script of if / replace statements.
Sounds trivial? (Score:2)

by ByTor-2112 ( 313205 ) writes:

This sounds kind of trivial, in the sense that if the code is well written, the changes should also be very formulaic.
- Re: (Score:2)
  
  by omnichad ( 1198475 ) writes:
  
  Right. It's like a car that's 95% full self-driving. Since the output isn't deterministic, the whole process needs human review and mistakes are easier to miss.
  Doing this algorithmically would have been consistent and where it fails it would fail in a predictable way.
- Re: (Score:1)
  
  by nightflameauto ( 6607976 ) writes:
  
  This sounds kind of trivial, in the sense that if the code is well written, the changes should also be very formulaic.
  Haven't you heard? The type of "find and replace" that every IDE has had in it for decades already is now referred to as AI. Anything that the machine does is AI. Booting the computer is handled by AI. Login is actually AI. Opening a Word document is AI. IT'S AI ALL THE WAY DOWN!
  - Re: (Score:2)
    
    by ByTor-2112 ( 313205 ) writes:
    
    I mean, that has been the case since Siri and Google Assistant were introduced. Every app went from using an algorithm to using "AI", no matter what it was really doing.
    Isn't an "AI" model just evaluating a function with billions of parameters, and the training generated the coefficients? It's just math all the way down.
- AI can manage 2nd year CS student stuff (Score:4, Interesting)
  
  by drnb ( 2434720 ) writes: on Thursday January 16, 2025 @03:40PM (#65094267)
  
  This sounds kind of trivial, in the sense that if the code is well written, the changes should also be very formulaic.
  From playing around with AI coding systems. AI seems to be about the level of a sophomore CS student who has had the data structures class, has not had the algorithms class yet, and can copy code from the internet, but may lacks a real understanding of the code implementation its copying. Which is still kind of impressive from the perspective of someone who studied AI at the grad school level.
  
  Copy/paste coders beware, AI is coming for you. :-)
  
  Parent Share
  twitter facebook
  - Re: (Score:2)
    
    by RobinH ( 124750 ) writes:
    
    That's what a LLM does. It outputs text that is statistically indistinguishable from the text it's been trained on. But it doesn't actually "know" or "understand" what the code is doing. It's not actually reasoning about it. A real programmer is modelling the CPU and memory in their head (or at least a greatly simplified model of it) and thinking about what each step does to the state of the machine.
    Take a look at the real-time AI-generated minecraft game. It's really trippy. It predicts the next fram
    - Re: (Score:2)
      
      by drnb ( 2434720 ) writes:
      
      But it doesn't actually "know" or "understand" what the code is doing. It's not actually reasoning about it.
      I'd say there is some very simplistic reasoning in some of the AI coding systems. It seems to be able to combine a couple simple concepts well enough to "merge" the respective pieces of code it's seen.
    - Re: (Score:2)
      
      by anoncoward69 ( 6496862 ) writes:
      
      Unless you're coding kernel or driver level code i doubt very few programmers are "modelling the CPU and memory in their head" No programmer writing gives 2 shits about whats going on under the hood.
      - Re: (Score:2)
        
        by RobinH ( 124750 ) writes:
        
        That sounds like something an AI would say. ;)
      - Re: (Score:2)
        
        by drnb ( 2434720 ) writes:
        
        Unless you're coding kernel or driver level code i doubt very few programmers are "modelling the CPU and memory in their head" No programmer writing gives 2 shits about whats going on under the hood.
        You are mistaken. Having a very basic understanding of the underlying architecture lets you write better code, even when using a high level language. Compilers often benefit from "hints", structuring one's code and data with the architecture in mind. This includes application level code.
        
        That computer architecture class is rightfully a core class.
  - Re: (Score:2)
    
    by Virtucon ( 127420 ) writes:
    
    . AI seems to be about the level of a sophomore CS student
    More like the first answer that came from stackoverflow whether or not it was the highest-ranked or correct answer.
  - Re: (Score:2)
    
    by kick6 ( 1081615 ) writes:
    
    Ah to be a CS sophomore in the "copy code from the internet" era.
    - Re: (Score:2)
      
      by drnb ( 2434720 ) writes:
      
      Ah to be a CS sophomore in the "copy code from the internet" era.
      We were so much more skilled having to open a Knuth book and translate his pseudo-assembly into compilable code. :-)
- Re: (Score:2)
  
  by gillbates ( 106458 ) writes:
  
  I was thinking the same thing, my goodness, they've reinvented sed!
  In a well designed codebase, this would have been a one-line change. The fact that they're bragging about using AI for this just shows that there are yet entire departments at Google ignorant of basic software engineering practices.
  - Re: (Score:2)
    
    by gillbates ( 106458 ) writes:
    
    From the FA, Whether there is a long-term impact on quality remains to be seen.
    Just FYI Google: a software engineer can quantify the impact on quality using process controls. Just thought you might like to know.
Seems feasible, well scoped, verifiable (Score:1)

by doomday ( 948793 ) writes:

LLMs are pretty good at low risk fairly consistent edits that can easily be mechanically verified as correct. With the size of Google's codebase and the requirements that your one "pull request" be up to date and verifiable, this seems like a case where it could be a win, and reduce your workload and the amount of pain to do it. I spend a lot of time on the cases where it doesn't work, and I call those out vigorously, but this seems like one where LLMs would help. There are many more complex cases where t
Great idea (Score:2)

by CEC-P ( 10248912 ) writes:

How much did it slash the coding accuracy? They should write an AI to investigate this question.
Wait-- (Score:2)

by Geoffrey.landis ( 926948 ) writes:

Wait... code is a migratory species?
It flies south for the winter?
- Re:Wait-- (Score:5, Funny)
  
  by ihadafivedigituid ( 8391795 ) writes: on Thursday January 16, 2025 @04:48PM (#65094411)
  
  Oh, yeah, an African code base maybe, but not a European code base. That's my point.
  
  But then of course, uh, African code bases are non-migratory.
  
  Parent Share
  twitter facebook
  - Re: (Score:2)
    
    by Geoffrey.landis ( 926948 ) writes:
    
    So, what is the airspeed velocity of unladen code?
    - Re: (Score:2)
      
      by ihadafivedigituid ( 8391795 ) writes:
      
      What? I don't know that--
      
      WHAAAAAAAaaaaaaaaaa .......
    - Re: (Score:2)
      
      by sconeu ( 64226 ) writes:
      
      I don't know that!
      [falls into the Gorge of Eternal Peril]
      - Re: (Score:2)
        
        by ihadafivedigituid ( 8391795 ) writes:
        
        Too late, n00b, check the timestamps.
        
        Now I will say Ni! to you until you get me a shrubbery.
News Flash! (Score:2)

by Virtucon ( 127420 ) writes:

Company with a bloated codebase says that they can now have a bigger bloated codebase because of AI.
find . -type f -exec sed -i 's/old-pattern/new-pattern/g' {} +
- Re: (Score:2)
  
  by swsuehr ( 612400 ) writes:
  
  Came here to say this. The task they're bragging about sounds like a job for sed and awk. There's this absolute amnesia or maybe just complete cluelessness about how to actually use the tools. People just seem to want to write new tools rather than learn what's already available.
  - Comment removed (Score:4, Interesting)
    
    by account_deleted ( 4530225 ) writes: on Thursday January 16, 2025 @07:17PM (#65094693)
    
    Comment removed based on user account deletion
    
    Parent Share
    twitter facebook
    - Re: (Score:2)
      
      by gillbates ( 106458 ) writes:
      
      ask the LLM to figure out where these values got passed.
      Back in 2006, as a new hire I wrote a tool which would scan a codebase for identifiers and cross reference every usage of those. It was a fun little project - took about a week - and was the first application I'd written which actually used a substantial amount of memory - more than 700MB, IIRC.
      Once you have the dependency graph, it's a relatively simple matter to automate the textual changes. The clincher comes when you have aligned or byte-p
      - Re: (Score:2)
        
        by account_deleted ( 4530225 ) writes:
        
        Comment removed based on user account deletion
        
        Re: (Score:2)
        
        by gillbates ( 106458 ) writes:
        
        Even if the LLM could find the consumer on the other end of the network connection, would it even have legal authority to change the code there? What if the consumer is a third-party contractor? How would it know the difference? How would it even know who is connecting to its service, if the connections weren't logged?
        Emotional attachment has nothing to do with the fact that changing code may not even be possible in all of the necessary scenarios. It could be politically, legally, or financially risk
        
        Re: (Score:2)
        
        by account_deleted ( 4530225 ) writes:
        
        Comment removed based on user account deletion
    - Re: (Score:2)
      
      by swsuehr ( 612400 ) writes:
      
      Meh. I worry about the accuracy and visibility of AI for this vs. using tried and true methods as the post stated with the sed command.
      - Re: (Score:2)
        
        by account_deleted ( 4530225 ) writes:
        
        Comment removed based on user account deletion
Sounds Reasonable (Score:2)

by sconeu ( 64226 ) writes:

Back in the early '90s, I migrated a system from 16 to 32 bit. I wrote scripts to do this (this code base was in hundreds of thousands lines of code).
From memory, I'd say the 80% automation number sounds about right. I can easily see this being a decent use of so-called "AI" in development.
Tainted codebase? (Score:3)

by grolschie ( 610666 ) writes: on Thursday January 16, 2025 @07:57PM (#65094749)

Who's code was the LLM trained on, and under what license was said code released?

Share
twitter facebook

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

The research looks extremely weak and thin. BS. (Score:4, Insightful)

Re:The research looks extremely weak and thin. BS. (Score:5, Insightful)

Re: (Score:1)

Re: (Score:2)

Re: (Score:2)

Re: The research looks extremely weak and thin. BS (Score:2)

Re: (Score:2)

Re: The research looks extremely weak and thin. BS (Score:2)

Re: (Score:1)

Re: (Score:2)

Re: (Score:1)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:3)

Re: (Score:1)

Re: (Score:2)

Re: (Score:3)

Re: (Score:1)

Re: (Score:1)

Re: (Score:1)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Limited Applicability (Score:2)

Re: (Score:2)

We've heard this song before (Score:2, Troll)

Re: (Score:2)

That explains it.... (Score:1)

Re: (Score:2)

But they didn't change the type (Score:5, Interesting)

Re: (Score:2)

Re: (Score:2)

Sounds trivial? (Score:2)

Re: (Score:2)

Re: (Score:1)

Re: (Score:2)

AI can manage 2nd year CS student stuff (Score:4, Interesting)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Seems feasible, well scoped, verifiable (Score:1)

Great idea (Score:2)

Wait-- (Score:2)

Re:Wait-- (Score:5, Funny)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

News Flash! (Score:2)

Re: (Score:2)

Comment removed (Score:4, Interesting)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Sounds Reasonable (Score:2)

Tainted codebase? (Score:3)

Related Links Top of the: day, week, month.

Slashdot Top Deals