Anthropic's AI Model Gains Computer Control in New Upgrade (anthropic.com) 8
Anthropic has released an upgraded version of its AI model Claude 3.5 Sonnet and announced a new model, Claude 3.5 Haiku, alongside a public beta feature enabling AI to operate computers like humans. The enhanced Sonnet model improved its coding capabilities, scoring 49% on the SWEbench Verified benchmark, surpassing OpenAI and other competitors. The Haiku model matches the performance of Anthropic's previous flagship Claude 3 Opus while maintaining lower costs and faster speeds.
The computer use feature, available through Anthropic's API and cloud partners, allows Claude to perform tasks like navigating web browsers, filling forms, and manipulating data. Early adopters include Asana, DoorDash, and Replit, though Anthropic -- backed by investors including Google and Amazon -- acknowledges the feature remains experimental and error-prone. Claude 3.5 Haiku will launch later this month, initially supporting text-only inputs with image capabilities to follow.
The computer use feature, available through Anthropic's API and cloud partners, allows Claude to perform tasks like navigating web browsers, filling forms, and manipulating data. Early adopters include Asana, DoorDash, and Replit, though Anthropic -- backed by investors including Google and Amazon -- acknowledges the feature remains experimental and error-prone. Claude 3.5 Haiku will launch later this month, initially supporting text-only inputs with image capabilities to follow.
error prone? (Score:4, Funny)
> acknowledges the feature remains experimental and error-prone
I never would have guessed.
Re: (Score:2)
Think of it likes Windows.
Big Progress for Automation (Score:2)
This will be a big improvement for getting AI to takeover tasks in the work space. There are a lot of tedious but well defined tasks that people have to do now and involve custom software or working with software that doesn't have an interface that is compatible with other automation software. Having an AI that could take over would be a big move
Re: (Score:2)
I don't disagree .. this is useful. But I also find it amusing that if this becomes widespread, I can imagine even *less* effort being made to create ergonomic UXes or improve existing ones. "Who cares if it's awful from user perspective, you can get a computer to do all the UX navigation for you!"
Hehehe
Re: (Score:2)
Not this time. The current hype-AI does not really do "well defined". Well, maybe the next one will in 10-20 years.
Sonnet , Haiku (Score:2)
So you have to give the AI instructions in the form of poems?
Re:Sonnet , Haiku (Score:4, Interesting)
Router hums, a patch takes hold,
Networks bloom again.
So basically malware now? (Score:2)
This will be interesting. And no, it will not happen on any of my systems.