Please create an account to participate in the Slashdot moderation system

 



Forgot your password?
typodupeerror
Microsoft The Internet

Microsoft's Plan To Fix the Web: Letting Every Website Run AI Search for Cheap (theverge.com) 22

Microsoft has announced NLWeb, an open protocol designed to democratize AI-powered search capabilities for websites and apps. Developed by Microsoft technical fellow Ramanathan V. Guha, who previously created RSS and Schema.org, NLWeb allows site owners to implement ChatGPT-style natural language search with minimal code. The protocol enables websites to process complex queries like "spicy and crunchy appetizers for Diwali" or "jackets warm enough for Quebec," requiring only an AI model, some code, and the site's own data.

During his demonstration to news outlet The Verge, Guha showed how NLWeb remembers user preferences, such as dietary restrictions, for future interactions. "It's a protocol, and the protocol is a way of asking a natural-language question, and the answer comes back in structured form," explained Guha, who argues the approach is significantly cheaper than traditional search methods that require extensive web crawling and indexing. Microsoft is partnering with publishers and companies including TripAdvisor, Eventbrite, and Shopify to implement NLWeb, though Guha acknowledges the challenge of achieving widespread adoption in a web that historically tends toward centralization.

Microsoft's Plan To Fix the Web: Letting Every Website Run AI Search for Cheap

Comments Filter:
  • Attack vector (Score:3, Insightful)

    by pr0nbot ( 313417 ) on Monday May 19, 2025 @01:35PM (#65387953)

    Another security nightmare - securing your llm against malicious prompts attempting to DDoS it by getting it to consume resources, or to divulge privileged information it has access to. This is like putting a sociopathic teenager in charge of customer support (aka the DOGE methodology)

    • by shanen ( 462549 )

      Not following your reasoning here. Are you implying something about using the AI-enhanced websites as some kind of weapon in a form of jujitsu? If so, I'm having trouble seeing how the direction of causality is supposed to work...

      But I am looking forward to Funny. Story sure seems ripe for jokes. I was just checking for some early example of "What could possibly go wrong?" and this FP seems to be falling short.

      • Conjecture: Lots of web sites have disappeared (pandemic) and will disappear (boomers retiring and closing businesses, AI generated answers). Microsoft, google, and other big tech will need to slow the decline because AI slop will reduce original human generated content to a small fraction of the web and drive away most users.

        Conjecture 2: Social media and social media adjacent (LinkedIn) sites will lose users as long-time users age out and getting new users who engage with the community becomes more dif

        • by shanen ( 462549 )

          Thanks for the thoughts. Some worth comment, but discussion basically expired as Slashdot measures time, so I'll just think about them for now...

  • by greytree ( 7124971 ) on Monday May 19, 2025 @01:45PM (#65387989)
    https://www.nlweb.nl/
  • While you can currently argue against AGI, it's harder to argue against omnipresent AI, as AI becomes more embeddable. I'd even say that the Webkit vs Chromium vs Gecko wars are obsolete now, and the real battle of the future of the web is the AI engine that gets used the most.
    • by allo ( 1728082 )

      Currently there is too much development. There are like four new models every week (speaking about open weights, you bet there are a lot more that are not shared). Any candidate you could choose will be outdated by the end of the year.

  • by rsilvergun ( 571051 ) on Monday May 19, 2025 @01:51PM (#65388007)
    I think it's kind of hard to tell. It's not 100% clear where the actual model and searches run from. Some of it seems to imply it's going to run at your own site and some of it implies that it goes beyond your site.

    If you're actually running the searches and the model off of Microsoft systems then odds are good they've got your data and they're going to use it to train their own AI models which you probably don't want because you're giving up a ton of incredibly valuable data you could be selling to Microsoft for profit.

    On the other hand if you're running it on your own servers AI models require a shitload of computer power and well, you're going to have to pay for that.

    I guess in exchange for that you get an AI chatbot that you can use to sell crap to people.

    The actual use case of an AI chatbot for it quickly accessing content though is terrible. You want users to have to browse around your website because 99% of the web business model is having people browse and then you get the throw ads at them on each page. If they can very very quickly find what they're looking for they're going to come in, get it, leave and you don't need much revenue because you don't have eyeballs.

    That's a huge part of enshitification. You have websites designed to force you to engage for the maximum amount of time possible. On the other hand without that the entire content industry that generates a lot of the useful information on the internet kind of collapses.
    • by Anonymous Coward

      If it is for find-in-page++ you don't need a large model. For summarizing and question answering to a text in the input you'll probably choose a tiny model fine-tuned for that tasks, and that can run on a raspberry pi. Have a look how good (for its size) Qwen3 0.6B is. And I think people run up to 8B models with wasm in the browser. So I'd say a good 1B model as Q4 quant would work on most devices people use today.

    • by postbigbang ( 761081 ) on Monday May 19, 2025 @04:42PM (#65388553)

      It's all about ads and tracking. The search history, answers delivered, all of that will be the crux of still other training models, so that much can be inferred by every hook possible.

      Have AI do shopping for you? You plainly know who's going to be at the head of the list-- someone who paid for it. Recommend travel, destinations, therapists, tires, games, the list is infinite, will be monetized, tracked for effectiveness and will learn how to sell exactly to YOU.

      For a price, with micro-payments to $search_API.

      • I mean yeah everybody is hoping to take over Google's slot as the arbiters of the internet.

        The problem is that besides LLM use for replacing workers I don't see a lot of value in it.

        If you make content for a living AI basically takes your content and regurgitates it, often with only the bare minimum pieces of it that people want. It means that as a user you quickly get to the information you want, assuming AI is working like it's supposed to, but it means that you're engagement statistics and with i
        • The Ad AI business is different than the GoogleAds mindset. It makes money in different ways, and still needs ad composition, placement, review, sales people, marketing, infrastructure, an HR dept (perhaps for bots), etc. It's not a zero-overhead business.

          Specialty AI will become a norm, e.g. like subject matter experts of a different era. In turn, these will be networked together in their own matrices. Those matrices will micro-monetize information, transactions, and placements, as well as tasks (compute).

          • by Anonymous Coward

            Will doom us all sooner than later. As far as a way around that there really is only one and it's to change how we teach to focus on critical thinking and improving children's capabilities to the maximum they can reach. But that's it odds with cost cutting and a desire to create effective corporate robots as fast as possible and it's cheaply as possible like natsi Musk is doing with his team of h-1bs. It's too expensive to move a warehouse. Robots is devouring jobs.

            Tech workers and customer service workers

            • I'm not quite as paranoid as you are, but we share similar concerns. However, this is about Microsoft and the post.

              It's an API, one step towards both panopticon, making Microsoft loot, and also an early embryonic set of ideas not unlike how computing replaced human tasks 30-40 years ago. There were many exclamations of how those fun menial tasks (that paid actual money) might be automated. Those that went with the automation survived. It's Darwinian in that characteristic.

              Now, faced with bots, AI, and other

  • by nightflameauto ( 6607976 ) on Monday May 19, 2025 @02:05PM (#65388053)

    Is it a way to bypass all possible security and scrape all information on web sites into Microsoft AI systems? Or is this a hoped-for future addiction that will feed a payment plan for any website dumb enough to use it today, to be monetized later? Or is it a better way to gather info for targeted advertising by scraping every site that adds this "open protocol?"

    I mean, there's so many possible nefarious reasons for this to exist, I'm having a hard time coming up with a single possible positive for the end users and website owners. It all seems to point to, "Give us more direct hooks to your information, please. We needs dat data." And possibly to, "We'll monetize egregiously once we get you addicted."

    • by Tablizer ( 95088 )

      there's so many possible nefarious reasons for this to exist,

      MS's new NefariousGPT generates creative vendor-lock-in and data vacuuming ideas that took human MBA dolts years to devise.

      MS's new motto: Be Evil Faster

      Google is suing for motto swiping.

  • by allo ( 1728082 ) on Monday May 19, 2025 @02:19PM (#65388099)

    Reduce JavaScript to a small subset, limit the number of third party sites that can be used to load resources, do not allow scripts or cookies from third parties at all. Maybe also reduce HTML to a safe subset. Kinda what AMP could have been if it wasn't from Google.

    AI search on websites is not required to fix anything. I am very open to the better uses of AI, but embedding it in every website sounds like one of the worst. But since Recall it looks like Microsoft is trying to find what the worst use-case for AI might be.

  • by Anonymous Coward
    Its like accepting bandages from a person actively stabbing you...
  • by Anonymous Coward

    > Ramanathan V. Guha, who previously created RSS

    consulted about, yes. created: no.

    to be fair: RDF was a much larger technical achievement and RSS was only a minor application of it. (until later, when it wasn't even that).

  • How exactly does that work? Pouring more urine on a urinal does not fix the clogged drain.
  • The idea behind the Semantic Web was that data would not only be available, but would be meaningful. That's been the missing part. Injecting AI to give users a web experience that actually leverages the semantic meaning of all the data out there has some merit.

What ever you want is going to cost a little more than it is worth. -- The Second Law Of Thermodynamics

Working...