Become a fan of Slashdot on Facebook

 



Forgot your password?
typodupeerror
AI

Alibaba's ZeroSearch Teaches AI To Search Without Search Engines, Cuts Training Costs By 88% (venturebeat.com) 7

Alibaba Group researchers have developed "ZeroSearch," a technique that enables large language models to acquire search capabilities without using external search engines during training. The approach transforms LLMs into retrieval modules through supervised fine-tuning and employs a "curriculum-based rollout strategy" that gradually degrades generated document quality.

In tests across seven question-answering datasets, ZeroSearch matched or exceeded the performance [PDF] of models trained with real search engines. A 7B-parameter retrieval module achieved results comparable to Google Search, while a 14B-parameter version outperformed it. The cost savings are substantial: training with 64,000 search queries using Google Search via SerpAPI would cost approximately $586.70, compared to just $70.80 using a 14B-parameter simulation LLM on four A100 GPUs -- an 88% reduction.

The technique works with multiple model families including Qwen-2.5 and LLaMA-3.2. Researchers have released their code, datasets, and pre-trained models on GitHub and Hugging Face, potentially lowering barriers to entry for smaller AI companies developing sophisticated assistants.

Alibaba's ZeroSearch Teaches AI To Search Without Search Engines, Cuts Training Costs By 88%

Comments Filter:
  • Their 'smart' search function was so utterly annoying.

  • Let's race to the bottom -- see what's beyond the last visible dog.
  • They do not replace search engines, the can not replace search engines, and they do not aim to replace search engines.

    They replace USING a search engine during training. The key point is training on that prompt:

    Answer the given question. You must conduct reasoning inside <think> and </think> first every time you get new information. After reasoning, if you find you lack some knowledge, you can call a search engine by <search> query </search>, and it will return the top searched re

Remember: use logout to logout.

Working...