Slashdot Log In
A Look At the Wolfram Alpha "Search Engine"
Posted by
kdawson
on Mon Apr 27, 2009 12:59 AM
from the computational-knowledge-engine dept.
from the computational-knowledge-engine dept.
An anonymous reader points out a ReadWriteWeb piece on an hour-long demo of Wolfram|Alpha (which we discussed at its announcement). Stephen Wolfram does not like to call it a "search engine," preferring instead the term "computational knowledge engine." It will open to the public in May. "The hype around Wolfram|Alpha, the next 'Google killer' from the makers of Mathematica, has been building over the last few weeks. Today, we were lucky enough to attend a one-hour web demo with Stephen Wolfram, and from what we've seen, it definitely looks like it can live up to the hype — though, because it is so different from traditional search engines, it will definitely not be a 'Google killer.' According to Stephen Wolfram, the goal of Alpha is to give everyone access to expert knowledge and the data that a specialist would be able to compute from this information."
Related Stories
[+]
Science: Wolfram Promises Computing That Answers Questions 369 comments
An anonymous reader writes "Computer scientist Stephen Wolfram feels that he has put together at least the initial version of a computer that actually answers factual questions, a la Star Trek's ship computers. His version will be found on their Web-based application, Wolfram Alpha. What does this mean? Well, instead of returning links to pages that may (or may not) contain the answer to your questions, Wolfram will respond with the actual answer. Just imagine typing in 'How many bones are in the human body?' and getting the answer." Right now, though the search entry field is in place, Alpha is not yet generally available -- only "to a few select individuals."
Submission: Alpha - The Wolfram search engine by Anonymous Coward
[+]
Test Driving the Wolfram Alpha 124 comments
SilverMind writes in to note a blog entry at Byte Size Biology describing in detail a few hours spent with Wolfram Alpha (which we have discussed before). "After playing around with Wolfram Alpha for a few hours, I can safely say the following: it's different, it's incomplete, it's idiosyncratic, and it's funky cool. And no, it will not dethrone Google, nor does it aim to do so."
This discussion has been archived.
No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
Full
Abbreviated
Hidden
Loading... please wait.
Google started the ball rolling... (Score:4, Insightful)
...with their web-crawling keyword-sculling technology, but it's only normal that someone else was researching what to ~do~ with all the data. IE (data analysis for human comprehension) and Google would make one fierce - and useful - blend.
Re:Google started the ball rolling... (Score:5, Insightful)
This blog post reads more like a marketing piece written by a shill, and if there is any hype, it just seems like it's just self-delusion or just wishful thinking at this point.
Any search engine query and corresponding results can be manually optimized and tweaked to quasi-perfection. In fact, that's the exact recipe many of the now defunct search engines were using a while ago. They would optimize the hell out of a couple of queries or use case scenarios, and then they would fall in love with the layout and content of their contrived results. And then, when the users didn't use the search engine the way the developers wanted them to use it, the developers tried changing the behaviors of their users instead of trying to change their search engine. For the most recent example of this, of actually one company that still had money to waste a year ago, think back to the ask.com commercial where they tried to teach us about the *cool* ajax feature of previewing web sites. Not that this feature was bad per say, but if it was any good, or groundbreaking in any usable way, users would be telling each other about it -- they wouldn't need to be educated about it -- at such a large expense.
And the same goes for the tone of this blog post was written in. It was written from the perspective of a shill, or from the perspective of the company itself, but not from the perspective of an actual user. Personally, I don't want to know about the supposed hype or marketing-speak from the developer's own mouth, I just want to know how useful it's going to be for me. And I don't want contrived examples, I want one or two random example from the (supposedly independent) blogger himself (if possible). And I don't want an actual screenshot of the search box, I want the actual search box itself. Am I only one who tried clicking on it? And if you're going to give me the screenshot of something, give me the screenshot of the search results page (at the very least) and not just a verbal description of it.
Which brings me to my last point: Show. Don't tell. And if there is one thing that Google does well, it's that they don't try to prematurely hype their nascent lab products. They release them first, then they see if the users fall in love with their creation (or not), which is rather a hit-or-miss proposition and a long iterative process. So don't tell me about a fancy search engine, if it's not even out for a public trial yet. I want to try it. I don't want to be told about it.
Parent
Re:Google started the ball rolling... (Score:5, Insightful)
IE (data analysis for human comprehension) and Google would make one fierce - and useful - blend.
Finding relevant information other than the Wikipedia page for any specialist topic is a pain in the ass. If these guys can find a way to index only the good stuff, i.e. not based on general popularity but content accuracy, they could have a future.
Do I have to remind everyone how annoying it is to search for technical documentation for something vaguely Linux-related, only to find the first 30 hits are various forums with more or less clueless newbies discussing installation difficulties and the syntax of apt-get?
Parent
Re:Google started the ball rolling... (Score:5, Funny)
Do I have to remind everyone how annoying it is to search for technical documentation for something vaguely Linux-related, only to find the first 30 hits are various forums with more or less clueless newbies discussing installation difficulties and the syntax of apt-get?
Gods yes. And not to mention that 80% of them are from 2006 or earlier.
Parent
Sorta... (Score:5, Informative)
is there a way to blacklist all the Ubuntu forums in my Google profile?
Keep track of the domain names of the sites with this info, and then
[searchquery] -stupidforum1.com -stupidforum2.com -stupidforum3.com
And so on. I used to use the same to pull expert-s/exchange results out of my Google queries, and then they started giving Google referred page hits free answers, so it's been a while since I used it.
And yes, there's probably a better method than what I've brought up, but it does work.
Parent
Re: (Score:3, Informative)
That works too...
I had many many queries which I was redirected to EE, and then I found that answers were not available - and skipped the page.
Actually, I found answers to most queries from other sites, otherwise I would have paid them to get the answers.
Only now, after you had mentioned that it is there below, did I notice that the answers are infact available.
So, I guess it is a real working idea.
athe real question... (Score:5, Funny)
What role will cellular automata play in this, and will this also define the basic nature of universal mechanics?
Re: (Score:3, Informative)
Re:athe real question... (Score:5, Funny)
I have cloacal exstrophy, you insensitive clod!
Parent
I wouldn't hold my breath (Score:5, Insightful)
It took Mathematica many years to become even marginally correct and useful. If Alpha proceeds at the same pace, it won't have any impact at all.
Re:I wouldn't hold my breath (Score:4, Insightful)
Where can I download your better option?
Parent
Re:I wouldn't hold my breath (Score:4, Funny)
Parent
Re: (Score:3, Funny)
My god, it's full of... (Score:4, Insightful)
FTFA:
...according to Stephen Wolfram, Alpha is built on top of 5 million lines of Mathematica code which currently run on top of about 10,000 CPUs (though Wolfram is actively expanding its server farm in preparation for the public launch).
5 *million* lines of Mathematica? How many code monkeys does he have working for him?
Re: (Score:3, Funny)
Don't laugh, but this is one of the questions that Wolfram|Alpha will be able to answer easily. May 2009.
Re:My god, it's full of... (Score:5, Funny)
I knew I should have looked this up before clicking submit: this makes Wolfram Alpha 1.25 million times more complicated than the entire universe, which Wolfram expects to be expressible in 4 lines [umich.edu] of Mathematica [metafilter.com].
Parent
Re: (Score:3, Insightful)
And 10,000 CPUs and expanding for the public launch... who is going to pay for all that? It's not that Google has a few more of those CPUs running now, but when Google went public I'm quite sure it was less. They just expanded with the expansion of their market.
Maybe this is the late 90s again (prepare for totally unrealistic user numbers), or this search engine indeed needs so much horse power, meaning in effect that it can never become profitable.
Also the article talks about queries running a few second
Re:My god, it's full of... (Score:4, Interesting)
Yeah [flickr.com], I'd say that's less than 10,000 CPUs.
That said, the later you try to crash the party, the more mature competition you are facing, and the bigger/better the launch has to be. Google didn't have Google to contend with.
Will it fail? Probably. But the stakes are enormous, so you can't blame a rich smart guy for trying.
Parent
Re:My god, it's full of... (Score:5, Interesting)
Google at the time had a.o. AltaVista to contend with, at the time the number-one search engine. It was set up by some college students in their dorm room, who had a better idea about searching/indexing web pages, and managed to implement that idea. Then it went live from a single computer for their friends. Who told their friends, and soon the whole campus used them, etc.
Google never advertised their service, it was pure word of mouth. They just got better results than the competition. And they got started of course in a geek environment, so the first word got out and spread quickly.
Good chance that the "next Google" starts up just like that. Hell, I bet The Pirate Bay started up that way. Craigslist did so at least - just a guy called Craig who started a local classifieds page for friends and friends of friends.
Yes the stakes are huge but just throwing money at the problem generally won't get you far, I would say good chance it gets you doomed even as big money often takes away the focus from the innovation that is needed.
Parent
Re: (Score:3, Insightful)
Some Rough Statistics (from August 29th, 1996)
Total indexable HTML urls: 75.2306 Million
Total content downloaded: 207.022 gigabytes
BackRub is written in Java and Python and runs on several Sun Ultras and Intel Pentiums running Linux. The primary database is kept on an Sun Ultra II with 28GB of disk.
That were, at the time, very serious computing resources, but nothing special for a university to have available. Nowadays this will be the same: just add a zero or two for to the specs. It is even something that a normal start-up with venture capital funding can afford, start up a little smaller and it becomes living room material. 1000/1000M Internet is readily available even for consumers, so even bandwidth is not a problem. For starting up there is no need to index "the Intern
A New Kind Of "Living Up To" (Score:4, Insightful)
The Hype: The singularity is here people and Wolfram is our prophet!
The Demo: It's like a search engine but not as good, so he doesn't like you calling it that.
The Product: I can't wait.
Re: (Score:3, Funny)
So it'll be exactly like his book?
And can it deal with paradoxes? (Score:4, Funny)
Quick, somebody ask it what "the first number not nameable in under ten words" is!
Parent
Google-killer? (Score:5, Insightful)
Google won't be killed until someone perfects an AI that you can have a search 'conversation' with, who can understand goddamn context and intelligently narrow down, find relevant articles that don't contain your keywords, etc. Kinda like the librarian from Neal Stephenson's "Snow Crash" novel, but more powerful.
The main reason no one will beat Google until then is that Google is extremely wealthy and can outspend you as it continually perfects information sorting itself, not to mention buy any technology that comes close to threatening it. If you really developed a Google-killer and presented it to the world, do you also have the stones to turn down, say, $100 million? I don't think so, it would take you probably 20-30 years to make that on your own, if you're lucky, with the search field full of competition and Google's mature business-plan in place. Even the days of Alta-Vista were essentially the Cowboy West, unsophisticated and without any proven business plans. Google walked in and owned right away, then discovered how to make money off search when no one else was.
Even then, the founders of Google tried to sell their brilliant search idea not for $100 million dollars, but for $1 million dollars, and there were no takers. They were forced to go it alone. If someone had offered them $500,000 they probably would've taken it and ran.
Although, if you really do develop an AI, there'll be a billion more profit opportunities than search, that's peripheral. An AI can do menial labor far better, faster, stronger than a human. What happens when McDonalds is staffed solely by robots. That would be pretty damn cool actually. They work for the price of electricity, maybe we can get the price of a cheeseburger back down to $0.25 :D
Re:Google-killer? (Score:4, Interesting)
What happens when McDonalds is staffed by robots?
http://www.marshallbrain.com/manna1.htm [marshallbrain.com]
Parent
Re:Google-killer? (Score:5, Insightful)
The main reason no one will beat Google until then is that Google is extremely wealthy and can outspend you as it continually perfects information sorting itself, not to mention buy any technology that comes close to threatening it.
Yes because it's always the wealthy, on top company that innovates the ground breaking ideas, like the airplane, the home computer, the telephone...
Oh wait...
Parent
This could work. (Score:5, Informative)
It seems they are not trying to index the web, nor trying to replace Google.
Instead they are trying to compute knowledge-worthy data from a small subset of the web using natural language algorithms.
Queries like "What is the melting point of iron?" are processed and answered, instead of just trying to score pages based on keywords.
This could really work.
Re:This could work. (Score:4, Insightful)
Or, ya know, not.
I'm not calling Wolfram a big academic fraud with an even bigger opinion of himself, but so far we've seen no evidence that he has done anything.
Parent
Re: (Score:3, Funny)
The thing is called "Wolfram Alpha", probably as in "Alpha and Omega".
Enough said.
Reminds of this Super Genius [wikipedia.org].
Re: (Score:3, Insightful)
Or, ya know, not.
I'm not calling Wolfram a big academic fraud with an even bigger opinion of himself, but so far we've seen no evidence that he has done anything.
I said it could work. ;)
So far there's no evidence in either direction. But it's more fun to stay optimistic.
Re: (Score:2)
You clearly haven't read ANKOS. I envy you.
Re:This could work. (Score:5, Informative)
"Wolfie" Steve Wolfram HAS developed a rather successful software for mathematical modeling. You may have heard of it: "Mathematica". He also wrote a book called "A new kind of Science" which lays out some interesting ideas based on what are called "Cellular Automata" - basically a simple algorithm turned into a loop.
Certain, very simple algorithms appear to be rather respectable pseudo-random number generators, and he uses the fact that they are (repeatable) pseudo-random number generators to be a plus rather than a minus.
I'd like to see some challenging of his ideas, specifically, just how "random" is the output of these simple algorithms? Are they really as incompressible as they seem? It strikes me that there are only so many states possible in a narrow, N-bit wide field that he uses like a register, and thus this would severely limit the "randomness" in the result to being far less than claimed.
In his book, he went too far - he even suggested that cellular automata explain all the phenomenon of the universe! - and for that, his other, useful ideas will tend to be dismissed, even if he IS right.
Parent
Re:This could work. (Score:4, Informative)
Except that most of the ideas on CA that Wolfram used (borrowed?) were already well-known. "A New Kind of Science", it wasn't.
Parent
Re:This could work. (Score:4, Informative)
er...
http://www.google.com/search?hl=en&q=What+is+the+melting+point+of+iron%3F&btnG=Google+Search&meta=&aq=f&oq= [google.com]
Parent
Re: (Score:3, Informative)
Alpha is going to have to be alot better which would require human intervention which leads to a Yahoo type directory and that has alot less entries.
Don't bother... (Score:4, Funny)
... until it's in beta.
The hype? (Score:5, Insightful)
This is the first time I've ever heard about it and I usually check technologically acclimated news sites. Is this a "Google killer" like Cuil was?
Great for financial data (Score:5, Interesting)
If they can figure out how to get this thing to understand financial data, it would be quite useful. That whole area needs more theoretical work.
Machine understanding of financial data is tough. Partly because the data is willfully obfuscated. I once developed a system for turning SEC filings into XBRL (which is an XML representation for financial statements.) At one point, I had several hundred euphemisms for "Net Loss". The connection between financial reporting and reality is at times tenuous.
Accounting is fundamentally mis-designed. The problem is that some numbers are actual, some have tolerances, some are estimates whose actual value will be known at a future date, and some are estimates whose actual value will never be known. Numbers of all four categories are added, and the result is given as a number without a tolerance. That's just wrong. Accounting works that way for historical reasons; it was designed when arithmetic was expensive. Why it stays that way is more interesting, but beyond the scope of this posting. Because of these problems, machine understanding of traditional accounting data is very difficult.
(Back when I did Downside [downside.com] I was more into this, but when I started getting invited to accounting conferences, I realized I didn't want to get into accounting standardization as a field.)
Re: (Score:3, Insightful)
Why it stays that way is more interesting, but beyond the scope of this posting.
Would it be entirely inaccurate if I could summarize that with the single word "greed"?
Re:Great for financial data (Score:5, Insightful)
Parent
Re:Great for financial data (Score:4, Interesting)
I think it is worse than that. They use those formulas because it prevents them having to think. I had accounting in a one semester stint in the Business School at a Large Unnamed University (before I realized that way madness lay). In the middle of an exam, I couldn't remember the Big Formula for solving one particular problem. So I derived my own, solved the problem with the correct answer. My formula was a special case of the Big Formula and my formula was just fine for the particular test problem. I got no credit, that's when I figured Business School Product deserved no credit.
Parent
3.125% (Score:4, Insightful)
Another query with a very sophisticated result was "uncle's uncle's brother's son." [...] Alpha actually returns an interactive genealogic tree with additional information, including data about the 'blood relationship fraction,' for example (3.125% in this case).
Your "uncle's uncle's brother's son" could well be your father.
Wolfram Hart (Score:4, Funny)
Just announced: Corey Hart has joined on as the lead promoter of the new "computational knowledge engine." In related news they have now renamed the engine to signify the merging of their separate talents.
"I believe that Wolfram Hart has the ability to become the Alpha and the Omega of internet informatics" said Hart in a midnight press conference.
Not everyone is celebrating this new turn of events, however. A man only identifying himself as Angel has come out in opposition to a company who openly support those that wear their sunglasses only at night.
Google Killer == Evil? (Score:5, Funny)
If Google does no evil and someone kills them, doesn't that then make the killer evil?
I don't know if I like where this is going.
The end of understanding stuff. (Score:4, Insightful)
Alpha, however, will probably be a worthy challenger for Wikipedia and many textbooks and reference works. Instead of looking up basic encyclopedic information there, users can just go to Alpha instead, where they will get a direct answer to their question, as well as a nicely presented set of graphs and other info.
So this means we just get the straight answer in the future. No more thinking for yourself, no more understanding where the answer comes from, no more critical thinking about the validity of the answer. E.g. TFA mentions that the answer to how many Internet users there are in Europe includes the factoid that there are only 93 in Vatican City. Is this true? Well it must be because Alpha gives it, right? Or maybe it is not true? But why would it be not true and what would be a more realistic number? How many people do really live/work in Vatican City, for example? How does this relate to the number of Internet users?
An encyclopedia search will give one heaps of background information that is highly relevant to the question, and gives a lot of understanding about the answer. It makes the answer more than just a number.
For example if one would look up the question "what is the national flag of the USA", the answer is of course "the stars and stripes", and may include an image. But now I happen to know there is a story behind it: why this number of stars, and that number of stripes, and those colours. I bet this will be in Wikipedia's answer but not in Alpha's answer.
Search engines like this sound really interesting to me, and can be very useful, though it will never replace textbooks and encyclopedias. There is just so much more to answer to a query than just a straight number. And there are so many questions that can not be answered that way, such as "why is polcarbonate so much more temperature resistant than polyethylene?" for example. The full answer to this question includes details about the chemical make-up of the two polymers, and how polymer chains work. That is what textbooks are for.
This guy is indeed so full of himself (Score:3, Interesting)
Not useful for the unwashed masses (Score:4, Interesting)
Not only is this not a Google killer (it's not even a search engine, so how can it be?!), I very much doubt it'll be of any use/interest to anyone outside of the intellectual elite. Googling for "swine flue" is of widespread interest, but I suspect that Alpha-ing for computed relationships and statistics is not.
http://www.google.com/trends/hottrends [google.com]
For anyone who has yet to read about Alpha, what it is is basically a large expert system written in Mathematica that computes the answers to queries covering a very large real-world knowledge domain. I havn't even read that it goes out to the web at all - it's basically based on a huge human collated/organized ("curated" in the Alpha parlance) data set of statistics and relationships. Apparently the results are presented in a very slick way including charts and graphics.
No doubt Alpha is a huge achievement in it's chosen domain of knowledge organization and computation, but I find it hard to imagine that a significant portion of the population will find it useful.
Re:search engine that supports pregex (Score:5, Insightful)
I'm still waiting on a decent search engine that supports perl regular expressions
You'll be waiting for a long time. It's impossible to index a database for matching via regex, therefore searches on such an engine would be inordinately expensive to process.
Parent
Re:search engine that supports pregex (Score:5, Informative)
You'll be waiting for a long time. It's impossible to index a database for matching via regex, therefore searches on such an engine would be inordinately expensive to process.
Heh, check the Syntax and Examples here: http://www.google.com/codesearch [google.com]
:).
I mean no offense, however if one can't do it in 5 mins with with an off-the-shelf SQL database, doesn't mean no one can do it
Parent
Re: (Score:2)
Probably because there's no discernible difference between them: Alpha is described to be a web page where you type queries into a text box (queries much like you'd type into Google, it appears), click a button, and it gives you answers that are somehow better than Google's.
Or it could that all the tech reporters just like hating on Google and hope that some uber-genius will come along and smack them down David vs. Goliath style. (Disclaimer: In no way am I saying Wolfram is any kind of uber-genius.)