How Do You Visualize 100 GB of Google Text Data?

Posted by CmdrTaco on Tuesday January 11, 2011 @02:03PM from the do-you-see-what-i-see dept.

An anonymous reader writes "There is an amazing series of charts that visualizes trigrams and bigrams, portions of sentences that have been extracted from Google's web data set. The graphs highlight word associations and the frequency with which we use them on web pages. Chris Harrison from Carnegie Mellon University found, for example, that the word 'he' is often tied to 'argues,' while 'she' is found often with 'loves.' There are also word-relation charts that highlight words used in combination with their opposites, such as good and bad, peace and war, and PC and Mac." There are a lot of these things, and they're really interesting to browse through.
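For readers unfamiliar with the terms: in the summary's sense, a bigram is a pair of consecutive words and a trigram is a run of three. The snippet below is only a minimal sketch of how such counts could be gathered from raw text. The sample sentence, the tokenizing regex, and the `word_ngrams` helper are illustrative assumptions, not the method or data behind the charts.

```python
from collections import Counter
import re


def word_ngrams(text, n):
    """Count word n-grams: tuples of n consecutive lowercase words."""
    # Crude tokenizer (an assumption): runs of letters and apostrophes.
    words = re.findall(r"[a-z']+", text.lower())
    # zip over n shifted copies of the word list yields each n-word window.
    return Counter(zip(*(words[i:] for i in range(n))))


# Made-up sample sentence, not taken from the Google data set.
sample = "he argues that she loves him and he argues that he loves her"
bigrams = word_ngrams(sample, 2)
trigrams = word_ngrams(sample, 3)

for gram, count in bigrams.most_common(3):
    print(" ".join(gram), count)
```

Sorting the followers of one starting word by count gives roughly the kind of ranked list that the charts then draw.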

This discussion has been archived. No new comments can be posted.

/.ed (Score:2)

by grub ( 11606 ) writes:

Was this "anonymous reader" the guy who owns the blog?
- Re: (Score:2, Insightful)
  
  by Anonymous Coward writes:
  
  You're not missing anything - the images are unreadable even at 200% or more.
  Anyway, I don't get what they're illustrating. Word relations? So what.
  This is a "Digg" sort of submission ... back over to Fark for me.
  - - - OT: old, busted news ( was Re:/.ed ) (Score:1)
        
        by sleepy_weasel ( 839947 ) writes:
        
        Also, I'll read stories on Fark that I'll see a couple weeks later here. It's getting to the point where I don't need to come here for news anymore. Check the 'Geek' tab on Fark, or check my feeds from Techdirt, New Scientist, Wired, CNet or SciAm, and I've got all the news a good week before /. Once in a while, there is a rare gem on the feed here, but it's sad, as I came here a lot a year or two ago... now, I just come here to check what the iFanboys like to say, and to hear what Linux and Microsoft
- Re: (Score:2)
  
  by icebike ( 68054 ) writes:
  
  Quite possibly.
  That last bit about "really interesting to browse through" was a pretty big clue, since I don't find any of this all that interesting, or unexpected.
  Word association games have been played for centuries.
  Picking sets of following words given any first word is child's play, and doing it by computer is pretty meaningless until you add other characteristics, such as regional differences, time differences (50 years ago vs Today) or something to actually reveal something useful.
  More interesting would
  - Re: (Score:2)
    
    by Desler ( 1608317 ) writes:
    
    That last bit about "really interesting to browse through" was a pretty big clue, since I don't find any of this all that interesting, or unexpected.
    That wasn't part of the original submission. That was added by Taco.
Having trouble visualizing (Score:1)

by epdp14 ( 1318641 ) writes:

due to the server being slashdotted. Anyone have a mirror or alternate link?
- Re: (Score:2)
  
  by shadowknot ( 853491 ) * writes:
  
  Google Cache [googleusercontent.com]
- Re: (Score:1)
  
  by Anonymous Coward writes:
  
  Here is a copy of the PDFs, if you want to just view the results.
  http://www.mediafire.com/?ua4dhfxmry2nnhn
  Posting anon for non-karma whoring reasons.
- Re: (Score:2)
  
  by noidentity ( 188756 ) writes:
  
  Coral Cache [nyud.net] (just add .nyud.net to any URL's hostname)
- Re: (Score:2)
  
  by ae1294 ( 1547521 ) writes:
  
  Bing Cache [encycloped...matica.com]
pdf (Score:2)

by jcombel ( 1557059 ) writes:

his files are hosted in *.pdf files. tried looking at them in a windows 7 and an ubuntu machine, both have the text with unreadable lines through them. why would you host graphics as pdf?
- Re: (Score:1)
  
  by Anonymous Coward writes:
  
  by jcombel (1557059) writes: Alter Relationship on 01-11-11 11:11 (#34837900)
  Sweet, 511!
- Re: (Score:2)
  
  by icebike ( 68054 ) writes:
  
  his files are hosted in *.pdf files. tried looking at them in a windows 7 and an ubuntu machine, both have the text with unreadable lines through them. why would you host graphics as pdf?
  Scalability is my guess. I found that using Chrome I could zoom such that the smallest text is visible (within the browser). Same with Foxit PDF reader.
  No unreadable lines seen here.
  - Re: (Score:2)
    
    by zach_the_lizard ( 1317619 ) writes:
    
    If he wanted scalability, he should have saved them in SVG format. As it stands now, I can't read them; Okular is rendering them pretty weirdly.
    - Re: (Score:2)
      
      by icebike ( 68054 ) writes:
      
      I have trouble with Okular and Adobe Reader on linux as well.
      I suspect some form of embedded fonts was used that works well on Windows but not elsewhere.
      Oddly enough, google chrome's internal sandboxed pdf rendering engine has no problem on Windows or Linux, and since that is my normal browser I didn't even notice problems on Linux.
      - Re: (Score:2)
        
        by ultranova ( 717540 ) writes:
        
        I suspect some form of embedded fonts was used that works well on Windows but not elsewhere.
        
        Doesn't work on Windows either. And why would embedded fonts be platform-dependent anyway? Don't PDF renderers do document rendering internally?
        I suspect that the PDF files are simply faulty.
        
        Re: (Score:2)
        
        by icebike ( 68054 ) writes:
        
        I suspect that the PDF files are simply faulty.
        Then how do you explain they work just fine for me on Win 7 and also in Google Chrome regardless of platform?
        
        Re: (Score:2)
        
        by Confusador ( 1783468 ) writes:
        
        Both platforms have excessive fault tolerance not found elsewhere?
  - Re: (Score:2)
    
    by Tobenisstinky ( 853306 ) writes:
    
    Works fine in Safari (on Mac) at maximum zoom, the smallest text appears like a 36pt font, with no jaggies...
    - Re: (Score:1)
      
      by morgan_greywolf ( 835522 ) writes:
      
      Like I thought. It's a font issue. What font is it?
      Linux/Windows people: If you want to view these things, you need to get some Mac fonts.
      - Re: (Score:2)
        
        by icebike ( 68054 ) writes:
        
        Linux/Windows people: If you want to view these things, you need to get some Mac fonts.
        Not true.
- Re: (Score:2)
  
  by jandrese ( 485 ) writes:
  
  Yeah, they're totally unreadable (missing blocks everywhere) with Acrobat reader.
- Re: (Score:1)
  
  by ILuvRamen ( 1026668 ) writes:
  
  I'm on XP and with Adobe Reader X 10.0.0, had the same black line overlay problem at all zoom levels. Dunno why.
- Re: (Score:1)
  
  by morgan_greywolf ( 835522 ) writes:
  
  Agreed. They're illegible even if I view them in the latest version of Adobe Reader on either Linux or Windows. They're not images, though, they're text rotated using PostScript/PDF commands. Any reports from the iPeople? It may be a font issue.
- Re: (Score:2)
  
  by Colonel Korn ( 1258968 ) writes:
  
  his files are hosted in *.pdf files. tried looking at them in a windows 7 and an ubuntu machine, both have the text with unreadable lines through them. why would you host graphics as pdf?
  Same problem here with XP and Adobe Reader 10.
- - Re: (Score:1)
    
    by sexconker ( 1179573 ) writes:
    
    XP, Adobe Reader 9 or some shit with all updates, black lines all over the place.
    Works fine if I open it up in Adobe Acrobat 8.
    Couldn't care less about figuring out why, or the content of the PDFs. Take your fucking word graphs, tag clouds, and other useless shit back to 1999, where you'll still be recognized as completely useless.
- Re: (Score:1)
  
  by fyndor ( 895340 ) writes:
  
  The answer to why is that it is not a graphic/image. It is text shaped in a "half circle". I use Chrome, and as others say that works. He probably didn't notice the problem because he likely uses Chrome (and so should you? After all it is freaking fast as hell, I use it for my day to day). SVG seems like a bad idea as well because it is not supported by IE except for v9 beta (which btw renders this incorrectly as well). I am not even sure what he should have used since it's not a good idea to either pu
This can be used to preload a "human-like" ai (Score:5, Interesting)

by presidenteloco ( 659168 ) writes: on Tuesday January 11, 2011 @02:14PM (#34837936)

With a semantic network which reflects how humans relate various concepts together, and what topics and relationships humans care about.
Yes it will be biased and partial and rough, but it's a good start.
More formal reasoning and association techniques, such as bayesian stuff, logic, etc will also be needed for general AI, but for the
knowledge base to be grounded in human concerns and human perceptions; that's a key to an ai we can relate to and which can
relate to us.
I imagine this kind of semantic network will be usable for google 2.0 "pre-emptive search" or "my virtual social planner and concierge".

- Re: (Score:2)
  
  by Kilrah_il ( 1692978 ) writes:
  
  Yes it will be biased and partial and rough...
  Just like most humans.
  More formal reasoning and association techniques, such as bayesian stuff, logic, etc will also be needed for general AI...
  Because we all know that most people use reasoning and bayesian logic everyday.
- Re: (Score:1)
  
  by korgitser ( 1809018 ) writes:
  
  Semantics is all fluffy and stuff, but you are nowhere near AI until the computer can actually comprehend meaning. Semantics is just yet another buzzword for 'dead data, somewhat organized, but still dead, which we hope will make AI'. Building larger or better organized datasets will get us nowhere if we cannot put the initial 'cogito, ergo sum' into the machine. (And yes I know the 'cogito' is not the ultimate first thought of any mind.) The defining characteristic of life is the fact that data has meaning
  - Re: (Score:1)
    
    by tb()ne ( 625102 ) writes:
    
    The defining characteristic of life is the fact that data has meaning to it.
    I'm guessing most biologists would disagree.
    - Re: (Score:1)
      
      by korgitser ( 1809018 ) writes:
      
      I'm guessing most biologists would disagree.
      Of course they would. But they also disagree on the characteristics of life. Biosemiotics on the other hand has no question about it.
      - Re: (Score:2)
        
        by tophermeyer ( 1573841 ) writes:
        
        I'm guessing most biologists would disagree.
        Of course they would. But they also disagree on the characteristics of life. Biosemiotics on the other hand has no question about it.
        ...right. Because biosemiotics is a field dedicated to studying how living organisms process and interpret data. Your statement is tautological. Biosemioticists have no question because their field is predicated on it.
        Making the claim that anything is the 'defining' characteristic of life is a little rash, because the definition of life is still kind of up in the air. Clearly, there is some disagreement as to what constitutes life.
  - Re: (Score:2)
    
    by ultranova ( 717540 ) writes:
    
    Semantics is all fluffy and stuff, but you are nowhere near AI until the computer can actually comprehend meaning.
    
    It already is. A hard drive controller comprehends the meaning of alternating magnetic patterns on the disk: a sequence of ones and zeroes. A processor comprehends a higher-level meaning: a stream of assembly instructions. An operating system comprehends the yet higher level of meaning: a page of code belonging to firefox.exe that was just swapped in and began executing.
    This phenomenon should b
    - Re: (Score:3)
      
      by glwtta ( 532858 ) writes:
      
      Meaning what, exactly speaking? What is this "cogito" you're talking about and how does it differ from "mere" data processing?
      
      We don't know. We don't have even the faintest beginnings of a "theory of intelligence".
      
      Which doesn't mean that you can just ignore it, start throwing data at simplistic machines and expect (strong) AI to just happen.
      - Re: (Score:2)
        
        by ultranova ( 717540 ) writes:
        
        Meaning what, exactly speaking? What is this "cogito" you're talking about and how does it differ from "mere" data processing?
        
        We don't know. We don't have even the faintest beginnings of a "theory of intelligence".
        Yes we do. We have a whole branch of science [wikipedia.org] concerning the matter. Which is precisely why I asked: the grandparent post sounds suspiciously like semi-mystical pseudophilosophy that gets thrown around because people don't actually want to know how their minds work and prefer to think them as ma
        
        Re: (Score:2)
        
        by glwtta ( 532858 ) writes:
        
        Yes we do. We have a whole branch of science concerning the matter.
        
        Sure we do, it's just that so far they have not come up with anything concrete. Oh, they've done lots of work poking around the edges, but the main question is still pretty much the same - what is intelligence? Perhaps "not the faintest beginnings" was a little strong, I'll rephrase as "have not made consistent progress towards" understanding intelligence.
        
        Which is precisely why I asked: the grandparent post sounds suspiciously like se
    - Re: (Score:2)
      
      by korgitser ( 1809018 ) writes:
      
      The _engineer_ behind the hard drive controller comprehends the meaning of alternating magnetic patterns on a disk. The hard drive controller or any automated system comprehends it no more than a clock comprehends time. Computers are not smart in any way, they are just clockwork; it's only people who have become smarter in programming. And making an honest face while selling hot air.
      - Re: (Score:2)
        
        by ultranova ( 717540 ) writes:
        
        Computers are not smart in any way, they are just clockwork; it's only people who have become smarter in programming.
        
        Your brain is a clockwork mechanism, yet it somehow manages to be "smart", or at least appears that way to you.
        
        Re: (Score:1)
        
        by tehcyder ( 746570 ) writes:
        
        Your brain is a clockwork mechanism
        In which case, why don't you just build one and prove it?
        Oh, that's right, you can't.
        
        Re: (Score:1)
        
        by badkarmadayaccount ( 1346167 ) writes:
        
        Give me time.
    - Re: (Score:1)
      
      by tehcyder ( 746570 ) writes:
      
      So, I'd say it's simply a matter of overall complexity whether we'd call something alive or not.
      I'm sure the whole internet is more complex than an amoeba, but that doesn't mean it's alive.
- Re: (Score:2)
  
  by icebike ( 68054 ) writes:
  
  I doubt you can derive human like artificial intelligence from simple word order frequency charts.
  People, or at least intelligent people, start saying something with a destination in mind, not simply to mimic some statistical summary.
  Word order charts made today will be different in 6 months, as new phrases enter common usage, but does that mean human relationships or topics change that much over 6 months?
  This reminds me more of the Bing TV ads than anything else.
  - Re: (Score:2)
    
    by ultranova ( 717540 ) writes:
    
    I doubt you can derive human like artificial intelligence from simple word order frequency charts.
    
    It's been done already, and the resulting AI [mit.edu] was good enough to get three papers submitted to a computer science conference.
- Re: (Score:3)
  
  by ultranova ( 717540 ) writes:
  
  With a semantic network which reflects how humans relate various concepts together, and what topics and relationships humans care about.
  
  Wouldn't it make more sense to simply point it to Wikipedia?
- Re: (Score:2)
  
  by tgv ( 254536 ) writes:
  
  No, this just leads to symptom modeling. There is no relation between "he" and "argues" or "she" and "loves" other than that they occur more frequently in the texts that comprise the corpus. I've done corpus studies, and if you look at word frequencies from a certain corpus, i.e. unigrams, they look ok, until you compare them to another one. One of them had 3rd person personal pronouns high, but the rest low, but in another, the 1st person singular (I) was the most frequent word. The difference? The former
Can I do my own searches? (Score:2)

by Locke2005 ( 849178 ) writes:

I'd like to see what the correlation is between the two words "microsoft" and "sucks".
- Re: (Score:1)
  
  by benjamindees ( 441808 ) writes:
  
  The results sort of surprised me.
  microsoft sucks vs. microsoft doesn't suck [googlefight.com]
  Hmm, let's see what's going on here.
  microsoft doesn't suck vs. microsoft doesn't suck that much [googlefight.com]
  That makes more sense.
  - Re: (Score:2)
    
    by debrain ( 29228 ) writes:
    
    Slashdot appears twice as often as MSNBC [googlefight.com].
- Re: (Score:1)
  
  by marpot ( 1311479 ) writes:
  
  Try this: http://www.netspeak.org/?query=*%20microsoft%20sucks%20* [netspeak.org]
ngrams (Score:2)

by cangrande ( 199946 ) writes:

Go to the http://ngrams.googlelabs.com/ [googlelabs.com] site and compare word frequency between 'pirates' and 'ninjas'. Please.
- Re: (Score:1)
  
  by noidentity ( 188756 ) writes:
  
  This guy also did that on his earlier visualizations [nyud.net] (not in this current "peacock" style though).
Easy! (Score:2)

by countSudoku() ( 1047544 ) writes:

Just use grep, or vi with a heavy object on the down-arrow key. What did I win?
- Re: (Score:2)
  
  by JamesP ( 688957 ) writes:
  
  nah, just use cat and read really fast
  - Cat Abuse (Score:2)
    
    by bananaendian ( 928499 ) writes:
    
    nah, just use cat and read really fast
    RYRYRYRYRYRYRYRYRY...
    This is an obscene abuse of a perfectly innocent program meant to concatenate files.
    I'll have you know I've called the Unix Police and they will be picking you up shortly.
    And you don't have to read fast. All you need is a 45.5 baud teletype machine and filename > /dev/tty
    Personally I prefer to read the punchtape directly though ... with a torch.
Kudos to Chris Harrison, though (Score:4, Insightful)

by Kupfernigk ( 1190345 ) writes: on Tuesday January 11, 2011 @02:22PM (#34838020)

He does these really interesting data visualisations and publishes them for free - and what do people do?
"Was this "anonymous reader" the guy who owns the blog?"
"his files are hosted in *.pdf files. tried looking at them in a windows 7 and an ubuntu machine, both have the text with unreadable lines through them. why would you host graphics as pdf?" - mine don't.
I am slowly recovering from flu. What's the justification for all you miserable bastards out there? This is genuinely interesting stuff presented in an accessible way, and is the sort of thing /. should be about (checks karma and mod points - yup, probably allowed to say that.)

- Re: (Score:1)
  
  by Anonymous Coward writes:
  
  I'm sorry, but there is no rationale to call this ( http://gyazo.com/57fe0a7de30d5bfbbeb4998b74730fc3.png ) GOOD. Who failed here? Sure, Adobe has their part after .pdf "being demonstrated" [sic!] as a very "robust" format at the 27c3 (you can put all kinds of shit into an uncompiled pdf - it will compile and execute on launch without asking).
  But I have done comparably complex graphics in pdf and those did not fail - so what's the problem? I use win7x64.
  - Re: (Score:1)
    
    by Anonymous Coward writes:
    
    Windows XP (at work), and I've got the same problem.
- Re:Kudos to Chris Harrison, though (Score:4, Funny)
  
  by FrankDrebin ( 238464 ) writes: on Tuesday January 11, 2011 @02:36PM (#34838204) Homepage
  
  'he' is often tied to 'argues,'
  I don't agree.
  
- Responding to myself (I don't respond to ACs) (Score:2)
  
  by Kupfernigk ( 1190345 ) writes:
  
  Why are all the posters bitching about the PDFs ACs?
  I ask simply because I have viewed them today on the latest Chrome on Ubuntu 10.10 and Windows 7, and I cannot reproduce the problem, even on a crappy 4 year old laptop.
Poor way of presenting (Score:2)

by noidentity ( 188756 ) writes:

Wouldn't it be better to just present it as a list of words, so that it could be rendered in HTML? For example
cold
winter steel case turkey
blood ...
weather ...
spring ...
air ...
water ...
springs spots ...
products new spot ...
hot
with the word lists getting smaller as you go to the right, of course (the ... lists words I can't make out in his image). No need for the "peacock" arrangement that reduces readability and requires it being stored as an image.
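A rough sketch of how the plain-HTML version suggested above might be generated. It uses only the words legible in the list (the "..." placeholders are left out, not guessed), and it reads "getting smaller as you go to the right" as a shrinking font size, which is an interpretation. The `rows_to_html` helper and its pixel sizes are hypothetical.

```python
from html import escape


def rows_to_html(top, bottom, rows, base_px=24, step_px=4, min_px=10):
    """Render rows of words between two pole words as a plain HTML list.

    Each word further to the right in a row gets a smaller font size,
    never going below min_px.
    """
    parts = [f"<h2>{escape(top)}</h2>", "<ul>"]
    for row in rows:
        spans = []
        for i, word in enumerate(row):
            size = max(base_px - i * step_px, min_px)
            spans.append(
                f'<span style="font-size:{size}px">{escape(word)}</span>'
            )
        parts.append("  <li>" + " ".join(spans) + "</li>")
    parts += ["</ul>", f"<h2>{escape(bottom)}</h2>"]
    return "\n".join(parts)


# Only the words that could be made out in the comment above.
rows = [
    ["winter", "steel", "case", "turkey"],
    ["blood"],
    ["weather"],
    ["spring"],
    ["air"],
    ["water"],
    ["springs", "spots"],
    ["products", "new", "spot"],
]
page = rows_to_html("cold", "hot", rows)
print(page)
```

The output is ordinary text in the page, so it stays searchable and zoomable in any browser without a PDF viewer.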
- Re: (Score:2)
  
  by Colonel Korn ( 1258968 ) writes:
  
  Wouldn't it be better to just present it as a list of words, so that it could be rendered in HTML? For example
  cold
  winter steel case turkey
  blood ...
  weather ...
  spring ...
  air ...
  water ...
  springs spots ...
  products new spot ...
  hot
  with the word lists getting smaller as you go to the right, of course (the ... lists words I can't make out in his image). No need for the "peacock" arrangement that reduces readability and requires it being stored as an image.
  I think that Tufte would agree with you.
- - Re: (Score:1)
    
    by noidentity ( 188756 ) writes:
    
    That's the point; making it colorful and interesting caused a significant reduction in utility. The plain HTML list approach would have communicated the same information, yet been easily viewable and searchable in any browser, with no need for a PDF. It could still have been with the color gradient as well, and black background.
Easy. (Score:3)

by Beelzebud ( 1361137 ) writes: on Tuesday January 11, 2011 @02:26PM (#34838072)

We'll start off by imagining 1 GB of data. Now multiply that by 100!

- Re: (Score:1)
  
  by tehcyder ( 746570 ) writes:
  
  Basically, all the worst parts of the bible.
Cats v. Dogs (Score:1)

by drunkenkatori ( 85423 ) writes:

Looking at the Cat vs. Dog picture, all I can say is, "What's wrong with dog people?"
- - Re: (Score:1)
    
    by tehcyder ( 746570 ) writes:
    
    Pretty sure there isn't such a thing as 'kitty-style'.
    Amateur.
- Re: (Score:2)
  
  by jittles ( 1613415 ) writes:
  
  Well you don't see people talking about having sex "kitty style" now do you? So some of the hits on dog may be due to that and not just people who like to feed their dog peanut butter...
Women and Men (Score:1)

by dragonxtc ( 1344101 ) writes:

That one is kind of disturbing.
Warning - unfiltered (Score:2)

by SuperKendall ( 25149 ) writes:

Dog-Cat chart NSFW
How GOOG does it: (Score:1)

by shitetaco ( 1954742 ) writes:

How Do You Visualize 100 GB of Google Text Data?
Easy:
$$$$$$$$$$
Visualization? (Score:1)

by schlameel ( 1017070 ) writes:

Visualization = Dark Background + Light Words + Pretty Lines
How does that give me any sort of understanding of the content?
- Re: (Score:1)
  
  by DeadDecoy ( 877617 ) writes:
  
  I agree somewhat. The problem is twofold: graphing libraries do the same things and there is not much meaning to be had in the raw data. For the former item, many visualization libraries are designed to display graph/network data somewhat gracefully. Consequently, many visualizations center around "how do we put this thing in graph form?" rather than "what interface naturally explains this data best?" The second problem is that this huge morass of data just has frequency counts and n-grams. So, we sorta know
Astonishing ... (Score:3)

by foobsr ( 693224 ) writes: on Tuesday January 11, 2011 @03:08PM (#34838578) Homepage Journal

... progress.

Corpus linguistics

http://en.wikipedia.org/wiki/Quantitative_linguistics [wikipedia.org]

Interestingly enough, most relevant authors (e.g. Kaeding) were not cared for.

CC.

- - Re: (Score:1)
    
    by tehcyder ( 746570 ) writes:
    
    Er, I think GP meant left and right as in left wing and right wing politics, although admittedly I've never seen lefties and righties used in that way before.
guess the word (Score:1)

by sleepy_weasel ( 839947 ) writes:

I can guess some of the words... but it required blowing up the pics to 2400% and I was using Adobe PDF.
My only question is what to do with it. If you are trying to add keywords that will make your site more search worthy, I can understand, or to show a line of thinking how people associate terms. 'Hot and cold' gets you to "environment" "water" "pool"... Might be fun for word association tests.
- Psychologists. (Score:2)
  
  by Kupfernigk ( 1190345 ) writes:
  
  They are going to have a field day (more likely, a lot of field days.)
as a fraction (Score:2)

by owlnation ( 858981 ) writes:

It's easy to visualize 100GB of data. Just view it as a percentage of the Library of Congress -- e.g. a door, or small closet.
Some more Google N-Gram finds (Score:2)

by bgspence ( 155914 ) writes:

http://ngrams.googlelabs.com/graph?content=blue%2Cred%2Cgreen%2Cyellow&year_start=1880&year_end=2008&corpus=0&smoothing=3 [googlelabs.com]
http://ngrams.googlelabs.com/graph?content=Britannica%2CWikipedia&year_start=1800&year_end=2010&corpus=0&smoothing=3 [googlelabs.com]
http://ngrams.googlelabs.com/graph?content=1881%2C1891%2C1901%2C1911%2C1921%2C1931%2C1941%2C1951%2C1961%2C1971%2C1981%2C1991&year_start=1880&year_end=2008&corpus=0&smoothing=3 [googlelabs.com]
http://ngrams.googlelabs.com/graph?content=poker%2Cc [googlelabs.com]
bigram means two characters (Score:2)

by misof ( 617420 ) writes:

I wish people would stop using the words "bigram" and "trigram" incorrectly. The "-gram" suffix comes from a Greek word for "a written character", the same root is in the word "grapheme". Hence bigram == a two-character substring, and trigram == a three-character substring. And these words are actually being used in the correct sense as well. Two-word and three-word substrings should IMHO be called "bilexes" and "trilexes", or something similar. But a good first step is to stop calling them bigrams and trig
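To make the two senses concrete without taking a side on the terminology, here is a small sketch in which the same sliding window produces either kind of "bigram", depending on whether it is handed a string of characters or a list of words. The `ngrams` helper and the sample inputs are illustrative assumptions.

```python
def ngrams(seq, n):
    """Return every run of n consecutive items in seq as a tuple."""
    return [tuple(seq[i:i + n]) for i in range(len(seq) - n + 1)]


# Character sense: two-character substrings of one word.
char_bigrams = ngrams("peace", 2)

# Word sense (the one used in the story): pairs of adjacent words.
word_bigrams = ngrams("peace and war".split(), 2)

print(char_bigrams)
print(word_bigrams)
```

The only difference between the two uses is the unit being windowed, which is why one term ends up covering both in practice.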
How? (Score:1)

by Arador Aristata ( 1973216 ) writes:

File -> Print
Same way you visualize anything else (Score:2)

by halcyon1234 ( 834388 ) writes:

With your eyes. Your eyes.
