Printed Embedded Data GUIs 111
n7lyg writes: "Xerox PARC has come up with a way to embed data in printed images that involves using something called DataGlyphs. A DataGlyph is essentially an oblong pixel that takes the value of zero or one depending on whether it is printed angled to the left or right. Printed at sufficiently fine resolution, this is no different from ordinary offset printing effects using circular pixels, but when scanned by a computer allows recovery of arbitrary data embedded in the images or text of the printed page. An article in this month's IEEE Computer contains a lot of interesting applications of this technology, including a system to allow teachers to create printed tests and lab assignments with embedded DataGlyphs to allow automatic generation of graded and annotated results." I think we've done an article on this before, but I don't see it in the archives...
MacigPen (Score:1)
This reminds me of those childrens books which came with a small 'watermark' reading light pen that could read the right answers off the page. You would answer the question by filling A, B, C or D in the box and then you would wave the pen over the answer box and it would read the correct answer, thus you could test yourself. It never really cought on and the book/pen sets were quite expensive, but this seems to be targetted (at least partially) at this market. It could also have an application in barcodes that are more pleasant to the eye.
Used on tests? (Score:1)
chilliware (Score:1)
i bought all 3 of their programs, and now they are just GONE!!!
no website, no phone, no email!1!!
wtf?
It doesn't matter. Team Xerox is going belly up (Score:3)
They're called UPC symbols (Score:2)
Like all steganography.... (Score:2)
Re:It doesn't matter. Team Xerox is going belly up (Score:2)
Depending on the paper and ink used, this could be good way to archive long time storage. With a magnifying glass and sufficient patience, it would be human readable too.
However, I think your amount per page is off. If a 3x3 pixel block is used, shouldn't there be 4200 lines of data on the page? With 400 bytes/per line, that works out to about 1.6M. Using a 2x2 pixel block, there would be 6300 lines, and about 3.6M. Even if a blank line is put between each data line, that could still be a significant amount of data.
Re:Teachers' laziness would backfire (Score:3)
I didn't really see how this would aid in automatic grading of tests or homework. If the kids have to use a computer & printer to generate the results, then why not transfer it electronically? If they are to use the good ole No. 2 pencil, why not just use 'fill in the dot' type forms?
Fingerprints (Score:1)
__
Re:I work on this, ask away (Score:1)
What kinda headline is that? (Score:2)
But umm, where'd the "embedded" and "GUI" come from? Were those just buzzword burps as you were trying to type "Printable Data"?
Some properties of watermarks AND of barcodes (Score:1)
Re:I work on this, ask away (Score:1)
Actually, we're quite robust. Read the article for details about synchronization in DataGlyphs. Redundancy is adjustable, and I can personally attest that DataGlyphs have survived my coffee spill. Haven't tried Guiness yet.
Re:I work on this, ask away (Score:1)
User Interfaces." Read the article.
I work on this, ask away (Score:5)
Better then barcodes (Score:2)
What is nice about this is that instead of watermarking, the pixels are shaped differently. Watermarking doesn't scan right, but this does.. I don't really see how this can be considered new at all.. and certainly not patentable! But you never know what they will patent these days
Obvious concept with very little to no more realworld applications then a barcode has, only difference is that the untrained eye may not notice it's existance. What is good about this is, instead of putting an obvious and easily readable barcode on an identification card.. one can print the id card with glyphs, this isn't secure.. but it would prevent others from being able to read it without scanning it into a computer. (bar codes can be easily read by the trained eye). The identification card issue is brought up because there have been schools putting social security numbers on identification cards in barcode, or worse.. plain text..
speaking of plaintext info on identication cards, I wish they would either remove my social security number from my University id... or put encode it in a slightly more secure fashion!
Data glyphs are overkill for this. (Score:3)
A lower-tech solution is to just write on a serial number in UV-fluorescent transparent ink. This stuff's been around for years.
Or laser-print it at stupid-DPI in flyspeck 3.
Or just remember the order in which people go into the room and surveys go into the slot.
Now, unless you're carding the people for other purposes or otherwise already have their ID, this will avail you nothing...
pixels? (Score:2)
when printed, the dots that make up an image are typically referred to as.. um.. dots.
pixels are picture elements, usually on some kind of computer screen. dots of ink are just dots.
just so's you know...
Xerox Data Glyphs are WAY OLD (Score:1)
Their only advantage over other 2-D barcodes was supposed to be how they can masquerade as innocent grey background. That way you don't have to mess up your product's graphic layout with an unsightly bar-code symbol.
It must have sounded really "Hi-Tech" back then but checkers have enough trouble finding UPC symbols as it is, without hiding the things on purpose.
Whoopee... not.
Archival (Score:2)
In this case, it is good for data density. Encode information two ways. Great for encryption, embedded data (like XML for paper) and steganography.
--
Charles E. Hill
What's the fuss about? (Score:2)
Re:What's the fuss about? (Score:2)
I always notice this stuff, because I'm attuned to it. None of my non-industry friends do. If I ever happen to point it out, the reaction is always "Hey, wow, I never noticed that before". It's not an intelligence thing, either. I've got some smart friends, but that sort of stuff is just filtered out of the image by your brain if it doesn't mean anything to you.
Re:It doesn't matter. Team Xerox is going belly up (Score:2)
Re:Universal Turing Machine (Score:2)
Re:But why? (Score:1)
the use of 45 degree bars gives a uniform
grayscale look, regardless of runs of zeros and ones in the next.
Re:hey, my project! (Score:2)
I still own one.
---
Re:Privacy and "Anonymous" surveys with DataGlyphs (Score:2)
_____________
Re:DeCSS! (Score:1)
but not printable, subrosa material. see
http://home.earthlink.net/~retiarius
for further elucidation. embedding scientology
"scripture" within slashdot archives also comes
to mind...
Re:Could be powerful (Score:1)
If you start fiddling with the colour, then the image will be corrupted by the data.
Re:Not useful for preventing counterfeiting (Score:1)
"I'll take the red pill, no, blue. AAAHHHHHHHHHHHHHHHH........"
hey, my project! (Score:2)
From what I understand, this sounds somewhat like a project that I had dreamed of around 8 years ago. Basically, my idea was to eventually create a system that archived my data on a sheet of paper to be scanned back in for recovery. (Regular home printers and scanners would do just fine.)
The initial testing stage was to simply compress and then uuencode (possibly with some adjustments to the character set) any random file, print it out, and then OCR it back in and do the deuuencoding. This can theoretically be done with common tools available for years now, but 1) I've never owned a scanner in my life 2) Any large amount of data would take up lots of paper, and would require a lot of work as well.
Eventually, I would move further towards my own designs: namely inventing an encoding scheme that would allow a lot of information to be saved on the paper, perhaps up to or exceeding a megabyte. This would, of course, also require new software to decipher the data that was encoded onto the paper.
I used to dream of a world where software developers shipped the latest version of their programs on a few sheets of paper rather than a floppy disk or CD-ROM...
(It also would occur to me how funny and ridiculous it would be to store one's encoded pr0n archives in a regular binder...)
Re:hey, my project! (Score:2)
CDROMs basically use the same technology. Optically scanning a flat surface. Sorry, but there doesn't seem to be any need to reinvent the wheel... or disc as it may be.
That might be the case, but then you can make the same jump of logic by saying that CD-ROMs are "basically" the same as a floppy disk, because they store data on a disc, reading in a circular pattern, and then you can say a floppy disk is "basically" the same as magnetic core memory because they both work off storing bits magnetically. But when you step back and look at the whole picture, you begin to realize that data printed to a sheet of paper really has nothing to do with magnetic core memory, other than they both store data in one way or another.
What I'm looking for is a storage medium that doesn't decay with time. Paper burns and yellows, floppies and hard drives lose their can get erased. Maybe CDs are ok, but I don't know what the lifespan is.
My idea of archiving information on paper was not borne from the intention of solving any of those problems. In fact, I can only think of two primary advantages to this method over that of a floppy disk.
1) Ability to store, transmit, or copy the data as easily as any document (examples: store it in a binder, fax it to someone, or xerox it)
2) Ability to hide information in a format that most people wouldn't suspect.
Re:hey, my project! (Score:2)
I've never heard of this Cauzin SoftStrip... perhaps you might provide a picture or link?
Direct Marketing has been using this for years (Score:1)
Re:Privacy and "Anonymous" surveys with DataGlyphs (Score:2)
Another way to distributed DeCSS (Score:1)
Re:and the government? (Score:1)
I'll be the first to say it then (Score:1)
SAT test (Score:1)
But why? (Score:2)
Re:But why? (Score:2)
That's a good point. And it's one that's been brought up in connection with the USPS's "electronic postage" system, which encodes source, destination, and postage info in an ugly bar code block. Maybe this would work for that application. The Xerox scheme, though, requires high-resolution scanning, while the postal system has to read moving envelopes at very high speed.
Re: (Score:1)
Can you? (Score:1)
segfaulteq@home.com [mailto]
So let me see if I have this right? (Score:2)
Now if we were to go scan existing pictures or print for code, what type of meaningful information would there be in... say a typical picture in Playboy?
Re:Could be powerful (Score:2)
That was a rather narrow-minded statement of you...
so.. (Score:1)
________
Re:I work on this, ask away (Score:1)
I happened to be standing around when the inventor of DataGlyphs came in with his copy of the State Farm newsletter - he seemed quite pleased to be receiving mail using his invention.
Re:What's the fuss about? (Score:1)
Also, DataGlyphs are a couple of orders of magnitude higher density than standard barcodes. Two orders of magnitude is enough to make a qualitative difference in how something is used: applications that could not even be considered before are now easy, and so the way in which something is used can change drastically.
Re:hey, my project! (Score:1)
Basically it was just a barcode reader and software that could print barcodes. It worked well on my dot-matrix printer. Some of the hobby magazines printed their BASIC source listings in the barcode format in the margin as well as the human readable form. Like the previous poster said, it made a splash then went away. I thought that the durability and storage density made it an attractive alternative to floppy disks. Especially compared to the old Apple DOS floppy format. Archives were cheap, easy to reproduce, and could (if using acid free paper and good quality ink) last quite a long time.
But like most things, it's a matter of popularity, not technical quality. It just never caught on.
Of course nowadays, I just burn a CD when I want to archive or snail-mail somebody a binary. Still, it was a fun toy.
Re:What's the fuss about? (Score:1)
But that is precisely the great NEW feature of this technology: There's nothing to see! It is invisible to the unaided eye.
The data is embedded within any full colour image. This sets it totally apart from barcodes (Code3of9, etc), which really stand out and would be somewhat distracting.
The whole idea is that the data is there if you want to get at it, but if you don't want to know such details, then you don't even see them.
Also, there are 2D "barcodes" that are in use in manufacturing environments, but they are unsuitable for publication applications for the same reason that barcodes are: they rely on changing the intensity of the area being printed on.
-OzJuggler.
Bells! Whistles! Fireworks! (Score:2)
I think we've done an article on this before, but I don't see it in the archives...
Let me be the first to congratulate you with a hearty handshake for being the first Slashdot editor to check the archives before posting! <grin>
While this is interesting (Score:2)
The other thing I think I have archived achieves hiding data in an image file by subtly shifting the bits. Basically similar to a water mark, but for hiding binary data.
So while this is interesting, I'm hoping that they don't try for a patent or anything.
Check out the Vinny the Vampire [eplugz.com] comic strip
Here are some current applications of DataGlyphs (Score:5)
Jeff works on the DataGlyph technology. I work on applications of it. See http://www.xerox.com/flowport [xerox.com] for my application, which lets you put a sheet with DataGlyphs on your document, drop it in a copier, and get out a paper icon for the document (stored on a WebDAV or other network server). You put the paper icon back in, and press copy to get a copy, type an e-mail address to e-mail it, etc. It's like an electronic version of the paper document, but on paper. We call it a "Document Token". (There has been previous SlashDot discussion of Document Tokens and other applications of DataGlyphs, but I can't find it either).
We're also very interested in open standards, and participate in a variety of standards organizations on this and related technologys. For example I'm on the W3C XForms [w3.org] committee, designing the next generation of web forms, and paper is one of the new "devices" that is being targeted (along with voice, pdas, phones, etc.). Check it out and send your comments to the www-forms mailing list! As we said when XForms was launched [w3.org]:
Re:It doesn't matter. Team Xerox is going belly up (Score:1)
Using the word 'solution' implies that there was a problem, and now it is solved. I can't remember ever thinking, "Damn, I wish I knew how to imbed binary information hidden in some graphic". Mainly because I've never thought that.
Obviously now that we have found a 'soultion' we can go out and find a bunch of problems to solve. Don't get me wrong, It's at least a 8 on the cool-meter (Ascii art [rtty.com] being a 9), I just don't see it as a necessity.
Not to mention the fact that it isn't all that revolutionary, anyone could have done this (in a much rougher way) years ago. It's 1's and 0's, for crying out loud, we've had those oh, at least a few years now. Ordering and weighting them isn't making me buy Xerox stock.
Re:and the government? (Score:1)
If this becomes commonplace, then machines that can read it become commonplace and the government and anyone else just treats it like any other data medium.
Besides, how could they regulate it? All you'd need to use it is a very high-resolution printer.
Re:Old News - Same Xerox Problem: Productization (Score:1)
You would think there would be some limit to the number of times a single company could develop a really cool technology, only to waste it. If Xerox had played their cards right, every scanner, fax, copier, and printer manufacturer would be paying them royalties by now. DataGlyphs would be everywhere.
I first heard about DataGlyphs in this Wired [wired.com] story.
Re:What kinda headline is that? (Score:1)
Re:For those of you wondering about uses ... (Score:1)
When do DataGlyphs become public domain? (Score:1)
Re:I work on this, ask away (Score:1)
With the infrastructure above in place, now add:
http://www.paperdisk.com/ (Score:3)
This is the artical that your talking about (Score:1)
Re:Analog Data Glyphs (?) (Score:2)
Furthermore, it is immune to part of the paper being damaged.
From what I read about this a year ago, you cold print data in a fine grained "gray" pattern on the background of the paper. The stuff printed on the paper, such as a document, 1040 Long Form, etc. would be what you're paying attention to. Not the pattern on the "safety paper".
Re:and the government? (Score:2)
When I read the Xerox article last year it seemed to work like this. The data is encoded into in a background pattern on the paper. Sort of like the safety paper your personal checks are printed on. If you look closely, it's not a solid color, but white paper with colored wavy lines. The data glyphs is similar. But instead of wavy lines is's a pattern of slashes and backslashes that the eye just picks up as a uniform color pattern. The data is encoded all over the page. It is immune to part of the paper being torn, spindled, stapled, printed over, vomited on, etc.
Re:Privacy and "Anonymous" surveys with DataGlyphs (Score:2)
What if they e-mailed it to you as PDF?
It doesnt' need to have your identity on the paper, just a serial number that ties it back.
Re:This will be useful in 2004... (Score:2)
When you vote, you usually have to sign something that you voted. Now if you can just match up the order of signatures in the notebook with the order the blank ballots were removed from the stack, you can identify who voted for the bad guy and go re-educate them about how gravely mistaken they are.
Re:Privacy and "Anonymous" surveys with DataGlyphs (Score:2)
Maybe you took it as part of a study, and they pay you $20. Maybe they can tie the order you took the forms off the blank stack to who you are?
Maybe you take an anonymous survey in a mall, but you have to sign a seperate "guest book", allowing them to tie the card you took with the order of your signature. But you then go into a small room with other people filling out the card, and when finished, drop it in the box as you exit.
There are other possibilities.
Re:Data glyphs are overkill for this. (Score:2)
Too likely to be discovered. Most people know about secret inks. Not most people know about DataGlyphs. Furthermore, even of people who know, most wouldn't look for it. The way I read about it (last year) it disappears into the background color of the paper. Well, not really, but you don't notice it any more than the "color" of the safety paper your personal checks are printed on.
laser-print it at stupid-DPI in flyspeck 3
But now, you're starting to get into a technology race. Sorta like the government mint is in a technology race with counterfiters. One of the benefits of DataGlyphs was that it can be printed by common printers and is very un-obvious.
remember the order in which people go into the room and surveys go into the slot.
It's much more practical to identify people when they take the blank page off the "blank" stack. Even though the forms are pre-numbered, the numbering is invisible. People in a room filling out surveys dropping them into a slotted box as they finish in random order doesn't give you much of a way to identify surveys with individuals.
Re:Privacy and "Anonymous" surveys with DataGlyphs (Score:2)
Re:and the government? (Score:2)
The majority doesn't seem to mind so far.
Privacy and "Anonymous" surveys with DataGlyphs (Score:4)
Yeah you!
Would you like to participate in our anonymous survey? Yeah, it's completely anonymous. You don't write down any personally identifiable information. Just take this printed page and answer the questions on it. Then anonymously drop it through the slot in the box over there.
Oh, don't worry that it asks lots of deeply personal questions. After all, this is an anonymous survey. It cannot be tied back to you.
BTW, yes, Slashdot did have a story on this about a year ago. It really got me to thinking about the implications of DataGlyphs. Suppose each of those survey forms is "pre-numbered" using DataGlyphs. Now they just need to tie the fact that you took the "blank" form from the top of the stack with your identity -- and poof the survey is no longer anonymous.
Re:and the government? (Score:4)
The solution is so simple it should be obvious. Why do you even ask such silly questions. Are you dense?
SImply require all printed material to be government approved. Problem solved. Now wasn't that easy? Think of the benefits in reduced terrorism.
Re:Universal Turing Machine (Score:1)
Well, yeah, but every sort of way to encode data in a linear and discrete way is similar to a turing machine tape.
I don't think that there are any world shaking implications of being able to write executable code on a sheet of paper. :)
-
Re:hey, my project! (Score:2)
I submitted this to slashdot about 3000 years ago, and they rejected it.
The jist of it was you take this chisel, and a large smooth flat rock, and then another rock that fits nicely in your hand.
OK, stay with me here, it gets a little technical.
You create a sort of verbal encoding scheme, a way to represent spoken words in a visual way, then once you assign symbols to words, you take the "hand-rock", and strike the rear of the chisel, while the chisel is resting on the other, flat, rock.
I created a prototype of this system, it seems that it works best when you use a type of rock that doesn't fracture easily, but yet is still pretty hard.
Once we get the bugs worked out, I can release more details.
-
Re:Data glyphs are overkill for this. (Score:3)
Well, all your ideas have valid rebuttals.
1. UV Ink
Risk of discovery. Once one person discovers it, because say, they see it in the sunlight, which contains enough UV to see UV inks on light paper, then everyone will know, because that person probably will submit it to slashdot.
2. Tiny Fonts
Same problem, someone might notice and blow your cover.
3. Order of people in a room
Well that works OK if you have the people there, but most surveys are mail-in.
4. Unless you are carding them it doesn't matter.
Well this is my scenario:
Company mails out surveys with the person's identity printed in this new pixel format in a graphic on the survey or something. You could print it at the same time you print the envelopes, a simple mail merge.
There is little chance of discovery by the target people, and if they are mailing it back in one of those "business reply" envelopes with no return address, you could probably convince them it is anonymous.
Sure, this isn't really a new technology per se, there have been ways to covertly embed data in images for a while, but this is probably the first one that can reliably be printed out and rescanned with no data loss.
-
more like a bar code... (Score:1)
Analog Data Glyphs (?) (Score:2)
Prepare to see the DeCSS code hidden within your next Linux-related document, or hacker magazine.
If you scan it upside down, are all the ones zeros, and vice versa?
Re:I'll be the first to say it then (Score:1)
on the other hand, I wonder how that would work on beat-up-run-through-the-washer-lying-on-the-ground -etc.-etc. bills? I don't think MNP-4 is going to cut it :P
Re:chilliware (Score:1)
Re:so.. (Score:1)
Quite useful for OCR, perhaps security too? (Score:1)
"// this is the most hacked, evil, bastardized thing I've ever seen. kjb"
Re:hey, my project! (Score:1)
What I'm looking for is a storage medium that doesn't decay with time. Paper burns and yellows, floppies and hard drives lose their can get erased. Maybe CDs are ok, but I don't know what the lifespan is.
Similar to the ads in Wired ? (Score:2)
--
Re:Application in print media... (Score:2)
Re:Could be powerful (Score:1)
What happened to HTML? (Score:1)
Re:hey, my project! (Score:1)
Re:It doesn't matter. Team Xerox is going belly up (Score:1)
Ta-da! Instant encoding (without error correction), but the problem lies in OCR reading off the printed page. I think with quality printing, a good quality scanner and a conversion program, you could easily store lots of data in a super-cheap manner.
I figure a standard 8.5" x 11" (about 8" x 10.5" with margins) and a printer putting out 1200 DPI.
My figures are using 3 pixels per glyph bit in an 8-bit byte gives 50 bytes per inch and 400 bytes/line or with 2 pixels/glyph-bit = 75 bytes/inch or 600 bytes/line. That gives for 3 pixels/glyph-bit a page storage of 4200 bytes/page or for 2 pixels/glyph-bit 6300 bytes/page. Thats a whole 4 K to 6 K per page! I'm going to write a BASIC program and back up my 40 GB harddrive right away (if we tack on a full 4-color range we get 7 times the storage - whoa. I'm throwing out my floppies this instant!)
Re:Application in print media... (Score:1)
Teachers' laziness would backfire (Score:3)
There's a reason that most encryption and computer security measures are created in a way to suggest that you never, ever trust the client.
Bonus points to the kid who uses his Palm Pilot or TI-82 to read the answers embedded in his homework. I know I would have spent hours to outsmart the teacher's technology that otherwise could have been spent studying or doing the actual work.
Interesting technology, though...
Dataglyph of a dataglyph? (Score:1)
And again and again and again...
Dude! Let's do it! For great justice!!
>> me vs. the end of the world [antisocialism.org]
And to think my submission was rejected last month (Score:1)
http://www.intacta.com/html/inta6000.htm
Re:But why? (Score:1)
http://www.intacta.com/html/inta6000.htm [intacta.com]
Universal Turing Machine (Score:2)
This isn't new... (Score:1)
Hmm... I just thought of an interesting idea... You encode a MIDI file, or an MP3 on a piece of paper and it plays back when scanned in. Heh. What a way to distribute music, with a photocopier! Oh man, what will I think of next?
Sounds like old news to me (Score:1)
http://www.zdnet.com/zdnn/stories/news/0,4586,2
I grant that this isn't embedded in a "gui", but there's not much gui-looking about a Xerox "X" that looks more like ASCII art.
The REAL use of this (Score:2)
--jeff
and the government? (Score:3)
The US gov has already limited net encryption to 128 bits; will they step in and try to regulate this? I would think so...
For those of you wondering about uses ... (Score:4)
Some seem to have got the wrong idea. This is a way to add extra information into a printed page. For instance (taken from the pdf above) a physical blueprint could have extra information imbedded in it such as the pluming layout and the wiring information etc. Then when viewed under a Glyph-O-Scope (magic magnifier type device)the chosen extra information would be superimposed (aligned properly of course) and would move with the page.
It is not an encryption scheme. If you want to encrypt, you would do so before you encode.
If you try and add more depth to the encoding by adding colour to the scheme, you destroy the ability to have the message embedded in a full colour image.
Neat stuff. Check the pdf.
This isn't new... (Score:2)
I have read once, in my vast perusing of cryptography material, that back in WWI or WWII, a spy in Germany was sending messages out of there by encoding them in paintings. His method was by painting reeds in his pictures with different lengths - long for 1, short for 0. Fortunately for him, the Germans took a long time to figure this out. Unfortunately for him, they did figure it out.
Hope this is interesting.