Slashdot Log In
Metadata in Vista Could Be Too Helpful
Posted by
Zonk
on Fri Dec 23, 2005 10:31 AM
from the you-want-some-toast? dept.
from the you-want-some-toast? dept.
linumax writes "Windows Vista will improve search functionality on a PC by letting users tag files with metadata, but those tags could cause unwanted and embarrassing information disclosure, Gartner analysts have warned. Search and organization capabilities are among the primary features of Windows Vista, the successor to Windows XP due out late in 2006. While building those features, Microsoft is not paying enough attention to managing the descriptive information, or metadata, that users can add to files to make it easier to find and organize data on a PC, according to Gartner. 'This opens up the possibility of the inadvertent disclosure of this metadata to other users inside and outside of your organization,' Gartner analysts Michael Silver and Neil MacDonald wrote in a research note published on Thursday."
This discussion has been archived.
No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
Full
Abbreviated
Hidden
Loading... please wait.
Oblig. Nelson (Score:5, Funny)
Ha-ha! You're using Windows!
Non-Oblig. Homer (Score:4, Funny)
Bart: Isn't that just the wrong way?
Homer: Yeah, but faster!
Parent
Not just windows, Mac's too (Score:5, Interesting)
As a result I no longer have spotlight index my e-mails. And of course that's a pain in the ass since it means Mail.app's searhc feature is busted. While I can figure out how to work around that (e.g. don't use mail.app, which would be a pity), the story does not end there. Unfortunately, spotlight indexes my backup volumes too, and it can blunder across old mail there and index it.
Now you might think I could also turn off indexing the backup volumes but there's the rub. First I might not want to. Second, you can't always do it. Spotlight has some bugs in how it handles logical partitions on disks and in particular it sometimes ignores being told not to index a volume if another partitions is being indexed.
Anyhow eventually there will be more fine grained control on privacy, but then the interface will become more cludgy too. In fact that may just kill the whole fine grained control effort since most folks don't worry about this sort of things and would prefer simplicity.
It's perhaps worth noting that windows dropped making the filesystem a database (for now). That might be a smart move since making at a wrapper like spotlight means they are less locked into a single search design. Problems like this will emerge slowly and flexibility to plug problems will be needed.
Parent
other Automatic meta data generation issues (Score:5, Interesting)
Which of course means automated meta-data scraping. this leads to the problem of confidential info disclosure. that's obvious. But it also leads to another problem that annoying. When do you update the meta data? when the file is created or modified? a small lag? or in batch overnight?
On macs you can force a batch overnight search. But the default on is for instant updates. If you add a search term to a document WHILE a search is being performed in another window it will find it! amazing. and very useful too. And it assures things like computers that sleep at night and detachable drives stay indexed.
But it's also amazingly annoying when you stop doing conventional desktop activities and start doing more unix like things. Tage for example untarring a 30 GB archive with twenty thousand small files in it or something that is generating transisent files in a rapid fire fashion. Well you start untarring and for the first few files it zips along. then suddenly throughput nose dives. Why? you look at your processes and you see MDL the indexing programming is chewing up your disk access.
You can work around this if you can control the file names and make sure they are ones it will not index. But that's not assured, always possible, and will vary from computer to computer.
So anyhow there's lots of fine tuning needed on these ubiquitous metadata systems. Fine grained privacy control and fine grained operation modes so it's live in desktop application mode and lags in unix/high performance modes.
Parent
Re:Not just windows, Mac's too (Score:4, Insightful)
It doesn't sound like a metadata related problem to me. It sounds more like a furniture placement issue.
But seriously, de-selecting 'Mail' in the Spotlight pref pane, should stop spotlight from displaying results in its window, while retaining the full indexing facilities within Mail.app itself.
Parent
Re:Not just windows, Mac's too (Score:4, Insightful)
Parent
Re:Oblig. Nelson (Score:4, Funny)
Parent
Easy solution (Score:5, Insightful)
Don't fill out the metadata fields!
Re:Easy solution (Score:4, Insightful)
It has everything to do with human behavior and nothing to do with computer security. As it is, desktop search tools are opening up whole avenues to quickly find the secret smut on your desktop. Do you have a Google account AND search history enabled? Go to google.com and do a Search History and see what stuff you've been searching on that Google knows about. You shouldn't have done a search on "merkin".
Parent
I don't get it.. (Score:5, Interesting)
Re:I don't get it.. (Score:4, Insightful)
Like Big Bird says, remember to put your infants in the back seat, so the "safety" devices don't kill them.
Parent
Re:I don't get it.. (Score:5, Insightful)
Turning to the metadata: Having lots of metadata to search can be a very good thing. But, if used improperly (e.g., having the index not properly secured from outside access or malicious software) they can be a bad thing (read: security risk).
So, as the grandparent said: "Like Big Bird says, remember to put your infants in the back seat, so the "safety" devices don't kill them."
Parent
Re:I don't get it.. (Score:5, Insightful)
Otherwise, you'd be able to search for the meta data in the private files of other users.
Parent
Re:I don't get it.. (Score:3, Informative)
I did RTFA. The "problem" is you may deliberately send a file, eg a spreadsheet, but along with the file, Windows will have your indexing info, which may give away more than you want ("generic fuck off message", etc). Of course, this information comes courtesy of a company that has a "metadata cleaning" system they want to sell you. Everyone seems to be think
The problem is giving away metadata with the files (Score:4, Insightful)
For example, several years ago Microsoft reportedly [computerbytesman.com] posted its annual report as a Word document, which contained evidence that it was composed on a Macintosh.
That example is good for a chuckle (OK, maybe a belly laugh for us Mac fanboys), but suppose someone sent a document to a customer that showed it was filed in a folder named "Correspondence with Idiot Customers" without the sender realizing it...
Parent
Oh Great (Score:5, Insightful)
Surely not ? (Score:4, Funny)
Surely Microsoft aren't adding a feature to Windows without giving thorough consideration as to how the feature will work in a multi user, internet connected, environment ?
After all they've show time and time again how much they cae about these things
That reminds me... (Score:5, Funny)
Re:That reminds me... (Score:3, Informative)
In your colleagues case it sounds like he may have been able to prevent it, but that is not always so [abanet.org] with metadata that that vendor includes in your documents.
Hahaha, must have opened porn.... (Score:5, Insightful)
Re:Hahaha, must have opened porn.... (Score:3, Insightful)
Re:Hahaha, must have opened porn.... (Score:3, Insightful)
Not necessarily. Even in the healthiest of relationships one often becomes unreasonably annoyed with one's partner, and sometimes that annoyance gets vented to others. There's nothing wrong with (say) griping to a friend over IM that your GF is driving you up the wall because "she just won't fucking shut up about how her clothes don't fit right,
Re:Hahaha, must have opened porn.... (Score:5, Funny)
Parent
Stupidity 101 ? (Score:5, Insightful)
After 10 years of M$ Word disclosing secret information, you'd have guessed that "a removal tool" as mentioned in the article is obvious to anyone with half a brain as not good enough.
Storing the meta-data in a seperate file, or how about with the other metadata (i.e. with the inode) isn't so hard, is it? And it is quite obviously the right thing. There's even a big, red hint right there in your face: It's called meta-data. Might want to treat it different from the actual data, you know?
Re:Stupidity 101 ? (Score:5, Insightful)
Parent
Train those users (Score:5, Funny)
Parent
Re:Stupidity 101 ? (Score:3, Interesting)
Unix stores what little metadata it natively supports in the inode, not in the file data blocks.
Special files have nothing to do with metadata, but with the Unix philosophy of "everything is a file", which works great and allows you to reduce the number of necessary system calls considerably.
I know no director
This is a BETA, Right? (Score:5, Insightful)
The 2008 Toyota Prius (Score:4, Insightful)
Oh, sorry... I just figured that we're talking about products that are still a few years down the pipe that haven't been anywhere close to finalized yet.
I don't know about anybody else, but we not only don't evaluate software years before it's released, but we generally wait until the software has been out for at least a year before even looking at it. I don't know what the point is of reviewing a product this early. The only thing that I can figure out is that it's a way to get a few more pageviews.
"embarrassing"? (Score:4, Funny)
All Microsoft has to do (Score:3, Interesting)
is to make the metadata attatched to document files viewable only on the Vista installation it was created on. Perhaps it would be possible to have the operating system strip the data off the files that are being copied or moved to other network locations as a precursor to each respective process. In this case, they would also have to work some kind of functionality into the next iteration of Outlook, so that the problem could be stemmed from the email side of things.
What 3rd party vendors would do to accomodate this is anyone's guess.
Re:All Microsoft has to do (Score:4, Insightful)
This is just another example of disclosures from the past where change log information was left in documents released to public forums. Very interesting info disclosed in some of those word documents. Must be standard procedure now for lawyers to check the change log info on documents they are sent.
And if people don't fill out the meta data info the fancy new search capabilities won't be as useful so why have them?
Parent
Yawn, non-story (Score:5, Insightful)
How is this different than naming your file "Invoice for Asshole Larry.doc" and mailing it to the client? Simple solution: don't put potentially embarassing stuff in the metadata fields.
Do people really need an analysis to tell them this?
Word: "Properties" and Track Changes (Score:3, Insightful)
The more data a computer saves (especially if hidden from plain site), the greater the chance of embarrassment and unintended leakage of sensitive info.
Re:Word: "Properties" and Track Changes (Score:4, Informative)
Parent
Usefulness of metadata (Score:4, Insightful)
Having something like "post-it notes" that do not stick to the file, but instead are part of the directory entry for that file, might be more useful and safer. If someone sends me a file, I don't want that person's metadata to pollute my classification of files.
That's somewhat like what happens with e-mail - I receive plenty of mails that the sender marked as "high priority", but that are low priority to me. Metadata on the file should be objective; subjective information should be stored somewhere else and not be transmitted together with the file.
Re:Usefulness of metadata (Score:5, Funny)
In the interestation of securitization, the catalogation of the nation's datation should not be left to the ineptitudination of incompetentation corporatizations with a historicalization of not giving full thoughtfulination to securitization.
Parent
Company policy. (Score:5, Interesting)
But this will just be an extension to that policy to check for any meta data.
Re:Company policy. (Score:4, Informative)
The places you need to worry about metadata exposure are the document-aware "export" functionality, because rather than simply printing from primitives, these work with full knowledge of the document and it's structure.
Parent
Terms of Embarrassment (Score:3, Insightful)
Oh, you mean more embrassing than finding cookies and cached images from pr0n sites and the like? Unless you're considering self comments like "he's so hawt! I'd so tap that!" Not that you that most people's surfing already involuntarily discloses their personal data like a sieve.
I'd be less concerned about people appending credit card numbers and such to files, not embrassement.
Here is quick fix (Score:3, Insightful)
Yes, if they manage to apply rights based system system wide, something like OS X, it won't be problem.
I mean if they are stealing, steal it completely
Note I had to 'sudo ls -la' to see it even.
(os x 10.4 "tiger")
what planet are these people from? (Score:3, Insightful)
Kind of like Gnome has been doing for a few years now? How out of touch are these people???
Re:Windows Insecure??? (Score:5, Insightful)
Parent
Re:Windows Insecure??? (Score:3, Funny)
Microsoft (Office)
Adobe (PDFs)
If you can think of any other companies that keep turning up, you let me know.
Re:Windows Insecure??? (Score:4, Insightful)
I think you're seeing a conspiracy where none exists. If, for instance, AppleWorks suddenly overnight became the most popular word processor ever, and people were passing AppleWorks bills to the local senator over email... well, you'd have the same problem, because AppleWorks (and most, if not all, word processors) keep the same meta-data as Word and PDF does.
Parent
I'm shocked (Score:3, Funny)
I'm shocked, shocked to see Microsoft prioritizing features over security.
</Claude Rains>
Re:Windows Insecure??? (Score:4, Insightful)
according to a compilation by Workshare, a maker of software that strips metadata out of files.
You wouldn't think that they have some invested financial interest in getting the the public overreacted about the dangers of metadata
Am I being reverse paranoid?
Parent
Google desktop is a little scary... (Score:4, Insightful)
Of course, we don't have it on our main office machines, because they are running Slackware. Our machines that are locked into Windows for hardware interface reasons had to have Desktop removed from them after a couple of almost-incidents.
YMMV
Parent
Security by obscurity (Score:3, Informative)
Medicine is different, though. HIPPA basically requires that you use this kind of security (obscurity). Let me give you an example. If I have your (HIPPA protected) chart in the office on my desk, that's OK. If I leave it in the waiting room, it's not. Information does not have to be hidden from a determined (and illegal!) search, because, well, that's illegal, an
Re:Couldnt care less (Score:4, Funny)
Parent