Slashdot Log In
Nepomuk Brings Semantic Web To the Desktop, Instead
Posted by
timothy
on Tue Dec 16, 2008 11:20 AM
from the semantic-researchers-play-better-tag dept.
from the semantic-researchers-play-better-tag dept.
An anonymous reader writes "Technology Review has a story looking at Nepomuk — the semantic tool that is bundled with the latest version of KDE. It seems that some Semantic Web researchers believe the tool will prove a breakthrough for semantic technology. By encouraging people to add semantic meta-data to the information stored on their machines they hope it could succeed where other semantic tools have failed."
Related Stories
Submission: KDE's Semantic Tool Could Be A Breakthrough by Anonymous Coward
This discussion has been archived.
No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
Full
Abbreviated
Hidden
Loading... please wait.
Um, no thanks (Score:5, Funny)
I've tried Symantec products in the past, and they are worse than actually having a virus. They slow your PC to a crawl, get their claws into every part of your computer, and are extremely difficult to purge when you finally give up on them.
Re:Um, no thanks (Score:4, Funny)
Parent
Re: (Score:2)
Well, why not? Isn't that what WINE [vnunet.com] is for?...
Re: (Score:3, Insightful)
Didn't you get the memo [slashdot.org]?
That should read:
Care to explain? (Score:2, Interesting)
Re: (Score:2, Informative)
Re: (Score:3, Insightful)
It describes the ability to add metadata to web content (tags, etc), and you haven't heard of it because web 2.0 is the more popular term. ;)
Personally I think that metadata/tag based systems are the wrong road for semantic analysis of web pages. As soon as the semantics of a thing is decided by additional information added to describe that thing, its open to abuse.
The only advantage is its faster than what should be done, which is using good old maths to extract the true 'meaning' of a document or object.
Its not hard. Well, ok, its a little hard. Oh ok, its really rather difficult, but there are plenty of places you can get example code or libr
Re: (Score:2)
Semantic information is not more or less trustworthy than the document itself.
Well, no. Its possible using some admittedly complex math, to strip out all but the core meaning of a document. Its very hard to hide the meaningful content of a page from properly done semantic analysis. I know this because I've done that kind of thing before (deliberately vague mode active here, sorry).
It offers a useful perspective though. If the costs of storing data in a way that preserves more information are low, why not do it.
True, true, but if that additional data conveys less meaning then correct raw document analysis, it becomes potentially less useful. I would be against the cost of storing it, but then that's me.
I think tag
Re: (Score:3, Interesting)
It describes the ability to add metadata to web content (tags, etc), and you haven't heard of it because web 2.0 is the more popular term. ;)
Wrong and wrong. Sort of. 8^)
The Semantic Web is the term coined by Tim Berners Lee, describing the ability to associate data using inference (rather than explicit reference). In his conception, it relies on XML data formats and the ability to use common elements to translate between one and the other.
It's not a terribly easy concept to grok at first, but the basic premise is that in data transformation, you only need to know the two steps closest to you in order to translate (and process) data from numerou
Re:Care to explain? (Score:5, Informative)
I'm dubious
I have yet to see "semantic web" fully explained, but Wikipedia is giving some good insight [wikipedia.org] into it, especially into its nebulousness. It is supposed to make web (or in this case, desktop) documents machine-readable.
TFA deals not with the Semantic Web, but rather the "semantic desktop". As it says, "Semantic Web researchers believe the tool will prove a breakthrough for semantic technology. By encouraging people to add semantic meta-data to the information stored on their machines they hope it could succeed where other semantic tools have failed".
HTML had "semantic tools" built in - keywords.
<meta meta name="description" content="Auto Mechanics">
<meta name="keywords" content="auto, mechanincs, wrench, sex, penis, tits, clit, boobs">
You see how it was abused. Any more advanced semantic tools will be similarly abused.
There are other problems, as the wikipedia article explains:
Parent
Semantic Web Article in CACM (Score:3, Insightful)
Relational databases were in the same position in the late 60's/early 70's. We needed ways to combine and extract information automatically with a simple and expressive language. Relational database management systems, combined with SQL were the result of that, and they were a smashing succes
Re: (Score:3, Insightful)
If web content is readable and meaningful to me than it already has inherent meaning. Semantic tagging duplicates effort.
Semantic web is also about accessibility. Take a blind person for example surfing the web using a screen reader, do you have any idea how horrible his/her browsing experience would be like in the web today?
[Robotic voice] .. ....
Document Title - Slashdot | Nepomuk Brings Semantic
Document Body
Stories - Anchor link
Slash boxes - Anchor link
Comments - Anchor link
Search
Form field - Text - Query
Submit button - Search
New for nerds, stuff that matters
Hello eihab! - Link
Help & Preferences - Link
Click on a different page, and there you go listening to the same headers _again_. It can get very frustrating.
Without semantics there's no easy way for a screen reader (or other accessibility enabling devices) to successfully translate a document to something intelligible and usable.
You can see how a pho
Re: (Score:2, Insightful)
"Semantics" is information about meaning (whereas syntax is information about form). Semantic tools try to provide meaning by describing relationships between information atoms. The goal is to create systems which can answer questions like "how old is the president's oldest child?" with just the age, instead of listing all documents which contain the words "old" "president" "oldest" and "child".
Re:Care to explain? (Score:5, Interesting)
The Semantic Web is a failed attempt to extend the WWW via "semantic markup", which allows users/editors/etc to tag content (text, images, data) using a standard format that can be read, processed and exchanged by machines which can then give users more useful pointers to stuff that they care about.
The Semantic Web has failed for a bunch of reasons, with many people tending to blame the tools. However, those of us of a particular epistemological bent believe that it is doomed in principle as current conceived because "meaning" is a verb, not an adjective.
"These data mean X" is completely incoherent on this view of meaning, like saying "This smell of orange blossoms has Republican leanings." "Meaning" is simply not an attribute of data, any more than political tendencies are an attribute of scents.
The Semantic Web fails to capture almost everything about the entities that do the meaning (people) but instead is based on the belief that meaning is a property of data. Data inspires meaning, but meaning is something that humans do, and the Semantic Web has no effective mechanism for capturing this, although with sufficient markup by many individuals on the same data it should be possible to do something similar to ROC evaluation of the ways people mean, which would greatly enhance the utility of the Semantic Web.
A colleague who works in GIS pointed out an consequence of this phenomena to me many years ago when he described an experiment involving a bunch of geologists mapping a particular terrain. At the end of the day, after integrating all their inputs, he could tell who mapped where, but not what anybody mapped.
Parent
Re:Care to explain? (Score:5, Insightful)
I've got a better reason why it failed that doesn't require delving into first year philosophy.
People are lazy. Look at any image database and figure out why it's difficult to find something. Because people don't want to spend 20 minutes filling in tags for a single image they just want to show off to their friends.
Now expand that to every other form of data type, and its easy to see why the semantic web never did, and never will take off without significant AI involvement.
Parent
Re: (Score:2)
Not only that, but also, businesses are control freaks and do not like their websites to be easily processed by machines. They could just open their databases as well.
Re: (Score:3, Informative)
And even when they do fill in the tags, they're sloppy about it. Things get misspelled and mislabeled all the time. Most people are very inconsistent about labeling even when they're trying their best to do an honest, thorough job. Okay, let me tag this photo "wife", because has my wife
That's the idea. (Score:5, Informative)
> ... the semantic web never did, and never will take off without significant AI involvement.
I understand that the point of Nepomuk is to allow for automated tagging by the standard tools of the KDE desktop. For instance, say you receive a picture from an IM contact who KDE also knows (through the address book framework, Akonadi) lives in Europe.
Then Nepomuk would allow you to make search queries as "Bring up all the pictures that people living in Europe sent me last week". Well, that's the theoretical goal anyway; we will see if they ever get there.
There's one nifty application already: you can create a Folder View plasmoid on your desktop, and instead of making it display ~/Desktop/ as usual, you can make it display the result of a query through the Nepomuk KIO slave. See here [osnews.com] how it works.
Parent
Re: (Score:2)
Man, I want to get my hand on some of those Orange Blossoms you got! Two and then pass it man, don't hog it.
Re: (Score:2, Interesting)
Explained better than I could ever do: http://www.well.com/~doctorow/metacrap.htm [well.com]
Re: (Score:3, Interesting)
I disagree. First of all, the semantic web is just about allowing content creators to associate context with their content [blogspot.com] to facilitate a context sensitive search. The semantic web has lackluster adoption because google does a great job at context sensitive search without the context providing meta-data markup.
A more limited version of semantic web has achieved some notable traction. Microformats [blogspot.com] are another way of associating context with content that is more agreeable with content providers.
A more c
You got that exactly backwards (Score:5, Interesting)
You got that exactly backwards.
The WWW was an earlier doomed attempt at semantic markup, and up until the summer of '93 or so it looked like it might work. That's when the early rants about people using the tags to control layout instead of too convey meta information (e.g. using em to get italics in a bibliography, dt/dd to make roman numeral lists, etc.) started--or at least when I first became aware of them. In fact, pretty much the entire history of HTML has been a tension between the language's designers and purist, who want users to care about what markup means, even if it does nothing, and the vast majority of users who only care about what it does regardless of the "meaning" that may be ascribed to it. Once you can get your head around both perspectives some of the goofier things in the whole tawdry history (the Table Wars, XML, CSS) make a lot more sense.
Ok, a little more sense. But only if you already knew what people are like.
--MarkusQ
Parent
Re: (Score:3, Insightful)
Actually, I'd say it's too early to say that the Semantic Web has failed. What has clearly failed for now is the vision for how the technology was to be used.
For one thing, it turned out that really, really clever textual matching is a lot more powerful than anybody thought possible. Twenty years or so ago, you'd have thought that you'd need to have some kind of sophisticated metadata to do the kinds of stuff we take for granted in Google today. I turns out that a technology that turns a needle in a hay
Re: (Score:2)
Basically, think of the tags that are at the bottom of slashdot articles. You can tag them with things like PATRIOT ACT, or EFF, or whatever, and in theory, its going to help you search slashdot and get more relavant articles in your search. Now, when you add that capability to the unwashed internet masses, you see things like story tags with "No" or "itsatrap" or whatever crap people think is funny, which is funny, but ruins its main purpose of helping people find information. Multiply that by the numbe
As a KDE 4 user... (Score:5, Informative)
The other problem at the moment is a lack of ability to query your semantic data. Can I get anything to show all photos with my wife in them that I've rated four or above? Not at the moment. Hopefully this is coming in KDE 4.2, but as it stands at the moment it makes Nepomuk a case of write-only memory.
So, maybe something to get excited about in the future, but not quite yet.
Re: (Score:2)
I still don't get the case for semantic-web, -desktop, -anything.
I think one of the reasons people are slow to adopt it is because people don't know what the heck it is, or can't imagine how it could benefit them in any way.
Re: (Score:2)
So currently it is just Yet another tagging system .. .that has the ability of sharing the tags with your friends ... the data on my computer is private... that's why it is on my computer and not on my Facebook/MySpace/Bebo etc site....
Another poorly implemented, mis-aimed application by I researchers ...!?
Re: (Score:2)
Can I get anything to show all photos with my wife in them that I've rated four or above?
Another problem is that you have to have it tagged, if you don't tag all the images with a given person in it, then it's not going to show up in a search.
I think that's also an issue with Semantic web too, not only do items have to be tagged, the tags have to be accurate and trustworthy. It sounds like a nice system that can be undone by a bunch of tag spamming bozos. It will work better under a controlled environment.
Re: (Score:2)
Just use digiKam for photos. You can search photos by all metadata what is included to photos. Even that digiKam uses SQlite, you can sync metadata between database and photos.
In future, there should come a API to sync database with nepomuk. But currently not planned because nepomuk is not so well implented for KDE4 or the KDE4 at all. Just like all can notice that you cant search files any other way easily than using the kickoff or lancelot menu search bar.
Horrible name. (Score:3, Insightful)
NepoMUCK? Anything ending in "MUCK" doesn't sound like a good product. The concept is very interesting but the name isn't the best I've seen.
I'm glad that they don't prefix everything with K though.
Yes, I know that Nepomuk means "Networked Environment for Personalized, Ontology-based Management of Unified Knowledge" as stated in the article.
Re:Horrible name. (Score:4, Funny)
Agreed. They ought to call it NepoGIMP. Now that's a name.
Parent
On the brighter side... (Score:5, Funny)
It's not as bad as GIMP :)
Parent
Re:Horrible name. (Score:5, Funny)
Yes, I know that Nepomuk means "Networked Environment for Personalized, Ontology-based Management of Unified Knowledge" as stated in the article.
I assumed it was KumOpen (come open) backwards. I think the real acronym is even stupider than that.
Parent
Re:Horrible name. (Score:4, Insightful)
Yep, that was my first thought as well. Quickly followed by wondering if 'into a collaboration environment which supports both the personal information management and the sharing and exchange across social and organizational relations' was some kind of euphemism for, eh, group pr0n of some kind.
Oh, well, either they have much less dirty minds than mine, or someone's desire for well-indexed pr0n browsing has gotten slightly out of hand.
Parent
Re: (Score:2)
The name MUD didn't seem to bother users and MUDs were (maybe still are?) quite popular. I don't think your concerns hold up.
Re: (Score:2)
Neopmuk was the half-dragon in "Jim Button and Luke the Engine Driver". His mother was a Hippopotamus. Pleasant enough character and all, but I hope the code looks more like the excellent book and less like a half-dragon, half-hippopotamus hybrid. Yuck!
Cluster analysis (Score:2)
Typical flameware (Score:2)
I doubt it will catch on... (Score:3, Insightful)
And I'll tell you why.
The Nepomuk Web site wants to make me chew my own arm off. Now, I'm familar with the Semantic Web, I'm excited by the idea of semantic organisation. But this site is the epitome of grim, lifeless European research-ese. It completely fails to convey the technological approach, how it works, or why you should give a damn. I get the impression that the team was more interested in the EC funding then actually developing a disruptive technology.
Why why can't researchers spend 15 minutes thinking about how to convey the importance and excitement of what they are trying to do in terms of practical examples.
I'm afraid you'll probably have to wait until some enterprising 3rd party to grab the source and build some of the technology into a different product.
Re: (Score:2, Informative)
Re: (Score:2, Informative)
The Nepomuk Web site wants to make me chew my own arm off.
ha, good one.
Why why can't researchers spend 15 minutes thinking about how to convey the importance and excitement of what they are trying to do in terms of practical examples.
There are some, but they are not very elegant:
http://dev.nepomuk.semanticdesktop.org/wiki/UsingNepomuk [semanticdesktop.org]
http://dev.nepomuk.semanticdesktop.org/wiki/UsingDropBox [semanticdesktop.org]
Or check out the KDE stuff:
http://nepomuk.kde.org/discover/user [kde.org]
also in cute little moving pictures:
http://www.youtube.com/watch?v=_8oavLQeAjM [youtube.com]
Shmemantic (Score:2)
Semantic shmemantic. It's so 2008, let it go.
Let's have a new buzzword for 2009. I nominate "emotional".
Redundant (Score:3, Insightful)
Metacrap rant from Corey Doctorow (Score:3, Interesting)
There's a good rant from Corey Doctorow about this [well.com]. I think the best phrase that summaries people's high hopes for the semantic web is "nerd hubris".
This indexing fad should curl up and die (Score:2, Insightful)
Re: (Score:2)
Re:This indexing fad should curl up and die (Score:4, Interesting)
"Everybody and his uncle tries to make systems that will index every piece of crap on your PC and it invariably results in a useless and horrible waste of resources."
On the contrary, we should seriously be asking ourselves *why*, when all our data is sitting there on our PCs, we've let ourselves get into such a state of disorganisation at the operating system level that a class of program called 'indexer' exists as a third-party tool in the first place.
How come it's not already taken as given that the primary thing an operating system *does* is, you know, *know where all its data is*?
It's as if we're living in an age before 'directories' were invented - or before databases had 'indexes' and 'queries' - and we have to manually write down and key in raw sector numbers every time we open a file. And we're okay with that, because we think - and teach - that that's 'just how computers work'. We've accepted that there's a whole class of things our computers can't do 'because there's no application to do that'.
Something is wrong with this picture.
Parent
Oops. Apparently it directs back to the ads. (Score:2)
Sorry about that.
Re: (Score:2, Funny)
Easy: Just have a bot add "untagged" tags to everything not yet tagged. Then it's tagged, because it's tagged "untagged".
Also easy: Just remove all wrong information before making your deduction. OK, so how is the computer to know what is wrong? Well, that's of course again semantic information, so just tag anything wrong as "wrong". If some "wrong" tagging happens to be wrong, you can still tag that as "wrong"
Re: (Score:2)
Even if one is using a proper reference frame (not a rotating one), there is still an outward force in the system, namely the reactionary force to the centripetal force. Said reactionary force could legitimately be called a centrifugal force, but it is a force applied to the central object, not on the outer object, which distinguishes it from what people usually mean when they say centrifugal force.
But this is really quite off-topic.