New Web Application Attack - Insecure Indexing 120
An anonymous reader writes "Take a look at 'The Insecure Indexing Vulnerability - Attacks Against Local Search Engines'
by Amit Klein. This is a new article about 'insecure indexing.' It's a good read -- shows you how to find 'invisible files' on a web server and moreover, how to see contents of files you'd usually get a 401/403 response for, using a locally installed search engine that indexes files (not URLs)."
but its fixed in firefox now (Score:2, Funny)
Re:but its fixed in firefox now (Score:3, Insightful)
that's all nice and good, personally I think files that were never meant to be indexed make for the best reading by far !
Speaking of firefox (Score:5, Interesting)
Re:Speaking of firefox (Score:2)
Another exploit can out this weekend.
I don't think it is so new - it is fixed by 1.0.1. From the description [www.mikx.de]:
Re:Speaking of firefox (Score:1)
Now what's funny about that? Should it always be the other way round? Yeah I know this is against the "majority mindset" as someone just said. I don't care
should have been from.... (Score:5, Funny)
Re:should have been from.... (Score:2, Insightful)
Of course in those days people actually built their sites using static HTML...
Re:this is'nt new (Score:1, Offtopic)
Re:this is'nt new (Score:1, Offtopic)
Re:this is'nt new (Score:1, Offtopic)
Dammit! A perfect Grammar Nazi!
Re:this is'nt new (Score:2)
Re:this is'nt new (Score:2)
Same here, that rule drives me absolutely batty... In my opinion, if I put the period in the quotes, I effectively tell the parser (aka "person reading my text" in the case of normal English communication) that I attribute the period to the source of the quote, while simultaneously leaving my own sentence un-terminated.
However, I realized that you just nee
Re:this is'nt new (Score:2)
Re:this is'nt new (Score:2)
Um, it IS new, because
Besides, in a few hours it will be new all over again when they post the dupe.
You can see evidence of that
and don't forget... (Score:5, Interesting)
Re:and don't forget... (Score:1)
User-agent: *
Disallow: /
Re:and don't forget... (Score:1)
Re:and don't forget... (Score:2)
Incidentally, it also breaks properly-designed retrieval mechanisms (like, say, RSS readers -- yes, dailykos.com, I'm talking about you!)
Re:and don't forget... (Score:1)
One of my friend does some genealogy research. He decided to put all his data online. Even before all the files were uploaded his site took a massive load. He thought his site was really popular, but after looking at his logs I could tell him that most of the trafic was done by searchengine robots and his ISP might not be so happy for this massive trafic.
Re:and don't forget... (Score:3, Insightful)
if they break, how can they be properly designed ?
Re:and don't forget... (Score:2)
Re:and don't forget... (Score:2)
Re:and don't forget... (Score:2)
If anything, I'd block googlebot/others in
indexing google (Score:5, Interesting)
Re:indexing google (Score:2)
Re:indexing google (Score:3, Interesting)
DVD/CD servers...
Re:indexing google (Score:2, Informative)
This is an execellent trick for searching for porn (ie "index of
Google Hacks Database (Score:5, Informative)
Re:Google Hacks Database (Score:2)
Uh-huh. I imagine most of his readers are using them to make sure everyone else's site is secure : )
Re:Google Hacks Database (Score:1)
permissions permissions permissions (Score:5, Insightful)
Re:permissions permissions permissions (Score:4, Insightful)
Re:permissions permissions permissions (Score:1)
Re:permissions permissions permissions (Score:1, Insightful)
Amit Klein at least used to work for Watchfire formerly known as Scrotum (Sanctum), and the same company who tried to patent the application security assessment process. I guess it's been a rea
Interesting. Brief summary. (Score:5, Insightful)
Does anyone know if the Google search applicance is affected by this?
- Cary
--Fairfax Underground [fairfaxunderground.com]: Where Fairax County comes out to play
Re:Interesting. Brief summary. (Score:5, Insightful)
Re:Interesting. Brief summary. (Score:5, Interesting)
Re:Interesting. Brief summary. (Score:5, Interesting)
EnterFind appliance [enterfind.com] (which I participated in developing) has this (still unique) feature and their clients were amazed by what the crawler can dig out. Especially in those "hidden" fields in the Office documents.
Re:Interesting. Brief summary. (Score:5, Informative)
No. First of all, the Google Search Appliance crawls over http, and therefore obeys any
Re:Interesting. Brief summary. (Score:4, Insightful)
That way an anonymous user would see only results for documents that have read permissions for anonymous while a logged-in user would see results for anything they had permissions to.
Of course this idea works fine for a special purpose database-backed web site but takes a bit more work on just your average web site.
Crawling the site via localhost:80 is the most secure method for a normal site. This would index only documents available to the anonymous user already and would ignore any unlinked documents as well.
Re:Interesting. Brief summary. (Score:2)
A smarter way to do such a thing would be to "crawl" the whole site on localhost:80 instead of just indexing files, that way .htaccess and the such would be preserved throughout.
That would not help much. Most sites have different content depending on the IP address accessing the content, i.e. internal IP:s get content that external IP:s cannot access. Crawling on localhost:80 would remove the non-linked files, but still gives the search engine access to a lot of content that should not be indexed.
The o
News at 11! (Score:3, Insightful)
Re:News at 11! (Score:1, Insightful)
Re:News at 11! (Score:1)
Re:News at 11! (Score:2)
sounds like fun (Score:2, Funny)
I get the idea this might take a while.
Re:sounds like fun (Score:2, Funny)
(Sorry for the reply to self. It's like my own little dupe.)
Re:sounds like fun (Score:2)
Well, just record the guessed words, you might stumble on Hamlet.
Re:Does he really mean this (Score:2)
And, frankly, there aren't _that_ many words in English. Even assuming that your server is really slow and can return as little as 10 searches per second (via more than one thread, if needed), we're talking less than an hour for this script to do its job.
And woe if someone decides to use an army of zombies to
Vs. Database-Driven Sites? (Score:3, Insightful)
Then again, it's about being organized, isn't it? A check of what should and shouldn't be allowed to go public, some sort of flag where even if it shows up in the result, it better not make its way onto the HTML being sent back. (I figure that's more DB-centric though)
Last madman rant -- Don't put anything up there that shouldn't be for public consumption to begin with!!! If you're the kind to leave private XLS, DOC, MDB, and other sensitive data on a PUBLIC server thinking it's safe just because nobody can "see" it, to put it delicately, you're an idiot.
Re:Vs. Database-Driven Sites? (Score:2, Insightful)
yes and no (Score:2)
The search index problem is similar, but not quite.
sort of like googling for visa cards (Score:1)
Isn't this (Score:2)
Either that, or it's a user error (configuration).
that gives me an idea... (Score:1)
search indexer = magic (Score:1)
Nothing new here.
obvious? (Score:5, Insightful)
I mean, I understand its a little more complex as described in the article- but i would hardly call this a 'new web application attack', at best perhaps one of those humorous advisories where the author overstates things and creates much ado about nothing- or at least thats my take;
-1 not profound
does this mean more PRON? (Score:2, Funny)
P2P (Score:5, Interesting)
Nothing really new here..
Re:P2P (Score:2)
Search for the right extension and you're likely to find MSN Messenger logs from people who have shared out all of "My Documents" without thinking!
Re:P2P (Score:2)
One interesing thing to note is that the site spammers are onto these things already. The photo one now pulls in lots of sample advert images for adult sites, as did a couple of the older searches that are linked on the site the article refers to.
Uh huh.... (Score:1)
Re:Uh huh.... (Score:2)
Search foo. You get:
Then search just the above. You get:
Repeat...
Doesn't sound too hard?
Of course the length is limited but that can be solved by "moving frame." Say, putting the above, the engine says your query is too long.
Search: "anti-gravity engine tha
Re:Uh huh.... (Score:2, Informative)
Yep. Did you keep reading it? I'm referring to the methods for when no excerpts are given.
RTFM (Score:5, Informative)
I can't recall the last time this kind of attack wasn't mentioned in the documentation for the product, along with instructions on how to disable it. If you choose to ignore the product documentation, you get what you deserve.
It's quite simple folks. Don't open the search engine. ACL query connections. Sanitize queries like you (should?) do other CGI applications. Authenticate queries and results. If you can't be bothered, hire someone who can.
RTFA (Score:2)
A method I see for that would be running the indexing by piping it through httpd, make even local indexing go the same way remote indexin
Re:RTFA (Score:1)
You are inferring that search engines should only index public information, essentially crippling their usefulness. Glad you don
Assumptions (Score:2)
Im pretty sure the indexing server on Windows won't return 'search results' for files you dont have permissions to list. As with any other sensible indexing schemes, except perhaps the newer silly 'desktop search' tools. Seems pretty obvious to me.
Re:Assumptions (Score:3, Informative)
The problem and vulnerablity lies in definition of "you".
The indexing program runs on privledges of a local user with direct access to the harddrive. Listing directory contents, reading user-readable files. "you" are the user, like one behind the console, maybe without access to sensitive system files, but with access to mostly everything in the htroot tree the administrator hasn't blocke
Re:Assumptions (Score:2)
ie, the indexing service checks the permssion of the requesting user, and only lists files they would be able to list in the OS. Its only common sense.
Re:Assumptions (Score:1)
application in porn (Score:1, Funny)
New option for robots.txt (Score:5, Funny)
Re:New option for robots.txt (Score:1)
Please put this new undocumented tag on your robots.txt file: "hackthis=false" "xss=false" "scriptkiddies=log,drop" And all your problems will be solved.
Note to mods: *slap*
how to solve (Score:1)
1) write your own web applications
2) Use lucene
3) only index what you want to index
4) ????
5) profit
This is old. (Score:4, Insightful)
Whoever posted this as a "new" item, is behind the times.
OWASP covers it! [owasp.org]
Lets not rehash old things!
Re:This is old. (Score:2)
All it takes to implement this is an "access level" field stored with each index entry, and assigning an "access level" session value to each visitor (defaulting to 0 for anonymous visitors).
Plus, this way you'll avoid pissing off visitors who click on essentially broken links in their search results.
No wonder the search capabilities of
Why bother with phisching scams... (Score:4, Interesting)
Re:Why bother with phisching scams... (Score:2)
solution (Score:3, Insightful)
There are variations and contingencies, but the bottom line is, even if someone cracked into the location for an xml metadata file, its not the data itself and while it may reveal a few things about the page or file it relates to, certainly is bottom line much less of a risk than full access to other file types on the server.
heres another tip for free. because you now have metadata in RDF, with a few more lines of code you can output it as RSS.
Why worry about this? (Score:1, Insightful)
Anything else goes on a pocket network or not at all.
The only exception would be an order form, and that will be very narrowly designed to do exactly one thing securely.
Should search be addressed at file system level (Score:1)
chmod -i file
With the search vendors racing to own desktop search and microsoft working on WinFS, is "indexability" now an important security attribute for a file?
Re:Should search be addressed at file system level (Score:1)
--
Help me help you get a free mini Mac [freeminimacs.com].
To those who put sensitive documents on web server (Score:1)
Once upon a time, intelligent people were responsible for computers and IT.
Now, it's either a manager, or a bunch of kids ("web developers") who don't know what they are playing with.
Of course there are plenty of exploits waiting to be discovered that WILL get those documents off your web server.. UNLESS you are smart enough to keep them elsewhere.
I realize this is a flamebait as good as they get - but please understand that I will just duck. It
Re:Mozilla Firefox fucking sucks (Score:2, Insightful)
Don't worry, we will give you a full refund.