Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Slashdot Log In

Log In

Create Account  |  Retrieve Password

Google Pushes To Open Public Records

Posted by kdawson on Mon Apr 30, 2007 08:20 PM
from the whole-lotta-redacting-going-on dept.
AlHunt sends us an AP story on Google's push to help states open up their data to online searchers. Google is going about this in an evenhanded way, according to the story, and the results of its labors — initially in Arizona, California, Utah, and Virginia — will be available to all search engines, not just theirs. The move is being hailed by groups such as OpenTheGovernment.org, but the Electronic Privacy Information Center expressed concerns, given what they call Google's "checkered past" with regard to privacy on the Internet.
+ -
story
This discussion has been archived. No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More
Loading... please wait.
  • by Richard McBeef (1092673) on Monday April 30 2007, @08:24PM (#18935875)
    Now not only are my stupid usenet posts from the early 1990s going to be available, so are my other "youthful indescretions". Great.
  • So when... (Score:5, Funny)

    by Short Circuit (52384) * <mikemol@gmail.com> on Monday April 30 2007, @08:25PM (#18935887) Homepage Journal
    So when are Google, the Library of Congress and the CIA going to combine and be simply known as the CIC?

    (Literary reference. Hope I didn't get first post.)
  • Privacy (Score:5, Insightful)

    by NaCh0 (6124) on Monday April 30 2007, @08:26PM (#18935897)
    If privacy advocates are concerned about public records becoming more easily accessible, they should get laws passed that limit the collection of such data by the government. It seems like Google gets the criticism because their search engine is too good at doing what it is designed to do.
    • NO MORE LAWS!

      Let me repeat that and see if I can get around the "we'll tell you how to say what you want to say" filter.

      NO MORE LAWS!

      As if we don't have enough of the useless things already. All they do is cause problems and criminalize the very things they were meant to protect.

      NO MORE LAWS!

      It's time to start equilibrating the government. The Federal Government (especially) has done nothing but expand, and expand, and expand, for 200 years. It's time for them to retract, and shrink, and be pruned back u
      • Re:Privacy (Score:4, Funny)

        by QuantumG (50515) <qg@biodome.org> on Monday April 30 2007, @08:39PM (#18936003) Homepage Journal
        Maybe we should make a law that there can be no more laws.

        Welcome to human nature.

        • Maybe we should make a law that there can be no more laws.

          You joke, but I could think of a good law that do almost that. How about a law that states that the number of words that can be used to create laws is now fixed at its current levels. So, pretend that you want to pass a law with 10,000 words in it. That would mean that you would need to either remove a law, or reword a current law such that you free up 10,000 words.

          What would be the result? Well, I bet you would find government pork would drop li
          • by lilomar (1072448) <lilomar2525@gmail.com> on Monday April 30 2007, @10:01PM (#18936589) Homepage
            Something like this [downsizedc.org]?
          • Great, then you just need a law that defines what words have what meanings and make it cover every possible use of that word. Essentially you end up with a new language just for writing laws in. I suggest we call this new language "Newspeak".

          • Then you end up with vauge laws being misapplied.

            My solution would be to make all new laws have an expirey date of no more than 5 years at which point it will need to be re-voted upon. Think that would clog the system? Fuck yes it would, until they start saying "no" to renewing frivilous or outdated laws.

            All existing laws should then expire in 20 years unless renewed under the new system. That way we dont have legal anarchy, but we do weed out the old ones quick enough.
      • Re: (Score:3, Informative)

        There are many laws that dictate the length of public records. Repealing those laws and correcting existing laws will reduce complexity and overall reduce cost over time.

        One of the parts of a real solution is something like 'cvs blame' for every single word in every single law passed. Want to know who added every single phrase. Yes, even punctuation, grammar, spelling, and capitalization changes should be tracked, after all, "I helped my Uncle Jack off a horse" is distinctly different than "I helped my uncl
        • You have hit upon the biggest problem. When our law enforcement services try to uphold the laws, they are often strapped with doing so in the manner in which others interpret those laws, and that is a problem. Laws are seldom repealed or revised, so the true intent of laws gets lost. When that happens, rather than review/amend we often just implement new laws and leave the old ones to die silently on the books.

          There are many crazy laws on the books that were intended to curb particular behaviors that no lon
      • If you hate how much the federal government has expanded, why are you opposed to putting restrictions on how they can expand? It would seem that the best way to oppose "big government" is to pass laws or amendments that restrict the government's powers.

        Laws are not inherently bad. A well written piece of legislation can unambiguously take powers away from the government.
      • All they do is cause problems and criminalize the very things they were meant to protect.

        Yeah, like that dumbass law against killing. All it does is make it illegal to stay alive!
        • That's a decent enough idea, as long as we allow that the mother document (eg. The Constitution for the Fed, the various Constitutions for the State governments) is allowed to persist indefinitely. With any luck we could even have a lifespan on the various amendments so that each new generation could reaffirm its approval of modifications to the mother document. No more being sold into slavery by our great-grandfathers whom we never had a chance to meet.

          It passes a logic test, as well: one "lifetime" sho
    • Re:Privacy (Score:5, Insightful)

      by Holmwood (899130) on Monday April 30 2007, @08:43PM (#18936025)
      The above poster has it exactly right. I'll amplify. We shouldn't be worrying about governments redacting personal information, or even it being accessible via search engines; we should be worried about them collecting it in the first place.

      Sure, the IRS needs to know your income, and the DMV should know whether or not you have 10 recent speeding tickets.

      But I find the number of pieces of information that State, Federal, state-funded bodies, and legislative mandates (e.g. corporate information gathering and disclosure pursuant to governmental affirmative action directives) require from you seems to be going up and up.

      This is rather disturbing.

      Redacting, as the article suggests, is merely a half step. Setting a sunset on how long most information about you is available is a full step, and not collecting the information in the first place is better yet.
      • I'll amplify.
        [..]
        This is rather disturbing.
        I don't disagree, but rather than just "amplifying", how about expanding a little.

        Why is it disturbing to you?
        • Re:Privacy (Score:5, Interesting)

          by Holmwood (899130) on Monday April 30 2007, @08:56PM (#18936145)
          Fair question. I like privacy.

          I also don't like the idea of some bureaucracy's picture of me defining me, especially if it's distorted.

          I lean, slightly, to a libertarian perspective. Your mileage may vary; fair enough.

          I really don't like the idea in our hyper-sensitive culture of some one (say) being able to look up (and granted, not all of these can be looked up -- at present) my ethnicity, my voting history, or every letter/report/form I've had to file with the government, whether or not I belonged to a gay/straight alliance in high school, or a Christian fellowship club in university. Or whether I asked for the Kosher or the Halal meal on my last airline flight.

          These, frankly, are no one's business but my own, my family's and close personal friends.

          I see data-mining as an expanding source of derivative information about people, to a disturbing degree.

          There certainly are legitimate things (in my personal view) for people to know about. Does someone have a criminal record? Are they a sexual predator? Child molester? Have they been disbarred? What is their credit history (if a lender).

          But I don't see increasing governmental information -- even if its universally accessible -- on us all as a uniform positive.

          Let me now turn the question back on you. Do you? If so, can you please elucidate?
          • I'm more worried about the use to which governments have historically put such information. Namely, people in power use their power to gather and collate information to maintain their power. First it is anyone who is a "threat" to their power is placed under greater scrutiny. Then it is anyone who is "opposed" to the current government. Then it is anyone who is at all "interested" in government. Then it is everyone. China is currently at the opposed stage and is quickly moving on to the interested sta
            • Re: (Score:2, Interesting)

              As they said in the 60's, 'the personal is the political'. I don't think our two concerns are infinitely far apart.

              I admit, my mindset in responding to your question was shaped by TFA -- namely, wide access to lots of information gathered on you.

              Nope, I don't like the uses governments have historically found for such information.

              I would again point out something that you haven't addressed -- perhaps because you took it as read -- the combination of search engines and datamining seems to raise the stakes. Be
      • The Knotty problem becomes one of interconnections. Sure department X needs to know fact A and department Y needs to know fact B, and sure we have laws blocking X from collecting B and Y from collecting A but when all such records are public and publicly searchable then the point is rendered moot, then X can know B and Y can know A and the stalker down the street can know both.

        This is, in fact, just what the late and unlamented Total Information Awareness project was all about and what private companies su
      • ...is universality. Sort of the same reason some Democrats were pushing for a military draft without as many exemptions--if it applies to everyone, a lot fewer people are willing to go down that road. If everyone's information is available, with no exceptions for being a Senator or CEO of a fortune 500 company or a famous actor or famous conservative talk-show host, then enough important (i.e. rich) people will be opposed to scuttle it and inadvertently protect the privacy of us little people. But if yo
    • If privacy advocates are concerned about public records becoming more easily accessible, they should get laws passed that limit the collection of such data by the government.

      I think that would be a great step (even though it would probably cost billions to implement).

      That said, it still doesn't do anything about the decades of State & Federal records that are full of SSNs and other 'private' information... which would also cost billions to do something about.

  • Where our government claims copyright on court cases and findings and other public documents. If you want a document, you order it from Go Print [qld.gov.au]. There's libraries in our court houses, like most the rest of the civilized world, but if you go in there in a pair of jeans the librarian will come over and ask if they can "help" and then ask you if you are a law student, and then ask you if you are a lawyer, and then ask you to leave.

    Thankfully you can still read the laws without paying the government for a copy of them.
    • Re: (Score:2, Informative)

      Gee, or you can grab them off the web for free:

      AustLii [austlii.edu.au]

      Still isn't complete, but they're adding past cases to it very regularly.

      I suspect that this is true in the rest of the western world, but legal systems were hardly ever going to be the first to embrace technology.

      Note they're robots.txt (this is just the most important snippet):

      # 14 August 2003 - unrestricted access to everything except cases
      User-agent: *
      Disallow: /au/cases/
      Disallow: /au/other/HCATrans
      Disallow: /au/other/hca/
      Disallow: /nz/cases/
      Disallow
      • Australia is the only nation in the world that claims copyright over documents produced by the government.

        Everywhere else has the sense to recognise that works produced by the government are automatically in the public domain.

  • God no! (Score:4, Insightful)

    by nog_lorp (896553) * on Monday April 30 2007, @08:45PM (#18936065)
    Dear god, now anyone will be able to read public records. What is the world coming to?
    • Re:God no! (Score:5, Insightful)

      by tymbow (725036) on Monday April 30 2007, @09:39PM (#18936457)

      I think you missed the privacy problem of public records - the issue is not whether such records are public or not but why they were made public. It was never intended that public records would be harvested by information brokers and marketers and data mined but that it exactly what will happen once easy access is provided to such data.

      I don't mind (most) requirements for public records being public but what I do mind is when that data is then used for purposes other than for which it was intended. This is where we need privacy laws. I have no problem for example with having my name, address and phone number in the phone book for public use but I do have a problem when this information is abused by using it in ways that were not intended.

  • by 140Mandak262Jamuna (970587) on Monday April 30 2007, @08:50PM (#18936095) Journal
    New tags to search for, like Mother's maiden name, social security number, schools attended and the name of the first pet, of the first car. A Google spokesman said, "You dont have to click on the phishes any more, we provide all they need ourselves!"
    • If information shouldn't be public, lobby your congresscritters (at the state level, mostly) to get it out of public records. Relying on the "security" of having something be inconvenient to access is a false security at best; all it does right now is limit access to this information to those willing to pay (not very much $$) to get at it. It's nothing but security through obscurity -- and while there are cases where that can be useful, giving folks a perception of privacy while they actually enjoy no such
  • by Wordplay (54438) <geo@snarksoft.com> on Monday April 30 2007, @08:57PM (#18936155)
    We've always maintained this weird security-through-obscurity dichotomy with public records. Technically the information is available to everyone by law, but it's such a pain to get it that nobody bothers

    This has given people a false sense of security when it comes to government data collation. I don't think most people realize just how much public information this out there that anyone with a few bucks and who knows who to ask can see it. On the flip side, it means there's almost no public benefit from the government keeping the information because it can't be easily collated by a private citizen.

    This is the best thing that could happen--let's dump it all out on the net and make it easy to see someone's entire public record. Let's go for complete transparency and let public information really be public information. If the government really is overreaching, the outrage should be enough to throttle them back. And maybe they aren't; maybe this really is in the public interest. Now we can find out. Either way, it's going to force a resolution.

    On another positive side note, this'll also gut the cottage ripoff industry that's grown around public records research. You shouldn't have to pay some PI wannabe $$ to walk across the street and meet his records-room friend at the Capitol.
    • Technically the information is available to everyone by law, but it's such a pain to get it that nobody bothers.

      The only people that bother are exactly the people that probably shouldn't have access to it.
      • Re: (Score:2, Insightful)

        I don't know if I agree with you. If it's public, everyone should have access. If it shouldn't be public, it should be taken out of the searchable network. Either way, this gets all the cards on the table.

        By and large, I do think anything the government tracks, short of an active investigation, should be available publically. Transparency is an important check; I wish we had it now with the current administration.

        If you're concerned about embarrassing or damaging information being fossilized in public r
      • If it's a public benefit to everyone, that's pretty much what taxes are for, yes. I'd much sooner see my tax dollars go to a public information network than corporate favoritism.

        In practice, I suspect those admin/handling fees were pulled straight out of someone's rear as "what's the maximum they won't object to?" rather than reflecting any real cost. They're only unreasonable after they've been declared so in court.
        • I'd much sooner see my tax dollars go to a public information network than corporate favoritism.

          You say that like they're mutually exclusive.

          Such a project, in reality, would be a giant boondoggle; I can almost imagine all the big IT implementors slavering over their keyboards writing up slick proposals for it. But in the end it would be overbudget, incomplete, behind schedule, poorly designed, and nearly impossible for a sane person to use without cringing, just like 99% of everything that's produced accor
      • Re: (Score:3, Insightful)

        We already have. It's just that only people with lots of money or time or government support can reasonably collate the information.

        Your current expectations of privacy reflect a misconception that obscurity is somehow the same thing as privacy. It's not and never was. This effort will draw a clear line between what's private and what isn't and correct any misconceptions.
  • by Nephster (203800) on Monday April 30 2007, @09:10PM (#18936239)
    The Wisconsin Circuit Court Access (WCCA) [wicourts.gov] ?

    That site allows you to search any court case in WI. There are limitations - minors often aren't on there, and certain other cases are blocked from public access as well. But overall this has been a *good* thing.

    Hell, I even once ran a girl I had started dating through there - and turned up three shoplifting convictions.

    We always went to her place after that... :-)

    • Well, for a start, I'm sure Google's search engine would actually let you search the full text of the case by keyword.

      This stupid form wants me to know case numbers or people involved.. I just wanna know what copyright cases have been prosecuted in the last year.

  • I'm tired of sites like Domania or any of a gazillion foreclosure sites make money by selling you information that is of public record. Why do I have to pay to find out if a used car I want to buy has been in an accident, when that information is available in public records? Google is not talking about publishing stuff that isn't already public record -- they're publishing stuff that HAS ALREADY HAD THE MERITS OF BEING PUBLIC DEBATED -- that's how these things became public records in the first place. S
  • Very, very bad (Score:3, Interesting)

    by Angst Badger (8636) on Monday April 30 2007, @09:54PM (#18936553)
    I'm sure there will be a steady stream of eager users for stalker.google.com long before it emerges from beta.

    I'm guessing that it's cluelessness on the part of Google management, but I hope someone there gives some thought to what will happen to their "do no evil" public image when the body count from their negligence first crests over three digits.
  • Google Is Creepy (Score:4, Insightful)

    by chromozone (847904) on Monday April 30 2007, @10:20PM (#18936707)
    Google plays along with China in censoring but it lobbys in the US for opening government records? While at the same time its board is advising it's sharholders to not vote for proposals to bar any "proactive" censorship (and Google is censoring a lot)? Google is getting creepy and I already use another search engine to get unbiased results. Google's board objects to anticensorship proposal http://www.marketwatch.com/news/story/googles-boar d-objects-anti-censorship-proposal/story.aspx?guid =%7BE4924442-BA3A-4F47-A5B8-DCA66F1A9CB0%7D [marketwatch.com]
  • I'd like to have the name and address of every legitimate business in the United States, for web site legitimacy validation. [sitetruth.com] I've purchased databases which contain an approximation to that information, but that's mostly phone book data, not Government data.

    More business records need to be easily available. This varies from state to state now. Corporation records are usually freely available, although a few states (notably Delaware) charge for address information. Every US state has their own format;

  • easier to mine (Score:3, Insightful)

    by john_uy (187459) on Tuesday May 01 2007, @01:01AM (#18937537)
    the difference though if you can search the records electronically is that it will be easier to mine the information. that will be very difficult with printed records limiting the scope of malicious activities.
  • by hey! (33014) on Tuesday May 01 2007, @09:02AM (#18940105) Homepage Journal
    While the social security number issue is important, it barely scrapes the surface when it comes to the dangers of doing this.

    Open government records exist to ensure government accountability.

    For example, court records exist to prove that the operations of the courts are fair, impartial and proper.

    However, when incorporated into private databases (in this I include search engine indices) the character of these records changes tremendously. For example, if you are sued by your landlord, future landlords will be able to search the records, using it as an intelligence database on you, not a record of the operation of the government. Likewise, if you had a disgruntled employee who files complaints about alleged violations of state regulations, then potential employees and investors could be deterred from doing business with you.

    Placing public records in private datasets alters the nature of the records.

    The problem is that in the US, the legal notion of privacy is broken. It was broken by technology.

    In the US, we have a libertarian notion of privacy that is based on some simple dichotomies: public/private, disclosed/undisclosed. Information is either of a public nature, in which case it is fair game to ferret out and publish, or it is of a private nature, in which case you are protected from intrusion. It is either undisclosed, which means that if it is of a private nature it is safe, or it is disclosed, in which case anybody who has the data in hand is welcome to publish it to the world. The only exception are those who have a specially recognized duty of confidentiality, such as doctors and lawyers.

    The reason that this is a libertarian notion is that it seeks to preserve the freedom of anybody who receives data to do whatever they please with it. That is why when you give your name and address to a vendor, that vendor can turn around and sell that information, as well as information about what you have purchased, to somebody else. While US law forbids using this information in a credit report, it does not clearly forbid using it for investigative purposes such as a background check. Even if you construe a usage of this information as a violation of the Fair Credit Reporting Act, since it is not part of your credit report, you have no way of knowing that it is being used in a background check or by identity thieves or stalkers.

    Our notions about privacy tend to be centered around the concepts of non-disclosure, but the issue of privacy is much deeper. Privacy, in my view, is the right of the individual to choose and act autonomously without unreasonable interference by outside parties. Limiting informational privacy to protection of non-disclosed facts falls far short of protecting what we expect privacy to secure, which is nothing less than individual liberty. Nowhere is the threat to individual liberty greater than in the use of government records for purposes other than ensuring the proper operation of government. As an individual, you are not free to avoid appearance in such records. If you are sued, your name, address, and information about your doings goes into a public record. This is true even if you are subpoenaed. What is worse, individual bits of data about you can be assembled from various public record sources to create a picture of your private life, even if no single fact in isolation reveals much.

    In 1972, the US Department of Health Education and Welfare published a report called "Records, Computers and the Rights of Citizens". It was a very early look into the impact of computer record keeping on privacy. The report, started under Secretary Elliot Richardson, recognized the privacy dangers inherent in using data for purposes other than. Richardson, whose integrity and impartiality was widely admired on both sides of the isle, left his position in HEW to take over as Attorney General during the mushrooming Watergate scandal, and he was replaced by Caspar Weinberger, known to current generations as an early
    • They are talking about increasing access to already public records.

      There is nothing inherantly "public" about the IP address of the computer I happen to be sitting at when sending an email. You might like to have it for some reason but it is arguably private information, not public.