Launching a website that contains public government data
-
Where can I find information about the legality of using public government data on my website? I came across the website http://sf.everyblock.com/, which takes data from city governments and posts them on its website in a nice user interface. These include things like dates and locations of crimes, liquor license violations, and so on. (I have also seen websites that allow you to search for sex offenders in your neighborhood.) I am interested in a vaguely related idea that would involve using public government data that is already available on the internet. What are the rules about copying data from government websites and using it for a private website? It seems legal since EveryBlock is doing it without apparent problems, but I'd like to know how I can go about ascertaining this if I want to use a particular city or state's data. (Note: I'm just looking for general information on this topic, and not specific legal advice about my idea.)
-
Answer:
All federal work http://www.law.cornell.edu/uscode/html/uscode17/usc_sec_17_00000105----000-.html. I assume this applies to state and local government, but I'm not positive.
lunchbox at Ask.Metafilter.Com Visit the source
Other answers
Slight tangent: I doubt that EveryBlock is scraping the data from other web sites. They've probably bought (or otherwise obtained) a database, or are using an API to access the info. (This probably also means that the terms for using the info were provided to them by their source).
winston
null terminated: I think the federals are the only folks required to have their stuff in the public domain.
the dief
I just had to research this for work and things done by Federal employees in the course of their work is in the public domain (excepting some logos and a few other things that you are not allowed to use) but state and local are not necessarily the same. You'd be best to check your local statutes to find out about state and city government sites. For instance, IANAL, but I see the site you linked to is formatting the data in their own way. They are not copying the city of San Fran's website verbatim nor using their SFGOV logo, etc. It looks like all the information they are posting is available by public record (business permits, etc.). However, the city might take offense if they posted an exact copy of their website, pics and logos (it does have a copyright at the http://www.sfgov.org/). I spent $120 to speak with a copyright attorney once about a product I was licensing and it was the best money I ever spent. He told me things in plain English that I would not have known to look for on the US Government's copyright site.
Marie Mon Dieu
I assume this applies to state and local government, but I'm not positive. It does not apply to state and local governments, but there may be other reasons that the work is not copyrightable subject matter. Also, there is a possibility that EveryBlock's specific arrangement of the works (whether public domain or not) may be subject to protection. I am an intellectual property lawyer, but I am not your lawyer. The above is not legal advice.
anathema
there may be other reasons that the work is not copyrightable subject matter You'd be hard-pressed to find anyone knowledgeable on the matter who will tell you that raw data can be copyrighted. Clever arrangement of the data can be, but the data can't be. I also think that NASA can claim copyright on things it makes, even if it's a branch of the federal government.
oaf
I also think that NASA can claim copyright on things it makes, even if it's a branch of the federal government. http://www.nasa.gov/audience/formedia/features/MP_Photo_Guidelines.html. The restrictions have to do with use of marks and sponsorship.
anathema
I point to http://numbrary.com/ and (my own) http://infochimps.org/ as two sites that are doing similar things. Infochimps was designed to host and distribute exactly this kind of data. Also, http://theinfo.org is a burgeoning community for us data nerds. As for laws on copyright & data: two great resources are http://www.iusmentis.com/databases/us/ and http://www.bitlaw.com/source/cases/copyright/#Compilations%20and%20Databases. Government or not, a comprehensive assemblage of facts cannot, in general, be copyrighted. My non-lawyer but well-investigated understanding (following only applies to the US, where the database laws are actually more liberal than elsewhere; I have no idea what the situation is outside the US): Copyright only applies where there is 'creative' content. A comprehensive list of cars and retail prices cannot be copyrighted; a comprehensive collection of *reviews* of those cars can be copyrighted. This is the important http://en.wikipedia.org/wiki/Feist_Publications_v._Rural_Telephone_Service case: "Facts, whether alone or as part of a compilation, are not original and therefore may not be copyrighted. A factual compilation is eligible for copyright if it features an original selection or arrangement of facts, but the copyright is limited to the particular selection or arrangement. In no event may copyright extend to the facts themselves." -- Sandra Day O'Connor for the Supreme Court "A collections of facts are not copyrightable per se ... A compilation, like any other work, is copyrightable only if it satisfies the originality requirement ("an original work of authorship"). Facts are never original, so the compilation author can claim originality, if at all, only in the way the facts are presented. The facts must be selected, coordinated, or arranged "in such a way" as to render the work as a whole original." -- Sandra Day O'Connor for the Supreme CourtA presentation of data can be creative -- you can't xerox the blue book and hand that out. However, a conversion of data into your own creative presentation satisfies this restriction. So would a presentation (original or converted) that did not arise from a creative act -- you couldn't claim copyright on a .CSV file of some dataset. Besides "presentation" and a couple edge cases (such as "hot news" or "selection and arrangement"), the main one to be aware of is "Terms of Service"... If you have to agree to terms of service that restrict the data, but you take it anyway, you can be guilty of trespass. My understanding there is that if you can a) access the site by robot (no person clicks anything) AND b) there is no http://www.automotive.com/robots.txt, they can't sustain a claim that it's a restricted resource.
mrflip
Related Q & A:
- Why does a website redirect to another website?Best solution by Yahoo! Answers
- How to scrape data from a website?Best solution by Stack Overflow
- How can I get a job in the Federal Government?Best solution by publications.usa.gov
- Where is a website where I can find demographic data. For the US?Best solution by sba.gov
- Does anyone know of a good website to search public records?Best solution by Yahoo! Answers
Just Added Q & A:
- How many active mobile subscribers are there in China?Best solution by Quora
- How to find the right vacation?Best solution by bookit.com
- How To Make Your Own Primer?Best solution by thekrazycouponlady.com
- How do you get the domain & range?Best solution by ChaCha
- How do you open pop up blockers?Best solution by Yahoo! Answers
For every problem there is a solution! Proved by Solucija.
-
Got an issue and looking for advice?
-
Ask Solucija to search every corner of the Web for help.
-
Get workable solutions and helpful tips in a moment.
Just ask Solucija about an issue you face and immediately get a list of ready solutions, answers and tips from other Internet users. We always provide the most suitable and complete answer to your question at the top, along with a few good alternatives below.