How do comparison websites crawl data from different websites? When I checked the terms-of-use policies of various websites, they say it is illegal to crawl or monitor the site with an automated program. What are the legal issues in India regarding this?
Answer:
IANAL. Scraping isn't by itself illegal, and not all data types are subject to copyright. Statistics, for example, aren't subject to copyright. Google and other search engines couldn't exist without scraping. Scrapers are just special web browsers.
Stopping your site from being scraped:
- Use a robots.txt file on your site to forbid crawlers (http://en.wikipedia.org/wiki/Robots_exclusion_standard). This will stop indexers and crawlers that support robots.txt from crawling and scraping your site.
- Put all the content you want to protect behind a login/password, and use a captcha. Circumventing access restrictions is illegal (in the US), and a captcha makes it much harder to access the content programmatically. If someone wants to scrape your site behind a login, they have to break the law.
- Enable hot-linking protection on your web server. This will stop other sites from linking directly to content hosted on your servers.
FWIW: Your site's TOS has no effect on the legality of scraping your site.
If you believe your site has valuable data that people will want to scrape legally, you might want to consider creating an API that requires a key. Then you can keep track of how people are accessing your data.
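To make the robots.txt point concrete, here is a minimal sketch of how a well-behaved crawler might check a site's rules before fetching a page, using Python's standard urllib.robotparser module. The robots.txt content, the "ExampleBot" user agent, and the example.com URLs are made up for illustration; a real crawler would download the file from the target site instead of using an inline string.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, used here instead of a network fetch.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""


def build_parser(robots_text: str) -> RobotFileParser:
    """Parse robots.txt text into a RobotFileParser."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    return parser


def may_fetch(parser: RobotFileParser, user_agent: str, url: str) -> bool:
    """Return True if the rules allow this user agent to fetch the URL."""
    return parser.can_fetch(user_agent, url)


parser = build_parser(ROBOTS_TXT)

if __name__ == "__main__":
    for url in ("https://example.com/public/index.html",
                "https://example.com/private/data.html"):
        print(url, may_fetch(parser, "ExampleBot", url))
```

Note that this only works for crawlers that choose to run such a check; robots.txt is a request, not an enforcement mechanism.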
Anonymous at Quora
Other answers
I have been a scraping expert for five years and a data/content consultant for various clients. The question is very subjective and has long been a point of discussion. I will try to sum it up here:
1. It's actually not illegal unless it is prohibited in the terms and conditions of the target site. So scraping government sites that clearly state that scraping is prohibited without prior consent might land you in trouble.
2. Scraping public sites and sites containing product specifications, on the other hand, is absolutely all right, as that information is freely available.
3. Some sites have their own IP blocker that kicks in if excess activity is seen from an IP address. Examples are whitepages and yellowpages. (Scrapers work around this by using a proxy pool.)
4. You can protect your site by placing a robots.txt file in the root folder, so that compliant bots are told not to scrape it. The best way to save your site from scraping is to use images rather than plain text, or dynamic content loaded via AJAX, which doesn't show the text in the HTML source.
Hope this answers your doubts. Cheers! Mohit
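The IP blocker mentioned in point 3 can be pictured as a per-IP request counter over a sliding time window. The sketch below is only an illustration of that idea, not how whitepages, yellowpages, or any particular site implements it; the IpRateLimiter class, the thresholds, and the IP addresses (taken from the documentation range) are all hypothetical.

```python
from collections import defaultdict, deque


class IpRateLimiter:
    """Allow at most max_requests per IP within a sliding window of seconds."""

    def __init__(self, max_requests: int, window_seconds: float):
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        # Maps each IP to the timestamps of its recently accepted requests.
        self._hits = defaultdict(deque)

    def allow(self, ip: str, now: float) -> bool:
        """Return True if a request from ip at time now should be served."""
        hits = self._hits[ip]
        # Forget requests that have fallen out of the window.
        while hits and now - hits[0] >= self.window_seconds:
            hits.popleft()
        if len(hits) >= self.max_requests:
            return False  # too much activity from this IP: block it
        hits.append(now)
        return True


if __name__ == "__main__":
    limiter = IpRateLimiter(max_requests=3, window_seconds=10.0)
    for second in range(5):
        print(second, limiter.allow("203.0.113.7", float(second)))
```

Because the counter is keyed by IP address, spreading requests over a proxy pool keeps each address under the threshold, which is why this kind of blocking is easy to sidestep.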
Mohit Khatri
I guess I am too late here, but I thought I would write in case anyone is still looking for an answer. Is scraping legal? It's a very debatable question. As someone has already answered, it depends on how good your terms and conditions and privacy policy are. Since you have asked for help, first let me give you some copyright protection tips:
- Create a "Content Policy" page on your site and link to it in the footer.
- Read this guidance from Google: https://support.google.com/blogger/answer/157170?hl=en
- Add a short copyright notice in the footer.
Beyond reactive measures like copyright, let me give you some insight into preventive measures to stop or block scraping. Let me introduce you to the ShieldSquare anti-scraping solution, which does exactly what you are looking for in this question. By using ShieldSquare you can:
- Analyze every single page request made to your website.
- Get notified of scraping activity via our bot feed.
- Get detailed visibility into scraping activity, including the ISP, geolocation, number of pages scraped, and page directory.
I am from ShieldSquare (http://www.shieldsquare.com/), an anti-scraping solution; feel free to reach out to us to stop scrapers and protect your website content. We have a 15-day trial that gives you detailed visibility of bot signatures.
Raviraj Hegde
In a way, these terms are pretty meaningless. A website's user agreement of this kind is a browsewrap agreement (see http://www.nelsonmullins.com/articles/browse-wrap), and companies often do not provide sufficient notice of the terms to site visitors. So it's perfectly legal to scrape websites as long as you don't crawl at a disruptive rate.
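One way to avoid crawling at a disruptive rate is to enforce a minimum delay between requests to the same host. The following is a rough sketch of that idea; the PoliteThrottle class and the two-second delay are assumptions chosen for illustration, not a recognized standard for what counts as "non-disruptive".

```python
import time
from urllib.parse import urlparse


class PoliteThrottle:
    """Enforce a minimum delay between requests to the same host."""

    def __init__(self, min_delay_seconds: float,
                 clock=time.monotonic, sleep=time.sleep):
        self.min_delay = min_delay_seconds
        self._clock = clock      # injectable so the class can be tested
        self._sleep = sleep
        self._last_request = {}  # host -> time of the last request

    def wait(self, url: str) -> float:
        """Sleep if needed before requesting url; return seconds slept."""
        host = urlparse(url).netloc
        now = self._clock()
        last = self._last_request.get(host)
        delay = 0.0
        if last is not None:
            delay = max(0.0, self.min_delay - (now - last))
        if delay > 0:
            self._sleep(delay)
        # Record the moment the request is actually allowed to go out.
        self._last_request[host] = now + delay
        return delay


if __name__ == "__main__":
    throttle = PoliteThrottle(min_delay_seconds=2.0)
    for path in ("/a", "/b", "/c"):
        waited = throttle.wait("https://example.com" + path)
        print(path, "waited", round(waited, 2), "seconds")
```

A crawler would call wait() immediately before each HTTP request; requests to different hosts do not delay one another.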
Bernardas Ališauskas