What are some good tools for interactive web scraping and automation?
-
Tools for automating or scripting web app access, including rich JavaScript apps. Commercial or open source. Clarification Looks like the answers tools are for classic "scraping" or parsing websites read-only. I'm looking for a solution for those cases where the user needs to load forms or even run JavaScript client-side functions.
-
Answer:
Another python option is scrapy From http://scrapy.org/ What is Scrapy? Scrapy is a fast high-level screen scraping and web crawling framework, used to crawl websites and extract structured data from their pages. It can be used for a wide range of purposes, from data mining to monitoring and automated testing.FeaturesSimpleScrapy was designed with simplicity in mind, by providing the features you need without getting in your wayProductiveJust write the rules to extract the data from web pages and let Scrapy crawl the entire web site for youFastScrapy is used in production crawlers to completely scrape more than 500 retailer sites daily, all in one serverExtensibleScrapy was designed with extensibility in mind and so it provides several mechanisms to plug new code without having to touch the framework corePortable, open-source, 100% PythonScrapy is completely written in Python and runs on Linux, Windows, Mac and BSDBatteries includedScrapy comes with lots of functionality built in. Check this section of the documentation for a list of them.Well-documented & well-testedScrapy is extensively documented and has an comprehensive test suite with very good code coverage [...]
Brooks Cutter at Quora Visit the source
Other answers
Ruby and Ruby gem Nokogiri
Paul Denlinger
BeautifulSoup is a Python library for HTML and XML parsing. Great for extracting data from web!
Samarth Bansal
I tried out WebQL once, I don't think they have a free version though.
Konrad Szpirak
For windows, tried HTMLAgility Pack, open source. Works well.
Earljon Hidalgo
The ONE that works well, Open Source, Free, and always kept updated! Anyone can download it and use it at no charge. It is called Easy Data Feed http://www.easydatafeed.com/ Development · Open Source · Sample Projects · Visual Studio Support · Programming Language · Customize Core · Native API Development Scraping · Run Daily Schedules · Set Scrape Limits · Visit as Anonymous · Set Thread and Speed · Password Protected Scrape · Solve Decaptcha · Use Proxies Export · Upload to FTP · Save as CSV · Split File · Zip File · Force Save · Import/Export Profiles · EDI Integration · CRM Integration · Shopping Cart Integration Data Manipulation · Multiple Profiles · Add Custom Values · Limit Length · Strip HTML · Convert Measurements · Change Case · Custom Profiles · Find/Replace · Custom Merge Rules · Custom Clean Rules · Custom Remapping Rules Images · Unlimited Free Image Downloads · Download Images Locally · Unlimited File Storage · PDF to JPG · GIF to JPG · Apply Watermark · Transfer JPG from Body to FTP You can read about how to use it here: http://www.easydatafeed.com/open-source/ They also have developers you can hire to do the job for you, their skype is âeasydatafeedâ
Sammir Diesuza
For scraping, There are hundreds of Web Scraping tools today available for both commercial and personal use. If youâve never done any web scraping before, there is one basic tool is available which is more flexible and has extra functionality. It is user-friendly tool and easy to perform. I have one best free open source solution for you. I think you should have to use it because it is a freeware and you can easily download it and feel the difference compared to other scraping tools. It is called as âEasy Data Feedâ and it is available on http://www.easydatafeed.com Here is the list of its features: ⢠Upload to FTP ⢠Save as CSV ⢠Split File ⢠Zip File ⢠Force Save ⢠Import/Export Profiles ⢠EDI Integration ⢠CRM Integration ⢠Shopping Cart Integration ⢠Inventory Management You can read about how to use it here: http://www.easydatafeed.com/open-source/ They also have developers, you can hire them to do the job for you, and their Skype is âeasydatafeedâ
Mike John
Check out ParseHub - http://www.parsehub.com. It's a visual web scraper built to handle rich Javascript apps and to get data from forms, search, drop downs and even maps. I'm one of the founders of ParseHub and would love to get feedback from anyone that tries our tool!
Angelina Fomina
You will find many web scraping and automation services that brings you information from your targeted websites. I had tried different scraper tools such as ScraperWorld's LinkedIn scraper, Twitter scraper, Business directory extractor, Amazon product scraper etc. They all worked pretty well for me and provide accurate results, it save all extracted information at my database for further analysis and use. It arrange data in a csv file format to make it easy to use and analysis. These tools are user friendly and interactive..
Erma J. Deaton
Try it! http://www.visualscraper.com
Ho Thanh Son
Related Q & A:
- What is the best language for HTML parsing and web scraping?Best solution by Quora
- What is the best web scraping software for building contact information databases from online directories?Best solution by Quora
- What web scraping tool is the best to extract data?Best solution by Quora
- What is a good web page to apply for a job?Best solution by Yahoo! Answers
- What are some good web design trends for 2010?Best solution by Quora
Just Added Q & A:
- How many active mobile subscribers are there in China?Best solution by Quora
- How to find the right vacation?Best solution by bookit.com
- How To Make Your Own Primer?Best solution by thekrazycouponlady.com
- How do you get the domain & range?Best solution by ChaCha
- How do you open pop up blockers?Best solution by Yahoo! Answers
For every problem there is a solution! Proved by Solucija.
-
Got an issue and looking for advice?
-
Ask Solucija to search every corner of the Web for help.
-
Get workable solutions and helpful tips in a moment.
Just ask Solucija about an issue you face and immediately get a list of ready solutions, answers and tips from other Internet users. We always provide the most suitable and complete answer to your question at the top, along with a few good alternatives below.