What framework in PHP can I use to write a web application (like peoplesmart.com, ancestry.com. spock.com) that will scrape or collect data from public records, social media about people?!
-
I plan to write the application with PHP. Thanks.
-
Answer:
Both PHPCrawl (http://phpcrawl.cuab.de/) and Snoopy (http://sourceforge.net/projects/snoopy/) are PHP libraries you can use to crawl / scrape data from publicly available webpages.
Eric Wu at Quora Visit the source
Other answers
While I agree that PHP is not the ideal tool of choice for this task, for the record I totally disagree with ' reduction of PHP to being just "a web template language". If you're set on using PHP as you indicate, there's a built in library PHP-DOM (http://php.net/manual/en/book.dom.php) or this 3rd party library which makes the task syntactically easier called Simple HTML DOM (http://simplehtmldom.sourceforge.net/). As a suggestion, I know node.js has been used very successfully to build independent scraper robots which run lean and iterate quickly. I would consider making systems calls back to a node.js instance and then build your app in whatever language you prefer.
Mike Greenberg
For the scraping part you definitely want to take a look at, Arc90's Readability. There's a PHP port too: http://www.keyvan.net/2010/08/php-readability/
Irakli Nadareishvili
For starters -- Give up on PHP for the application, PHP is not designed for doing this sort of task well. Alternatively, I suggest: Python -- look at Scrapy Java -- Nutch though hopefully the Droids framework will make it more useful Ruby -- Anemone Then you're just off to page parsing.
David Koblas
Related Q & A:
- What can I use to clean a DVD?Best solution by Yahoo! Answers
- Can I use silverdine for a scar?Best solution by Yahoo! Answers
- How can I describe myself on a job application?Best solution by Yahoo! Answers
- What kind of jobs can i apply for with a associate degree in computer science?Best solution by Yahoo! Answers
- How can I use recipient in a sentence?Best solution by sentence.yourdictionary.com
Just Added Q & A:
- How many active mobile subscribers are there in China?Best solution by Quora
- How to find the right vacation?Best solution by bookit.com
- How To Make Your Own Primer?Best solution by thekrazycouponlady.com
- How do you get the domain & range?Best solution by ChaCha
- How do you open pop up blockers?Best solution by Yahoo! Answers
For every problem there is a solution! Proved by Solucija.
-
Got an issue and looking for advice?
-
Ask Solucija to search every corner of the Web for help.
-
Get workable solutions and helpful tips in a moment.
Just ask Solucija about an issue you face and immediately get a list of ready solutions, answers and tips from other Internet users. We always provide the most suitable and complete answer to your question at the top, along with a few good alternatives below.