Which framework to use in my enterprise application?

What framework in PHP can I use to write a web application (like peoplesmart.com, ancestry.com. spock.com) that will scrape or collect data from public records, social media about people?!

Eric Wu at Quora Visit the source

Was this solution helpful to you?

Other answers

While I agree that PHP is not the ideal tool of choice for this task, for the record I totally disagree with ' reduction of PHP to being just "a web template language". If you're set on using PHP as you indicate, there's a built in library PHP-DOM (http://php.net/manual/en/book.dom.php) or this 3rd party library which makes the task syntactically easier called Simple HTML DOM (http://simplehtmldom.sourceforge.net/). As a suggestion, I know node.js has been used very successfully to build independent scraper robots which run lean and iterate quickly. I would consider making systems calls back to a node.js instance and then build your app in whatever language you prefer.

Mike Greenberg

For the scraping part you definitely want to take a look at, Arc90's Readability. There's a PHP port too: http://www.keyvan.net/2010/08/php-readability/

Irakli Nadareishvili

For starters -- Give up on PHP for the application, PHP is not designed for doing this sort of task well. Alternatively, I suggest: Python -- look at Scrapy Java -- Nutch though hopefully the Droids framework will make it more useful Ruby -- Anemone Then you're just off to page parsing.

David Koblas

Related Q & A:

Just Added Q & A:

Find solution

For every problem there is a solution! Proved by Solucija.

  • Got an issue and looking for advice?

  • Ask Solucija to search every corner of the Web for help.

  • Get workable solutions and helpful tips in a moment.

Just ask Solucija about an issue you face and immediately get a list of ready solutions, answers and tips from other Internet users. We always provide the most suitable and complete answer to your question at the top, along with a few good alternatives below.