How to check URL format?

Is there a way to access text from a website, automatically extract only the text, and auto-format the text, title, date, and URL into a Google Sheet?

  • For example, I wanted to put a document together full of news clippings about Joey Ramone. I want the articles formatted as follows...there will be a Header and then the text. HEADER: on the 1st line, I want the article title emboldened. On the 2nd line, I want the name of the publication, on the 3rd line I want the article's date, and on the 4th line, I want the URL (however, if the URL could be hyperlinked to the title, that'd work, too). TEXT: I'd like stripped of its native format, pictures, ads, etc. from the website and reformatted into 12 pt. Arial font whilst mainting the paragrahping, indentation from the original website. I want to have 20+ articles in a single google drive formatted like this. Thanks so much in advance! Please see below: ARTICLE 1 http://articles.latimes.com/2013/feb/08/entertainment/la-et-ms-joey-ramone-auction-vinyl-lps-guitars-estate-20130208 (URL hyperlinked) The LA Times 2-3-13 How much are a punk rock hero's personal items worth? Joey RamoneĂ¢€™s vinyl record collection, two electric guitars, a leather jacket, his passport, Rolodex, T-shirts and other items will be auctioned off by the late singerĂ¢€™s estate. ARTICLE 2, etc... http://www.spin.com/articles/rockers-honor-joey-ramones-b-day-nyc/ SPIN 5-20-09 It's not everyday that you find the Supersuckers, Fishbone, and a bunch of bagpipe players together in the same room... ...and so it goes...

  • Answer:

    Yes, I believe this is the tool you are looking for: Cheerio is a Node.js package that can parse, scrape, extract text data from webpages. https://github.com/MatthewMueller/cheerio You will have to do some plumbing yourself, like finding out how to write to Google Drive (https://developers.google.com/google-apps/spreadsheets/ I hope my answer points you to the right path. Thanks for reading.

Jae Yang at Quora Visit the source

Was this solution helpful to you?

Related Q & A:

Just Added Q & A:

Find solution

For every problem there is a solution! Proved by Solucija.

  • Got an issue and looking for advice?

  • Ask Solucija to search every corner of the Web for help.

  • Get workable solutions and helpful tips in a moment.

Just ask Solucija about an issue you face and immediately get a list of ready solutions, answers and tips from other Internet users. We always provide the most suitable and complete answer to your question at the top, along with a few good alternatives below.