How to crawl same url using Scrapy?

Does Google crawl http://goo.gl shortened URLs?

  • We are using Google's own url shortening service. All the target URLs which are shortened are publicly available but I don't want Google to index them. I understand that robots.txt is going to prevent indexing of such documents if I choose to. I was wondering, if Google starts crawling target URLs too. If Google is crawling, I don't mind it. I don't want them to appear in search results. Any ideas?

  • Answer:

    No, shortened URL's will not appear in the search results. The content will. Google won't necessarily index your content, but using Google's URL shortener will ping the content, likely resulting in a crawl. If you setup the page correctly, you can almost definitely keep the page from being indexed. As a quick note of clarification: Google indexes content based on other pages linking to that content. Google crawls through the web and looks for links that have never been seen before. It is my understanding that if a URL shortened by Google has never been seen before, it will behave similarly to a new link. It doesn't add any SEO value, but Google bot will probably visit your site, simply because it is new content. I sometimes get my pages indexed faster by sharing them on Google+, because I know it pings the content, resulting in a crawl.

Jesse Leimgruber at Quora Visit the source

Was this solution helpful to you?

Related Q & A:

Just Added Q & A:

Find solution

For every problem there is a solution! Proved by Solucija.

  • Got an issue and looking for advice?

  • Ask Solucija to search every corner of the Web for help.

  • Get workable solutions and helpful tips in a moment.

Just ask Solucija about an issue you face and immediately get a list of ready solutions, answers and tips from other Internet users. We always provide the most suitable and complete answer to your question at the top, along with a few good alternatives below.