How to crawl same url using Scrapy?

Google Webmaster Tools: Why are duplicate meta descriptions increasing even after using URL parameters?

  • Our site shows an increase in duplicate meta titles and descriptions even after we set the pagination parameter to "Paginates" with "No URLs" to crawl in the URL Parameters section. What else should we do so that Googlebot stops crawling these pages and the list of duplicate tags stops growing?

  • Answer:

    Changing the URL parameter settings won't solve your duplicate meta description issue. Change the meta descriptions instead: each URL should have a unique meta title and description. For pagination, you can use rel="next" and rel="prev" to avoid this problem.

    Let's say you have an article paginated into the following URLs:

      http://www.example.com/article-part1.html
      http://www.example.com/article-part2.html
      http://www.example.com/article-part3.html
      http://www.example.com/article-part4.html

    In the <head> section of the first page (http://www.example.com/article-part1.html), add a link tag pointing to the next page in the sequence, like this:

      <link rel="next" href="http://www.example.com/article-part2.html">

    Because this is the first URL in the sequence, there's no need to add markup for rel="prev".

    On the second and third pages, add links pointing to the previous and next URLs in the sequence. For example, you could add the following to the second page of the sequence:

      <link rel="prev" href="http://www.example.com/article-part1.html">
      <link rel="next" href="http://www.example.com/article-part3.html">

    On the final page of the sequence (http://www.example.com/article-part4.html), add a link pointing to the previous URL, like this:

      <link rel="prev" href="http://www.example.com/article-part3.html">

    Because this is the final URL in the sequence, there's no need to add a rel="next" link.

    Google treats rel="previous" as a syntactic variant of rel="prev". Values can be either relative or absolute URLs (as allowed by the <link> tag). If you include a <base> link in your document, relative paths will resolve according to the base URL.

    Some things to note:

    rel="prev" and rel="next" act as hints to Google, not absolute directives.

    If a component page within a series includes parameters that don't change the page's content, such as session IDs, then the rel="prev" and rel="next" values should also contain the same parameters. This helps Google's linking process better match corresponding rel="prev" and rel="next" values. For example, the page http://www.example.com/article?story=abc&page=2&sessionid=123 should contain the following:

      <link rel="prev" href="http://www.example.com/article?story=abc&page=1&sessionid=123" />
      <link rel="next" href="http://www.example.com/article?story=abc&page=3&sessionid=123" />

    rel="next" and rel="prev" are orthogonal concepts to rel="canonical". You can include both declarations. For example, http://www.example.com/article?story=abc&page=2&sessionid=123 may contain:

      <link rel="canonical" href="http://www.example.com/article?story=abc&page=2" />
      <link rel="prev" href="http://www.example.com/article?story=abc&page=1&sessionid=123" />
      <link rel="next" href="http://www.example.com/article?story=abc&page=3&sessionid=123" />

    If Google finds mistakes in your implementation (for example, if an expected rel="prev" or rel="next" designation is missing), Google will continue to index the page(s) and rely on its own heuristics to understand your content.
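If the pages are rendered from templates, the prev/next rules described above can be generated rather than written by hand. The following is only a minimal sketch in Python, not part of the original answer: the function name pagination_link_tags and the idea of passing the whole series as a list are assumptions made for illustration.

```python
from html import escape


def pagination_link_tags(urls, index):
    """Return the rel="prev"/rel="next" <link> tags for urls[index].

    `urls` is the ordered list of page URLs in one paginated series.
    The first page gets no rel="prev" tag and the last page gets no
    rel="next" tag, matching the rules described in the answer.
    """
    if not 0 <= index < len(urls):
        raise IndexError("index is outside the paginated series")
    tags = []
    if index > 0:
        # Every page except the first points back to its predecessor.
        prev_url = escape(urls[index - 1], quote=True)
        tags.append('<link rel="prev" href="%s">' % prev_url)
    if index < len(urls) - 1:
        # Every page except the last points forward to its successor.
        next_url = escape(urls[index + 1], quote=True)
        tags.append('<link rel="next" href="%s">' % next_url)
    return tags


if __name__ == "__main__":
    # Hypothetical four-part article, using the example URLs from the answer.
    series = ["http://www.example.com/article-part%d.html" % n for n in range(1, 5)]
    for i, url in enumerate(series):
        print(url)
        for tag in pagination_link_tags(series, i):
            print("  " + tag)
```

Note that escape() writes any "&" in a URL as "&amp;" inside the href attribute, which is the valid HTML spelling of the same URL.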

Khem Raj at Quora


Other answers

This is also a canonicalization problem. Define a preferred URL and use the canonical tag on each page. Use a noindex tag on the parameter pages, or block them in robots.txt with a parameter rule. Alternatively, use the pagination markup described in Google's Webmaster documentation, "Indicate paginated content" (https://support.google.com/webmasters/answer/1663744?hl=en).

rel="next" and rel="prev" are orthogonal concepts to rel="canonical". You can include both declarations. For example, http://www.example.com/article?story=abc&page=2&sessionid=123 may contain:

  <link rel="canonical" href="http://www.example.com/article?story=abc&page=2" />
  <link rel="prev" href="http://www.example.com/article?story=abc&page=1&sessionid=123" />
  <link rel="next" href="http://www.example.com/article?story=abc&page=3&sessionid=123" />
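One way to derive the preferred URL for the canonical tag is to drop the query parameters that do not change the page content. This is a rough sketch in Python under stated assumptions: the helper name canonical_url and the IGNORED_PARAMS set are hypothetical, and "sessionid" is used only because it is the parameter in the example above.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Assumption: parameters that never change what the page shows.
IGNORED_PARAMS = frozenset({"sessionid"})


def canonical_url(url, ignored=IGNORED_PARAMS):
    """Return `url` without the query parameters listed in `ignored`.

    The remaining parameters keep their original order.
    """
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key not in ignored
    ]
    # Rebuild the URL with only the content-relevant parameters.
    return urlunsplit(parts._replace(query=urlencode(kept)))


if __name__ == "__main__":
    page = "http://www.example.com/article?story=abc&page=2&sessionid=123"
    print(canonical_url(page))
```

The page parameter is kept on purpose: each page in the series has its own canonical URL, while the prev and next links still carry the session parameter as described above.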

Chhote Lal Lodh
