alfanous icon indicating copy to clipboard operation
alfanous copied to clipboard

Find a strategy to eliminate double indexing of similar pages.

Open assem-ch opened this issue 10 years ago • 9 comments

  • add "no index" to search pages if a parameter is not affecting the keywords in output such as: vocalized, script, view...
  • add "no follow" to clickable links that should not be indexed, like Juz:N, Hizb:2

This is the report of google in Webmaster tools about this issue:

Googlebot encountered problems while crawling your site http://www.alfanous.org/.

Googlebot encountered extremely large numbers of links on your site. This may indicate a problem with your site's URL structure. Googlebot may unnecessarily be crawling a large number of distinct URLs that point to identical or similar content, or crawling parts of your site that are not intended to be crawled by Googlebot. As a result Googlebot may consume much more bandwidth than necessary, or may be unable to completely index all of the content on your site.

More information about this issue

Here's a list of sample URLs with potential problems. However, this list may not include all problematic URLs on your site.

    http://www.alfanous.org/en/translation/?sortedby=score&recitation=14&query=ille-btig%C3%A2e&translation=en.transliteration&page=1&unit=translation&view=default
    http://www.alfanous.org/en/translation/?sortedby=mushaf&recitation=18&query=hixiyana&translation=en.ahmedali&page=1&unit=translation&view=default
    http://www.alfanous.org/id/translation/?sortedby=score&recitation=14&query=lang%3Apt+AND+gid%3A1702&translation=en.transliteration&page=1&unit=translation&view=default
    http://www.alfanous.org/pt/aya/?query=%D8%A2%D8%AA%D9%8A%D9%86%D8%A7%D9%87%D8%A7&page=1&unit=aya
    http://www.alfanous.org/en/translation/?query=gid:2633%20AND%20lang:de&page=1&unit=translation
    http://www.alfanous.org/fr/translation/?query=kjenner&page=1&unit=translation
    http://www.alfanous.org/es/translation/?query=Hereafter)%3B&page=1&unit=translation
    http://www.alfanous.org/fr/?query=sura%3A%22At-Taghabun%22%20%2B%20aya_id%3A12&page=1&unit=aya
    http://www.alfanous.org/en/aya/?query=%D9%88%D9%8E%D9%85%D9%90%D9%86%D9%8E&page=8&unit=aya
    http://www.alfanous.org/ar/translation/?query=befall.&page=1&unit=translation
    http://www.alfanous.org/ku/translation/?sortedby=mushaf&recitation=18&query=ilmu&translation=en.ahmedali&page=1&unit=translation&view=default
    http://www.alfanous.org/fr/aya/?query=aya_%3A%D8%B8%D9%8E%D8%A7%D9%84%D9%90%D9%85%D9%8E%D8%A9%D9%8C&page=1&unit=aya
    http://www.alfanous.org/ja/aya/?query=a_w%3A22&page=1&unit=aya
    http://www.alfanous.org/en/translation/?sortedby=score&recitation=14&query=biyun&translation=en.transliteration&page=1&unit=translation&view=default
    http://www.alfanous.org/ar/aya/?query=sura%3A%22%D8%A7%D9%84%D9%85%D8%A7%D8%A6%D8%AF%D8%A9%22+%2B+aya_id%3A75&page=1&unit=aya
    http://www.alfanous.org/fr/translation/?sortedby=mushaf&recitation=18&query=%5Buninhibited%5D&translation=en.ahmedali&page=1&unit=translation&view=default
    http://www.alfanous.org/en/translation/?query=budur.%22&%22=&-=&page=1&unit=translation
    http://www.alfanous.org/fr/translation/?query=grain,&page=4&unit=translation
    http://www.alfanous.org/ku/aya/?query=%3E%D8%A7%D9%82%D8%B0%D9%81%D9%8A%D9%87&page=1&unit=aya
    http://www.alfanous.org/ku/aya/?sortedby=mushaf&recitation=18&query=%D9%8A%D9%8E%D8%B4%D9%8E%D8%A7%D8%A1%D9%8F&translation=en.ahmedali&page=1&unit=aya&view=default
    http://www.alfanous.org/es/translation/?sortedby=mushaf&recitation=18&query=Noastr%C4%83%3F&translation=en.ahmedali&page=1&unit=translation&view=default
    http://www.alfanous.org/ms/aya/?query=%D8%A2%D9%8A%D8%A9_%3A%D9%85%D9%8F%D8%B9%D9%8E%D9%85%D9%91%D9%8E%D8%B1%D9%8D&page=1&unit=aya
    http://www.alfanous.org/en/translation/?query=lang%3Aes+AND+gid%3A4846&page=1&unit=translation
    http://www.alfanous.org/id/translation/?query=gid%3A623+AND+lang%3Aen&page=1&unit=translation
    http://www.alfanous.org/fr/?query=aya_:%D9%86%D9%8F%D9%83%D9%8E%D9%81%D9%91%D9%90%D8%B1%D9%92&%22=&page=1&unit=aya
    http://www.alfanous.org/en/translation/?query=gid:5524%20AND%20id:ml.abdulhameed&page=1&unit=translation
    http://www.alfanous.org/en/translation/?sortedby=mushaf&recitation=18&query=lang%3Ade+AND+gid%3A600&translation=en.ahmedali&page=1&unit=translation&view=default
    http://www.alfanous.org/ml/translation/?query=lang%3Aen+AND+gid%3A4078&page=1&unit=translation
    http://www.alfanous.org/ku/translation/?sortedby=score&recitation=14&query=gid%3A5432&translation=en.transliteration&page=1&unit=translation&view=default
    http://www.alfanous.org/ku/translation/?sortedby=mushaf&recitation=18&query=ya%5D&translation=en.ahmedali&page=1&unit=translation&view=default
    http://www.alfanous.org/ja/aya/?query=sura%3A%22Al-An'am%22%20%2B%20aya_id%3A53&page=1&unit=translation
    http://www.alfanous.org/en/translation/?query=rrug%C3%ABn&page=10&unit=translation
    http://www.alfanous.org/id/aya/?query=%3E%D8%B9%D9%86%D8%AF&page=1&unit=aya
    http://www.alfanous.org/es/?query=a_l%3A177&%22=&page=1&unit=aya
    http://www.alfanous.org/en/translation/?sortedby=mushaf&recitation=18&query=%D8%A7%D9%86%D8%B8%D8%B1%D9%88%D9%86%D8%A7%C2%BB&translation=en.ahmedali&page=1&unit=translation&view=default
    http://www.alfanous.org/id/translation/?query=gid:3424%20AND%20id:en.yusufali&page=1&unit=translation
    http://www.alfanous.org/pt/translation/?query=rozpowszechni%C5%82o&page=1&unit=translation
    http://www.alfanous.org/es/translation/?query=feekum&page=1&unit=translation
    http://www.alfanous.org/fr/translation/?query=op%2C&page=1&unit=translation
    http://www.alfanous.org/ja/aya/?action=search&query=%D9%82%D9%8F%D9%88%D9%91%D9%8E%D8%A9%D9%8E&sortedby=relevance&page=1&unit=aya
    http://www.alfanous.org/en/?query=%3E%D9%88%D8%A7%D8%AC%D8%B9%D9%84&page=1&unit=aya
    http://www.alfanous.org/ar/?query=%D9%83_%D8%A2%3A19&page=8&unit=aya
    http://www.alfanous.org/id/translation/?query=gid%3A295+AND+id%3Atr.yuksel&-=&page=1&unit=translation
    http://www.alfanous.org/ar/translation/?query=gid:5812%20AND%20id:en.yusufali&page=1&unit=translation
    http://www.alfanous.org/ar/aya/?action=search&query=%D8%A2%D9%8A%D8%A9_:%D9%88%D9%8E%D9%85%D9%8E%D9%84%D9%8E%D8%A6%D9%90%D9%87%D9%90&sortedby=relevance&page=1&unit=aya
    http://www.alfanous.org/en/translation/?query=%C5%9F%C3%BCkretmeyecekler&page=1&unit=translation
    http://www.alfanous.org/pt/translation/?sortedby=mushaf&recitation=18&query=deseja&translation=en.ahmedali&page=1&unit=translation&view=default
    http://www.alfanous.org/pt/aya/?query=%D8%A3%D9%81%D8%B3%D8%AF%D9%88%D9%87%D8%A7&page=1&unit=aya
    http://www.alfanous.org/en/translation/?sortedby=mushaf&recitation=18&query=lang%3Aen+AND+gid%3A5933&translation=en.ahmedali&page=1&unit=translation&view=default

assem-ch avatar Apr 18 '14 20:04 assem-ch

Can we just redirect bots to common search pages instead of adding "no index"? For example: if a bot is requesting "?query=Allah&view=default" redirect it to "?query=Allah".

mdebbar avatar Apr 18 '14 20:04 mdebbar

is there a way to do that?

assem-ch avatar Apr 18 '14 20:04 assem-ch

Yes. We have to read the User-Agent (from the HTTP headers) and compare it with a list of bot user agents. http://user-agent-string.info/list-of-ua/bots. For example, Google's bot has the user agent "Googlebot".

mdebbar avatar Apr 18 '14 20:04 mdebbar

will the User-agent think that it is crawling start url or end url? because if it assumed it is indexing different urls with same contents so it will be like "fake content"

assem-ch avatar Apr 18 '14 20:04 assem-ch

When they hit the start url, they shouldn't get any content. We should respond with an HTTP redirect (code 302) so they go and load the end url. Maybe redirection is a bad idea, I'm not sure.

mdebbar avatar Apr 18 '14 20:04 mdebbar

I dont know what's better , but seems for the urls sharable by users, is better to be redirected.

here is redirection of googlebot to smartphone version of pages: https://developers.google.com/webmasters/smartphone-sites/redirects

assem-ch avatar Apr 18 '14 21:04 assem-ch

I see two issues:

  • [x] multi-language support (URL switch/sub-directory)

    Related doc to fix it: Multi-regional and multilingual sites & Use hreflang for language and regional URLs

      ##for each supported language xx
      <link rel="alternate" href="http://www.alfanous.org/xx/..." hreflang="xx" />
      ##default too
      <link rel="alternate" href="http://www.alfanous.org/..." hreflang="x-default" />
    
  • [x] trivial query parameters (query seems the only important one like mdebbar had mentioned)

    Related doc to fix it: Use canonical URLs

      ##canonical page with only `query=` parameter
      <link rel="canonical" href="https://www.alfanous.org/?query=..." />
    

Update to add another one:

sneetsher avatar May 10 '17 19:05 sneetsher

Canonical, multi-language & pagination rel-links added in the linked PR.

  • "no index" seems not needed, Canonical link is expected to fix it.
  • "no follow" for Juz:N, Hizb:N, sura_arabic:الفاتحة ... I'm ok if this is a temporary workaround, but seems to me that the correct way is to have specific result page template for such pre-ordered collection of ayahs. Ayahs should be ordered as in noble Quran by default and no separation between ayahs. That's what the user would expect.

sneetsher avatar May 22 '18 08:05 sneetsher

Grouping by juz and hizb and sura, are intended to be #302 feature

assem-ch avatar May 23 '18 18:05 assem-ch