Google crawler showing cache of another page
-
For the page http://www.thinkdigit.com/top-products/Laptops-and-PCs/top-10-laptops-124.php google is showing another page in cache (http://www.thinkdigit.com/top-products/Ultrabooks/top-10-ultrabooks-153.php). Please let me know how this happened and how to correct it.
-
Hi my friend, if you look at the cache of the URL you gave:
http://www.thinkdigit.com/top-products/Laptops-and-PCs/top-10-laptops-124.php :::: Click Me
You are actually looking at the source code of the following page:
http://www.thinkdigit.com/top-products/Ultrabooks/top-10-ultrabooks-153.php
To confirm this, look at the meta data in the source, it says Ultrabooks.
Now comes the issue where the rel=canonical implementation is incorrect on both the pages as they both point to themselves. Check out the source code of both the pages. Their rel=canonical attributes point to themselves. So as per my original explanation, Google is showing the cache of /top-10-ultrabooks-153.php for top-10-laptops-124.php which is the actual issue at hand. So when you look at the source code of cached page, you are actually looking at the source code of /top-10-ultrabooks-153.php page.
Best,
Devanur Rafi
-
Your slightly actually incorrect Devanur, the reason the wrong page is cached is because the page previously had a canonical tag referencing the other page.
If you look at the cache of http://www.thinkdigit.com/top-products/Laptops-and-PCs/top-10-laptops-124.php :::: Click Me
You will see in the source code a canonical tag for the other page:
http://www.thinkdigit.com/top-products/Ultrabooks/top-10-ultrabooks-153.php" />
And the info at the top of cache page confirms Google is counting the one page as the other (see attachment)
-
Hi,
First things first, the page, http://www.thinkdigit.com/top-products/Ultrabooks/top-10-ultrabooks-153.php is not in Google's index.
Secondly, for both the phrases, 'top 10 laptops' and 'top 10 ultrabooks', the page,
http://www.thinkdigit.com/top-products/Ultrabooks/top-10-ultrabooks-153.php, ranks in the first position from your website, thinkdigit.com
So when you try to look-up the cache for a non-existing page in the index, Google tries to return the closest match and which is, http://www.thinkdigit.com/top-products/Ultrabooks/top-10-ultrabooks-153.php
I see a problem with the Sitemap.xml file for your site. Its not comprehensive and if you look at the cache of it in Google, you will see, the page, http://www.thinkdigit.com/top-products/Laptops-and-PCs/top-10-laptops-124.php is in there but its missing in the current Sitemap.xml file.
Here are three things you might do to make http://www.thinkdigit.com/top-products/Laptops-and-PCs/top-10-laptops-124.php in to the Google's index.
1. From Google webmaster tools account, Fetch as Google the above page and submit.
2. Come up with a comprehensive Sitemap.xml file
3. There is no reference to the Sitemap.xml file from Robots.txt file. You can add it as follows:
Sitemap: http://www.thinkdigit.com/sitemap.xml
You should be good after that. All the best to you my friend.
Regards,
Devanur Rafi
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Fetch as Google
Are there any pros or cons with using Google fetch and submit? I realise Google will likely find it of its own accord in due course but I have found it may take a couple of weeks if at all. Fetch and submit seems to speed this process up, sometimes anyway.
On-Page Optimization | | seoman100 -
Make Google reindex the website after on-page optimization
Hello Moz Community, I just finished the on-page optimization for a new project and I would like to know how can I make Google to reindex the new link structure, titles and meta tags? Thank you!
On-Page Optimization | | CosminC0 -
Duplicate Page Titles in Crawl Errors (although Google is rewriting in serps ??)
Hi Im working on a client/project and crawl report is showing thousands of dupe page titles In the case of the blog/news section its aprox 50 since aprox 50 posts and they all have the same meta-title: "Brand News | Brand" as opposed to: "Title Unique to Page/Topic/KW Relating to Content | Brand" Since these are the main content pages we want to rank (in addition to the main site category pages) then i have instructed dev must prioritise populating these pages meta-titles with the actual post/article titles, as per the latter version of the above example. (I should mention that i have requested they fix all dupe titles but main content pages are the priority). Whilst this will reduce the number of dupe titles in crawl error/warning report which is a good thing, is it actually likely to increase the ranking of these news/content pages given that Google does seem to be rewriting the titles correctly in the serps based on the page content ? Many Thanks in advance for your input
On-Page Optimization | | Dan-Lawrence0 -
Why don't all my pages have On Page Optimization Reports
Apologies if this question has been asked a million times, but I can't find it. I have 35 pages, yet only 5 of them have generated On Page Optimization Reports. I know I can create them manually, but wondered if I've done something incorrectly? Iain.
On-Page Optimization | | iainmoran0 -
How google handle with Title when is invisible on the page?
I would like to display H1 Title tag invisible on the homepage. I set up Title colour same colour as background colour. How google handle with this cloaking? What should I to set up in the style.css?
On-Page Optimization | | joeko0 -
Issues with Product Pages Getting Index In Google
I just started working here the other week and one of the big issue is that a lot of the product pages are not getting index in google. We have an xml.gz site map they submitted a long time ago. My guess is it might be something with not enough content on the pages? Here are a few example of pages that are not getting index in google. http://www.rockymountainatvmc.com/p/43/-/439/716/-/33097/Alpinestars-Dual-Motorcycle-Gloves http://www.rockymountainatvmc.com/p/47/-/201/803/-/28948/Camelbak-Blowfish-2013 http://www.rockymountainatvmc.com/p/46/-/203/836/-/6996/MSR-Head-Case http://www.rockymountainatvmc.com/p/44/54/208/764/80/1220/Galfer-Brake-Pad-Sintered-Metal There are 100's that are not indexed just trying to figure out what we need to do! We are working on new content to them all but we have over 5000 products so it will take a long time. We also have the reviews on the pages and are looking at starting a Q&A on page to help get more unique content.
On-Page Optimization | | DoRM0 -
Duplicate page
Just getting started and had a question regarding one of the reports. It is telling me that I have duplicate pages but I'm not sure how to resolve that.
On-Page Optimization | | KeylimeSocial0 -
Web Page Refresh
Hi there, we redesign our Website, changing it for a jquery based version. This new design is much more usable and nice for our users, however the average page views for user decreased a lot. Basically this is due to the fact that once the user is logged in, it spends most of the time in the same Web form which is updated through jquery without refreshing it. We were thinking about adding a meta refresh tag, or ad some javascript for getting this task done in order to get the relation page views/visitor increased. Do you think refreshing the page every 4 minutes could be penalized by Google (or other Search engines) ? Which should be the interval between refresh ? Would it be better to make it very explicit (i.e. adding a meta refresh tag) or using a kind of hide javascript ? We want to increase the pageviews but of course, we don't want to get penalized
On-Page Optimization | | martincad0