Webmaster Tools Indexed pages vs. Sitemap?
-
Looking at Google Webmaster Tools and I'm noticing a few things, most sites I look at the number of indexed pages in the sitemaps report is usually less than 100% (i.e. something like 122 indexed out of 134 submitted or something) and the number of indexed pages in the indexed status report is usually higher. So for example, one site says over 1000 pages indexed in the indexed status report but the sitemap says something like 122 indexed.
My question: Is the sitemap report always a subset of the URLs submitted in the sitemap? Will the number of pages indexed there always be lower than or equal to the URLs referenced in the sitemap?
Also, if there is a big disparity between the sitemap submitted URLs and the indexed URLs (like 10x) is that concerning to anyone else?
-
Unfortunately not, the closest you'll get is selecting a long period of time in Analytics and then exporting all the pages that received organic search traffic. If you could then cross check them with your list of URLs on your site it could provide you with a small list. But I would still check them in Google to make sure they aren't indexed. As I said it's not the best way.
-
Is there a reliable way to determine which pages have not been indexed?
-
Great answer by Tom already, but I want to add that probably images and other types of content whom are mostly not by default included in sitemaps could also be among the indexed 'pages'.
-
There's no golden rule that your sitemap > indexed pages or vice versa.
If you have more URLs in your sitemap than you have indexed pages, you want to look at the pages not indexed to see why that is the case. It could be that those pages have duplicate and/or thin content, and so Google is ignoring them. A canonical tag might be instructing Google to ignore them. Or the pages might be off the site navigation and are more than 4 links/jumps away from the homepage or another page on the site, make them hard to find.
Conversely, if you had lots more pages indexed than in your sitemap, it could be a navigation or URL duplication problem. Check to see if any of the pages are duplicate versions caused by things like dynamic URLs generated through search on the site or the site navigation, for example. If those pages are the only physical pages that you have created and you know every single one has been submitted in a sitemap - and so any other indexed URLs would be unaccounted for, that may well be cause for concern, so check nothing is being indexed multiple times.
Just a couple of scenarios, but I hope it helps.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Google Analytics - Tracking multiple thankyou pages?
Hi Guys, I want to track email opt-ins for multiple thank you pages. The setup is as follows: http://image.prntscr.com/image/57632e05a15f42fda0b8ffec2d176460.png I have not yet built the thank you pages, so i was wondering what the URL should be to make it easy to track them in GA? I'm thinking: domain.com/thankyou-page/page1 Then using regular expression in GA to track /thankyou-page/ Would this be a good way to go about it? Cheers. f6c7r0
Reporting & Analytics | | spyaccounts110 -
Automated XML Sitemap for a BIG site
Hi, I would like to do an automated sitemap for my site but it has more than a million pages. It would need to be a sitemap index with a separation on different parts of the site (i.e. news, video) and I'll want a news sitemap and video sitemap as well (of course). Does anyone have any recommended way of making this and how much would you recommend it getting updated? For news and , I would like it to be pretty immediate if possible but the static pages don't need to be updated as much. Thanks!
Reporting & Analytics | | mattdinbrooklyn0 -
If you include video in a video sitemap should it also be in your global xml sitemap
I was wondering in hope of not duplicating URLs, if you include video in a video sitemap should it also be in your global xml sitemap. Would it be better to put them in one or both?
Reporting & Analytics | | mattdinbrooklyn0 -
Google is not indexing all URLs
My website have company and events profile from 200 countries. So it does have lots of URL. Earlier in August 2014, Google used to crawl 90% of URLs we submit. Thing goes wrong when we shifted from http to https. We lost traffic. But we are gaining it slowly. Main concern is that, It still does not indexed all submitted URLs. It have crawled merely 8% of all URLs submitted. site address is businessvibes.com Any help would be appreciated.
Reporting & Analytics | | irteam0 -
Google Analytics VS target="_blank" internal links: How much wrong is it?
I am working on an e-commerce website, and our CEO is sure that having target="_blank" in internal search result is boosting the conversion (not sure, but it's not an issue at the moment). The problem is that Google Analytics sees all URLs visited from search results as entrances/direct visits, hence the Booking Funnel Tracking does not work as it was supposed to. Is there any way to recover the tracking? Or we shall get the rid of target="_blank" attribute?
Reporting & Analytics | | apartmentGin0 -
Page Views Plummeted
Since using SEOMoz to diagnose and address issues with my site www.esecaucus.com, my page views have plummeted while my number of visits and unique visitors has remained intact. Does this mean that previously the amount of page views were artificially inflated by some kind of technical error in the site?
Reporting & Analytics | | Nobody15330770827560 -
Conflicting Analytics: Getclicky vs. Google Analytics
Removal of an old getclicky code (and installation of new tracking code) from my site after some changes resolved inaccuracies in my page view/bounce rate information. Prior to adding the correct getclicky code, getclicky was showing 95% plus bounce rate on my site. I knew this was wrong because GA showed around 40%. Ironically, now that my getclicky account is working (showing a 30% bounce rate), GA bounce rate is steadily climbing. Is it best to just stick with one analytics account so that the codes don't conflict? Is it possible to have two accurate analytics accounts at the same time?
Reporting & Analytics | | JSOC0 -
Yahoo Webmaster Tools still work?
Now that Yahoo is using Bing's search results, do we still need to monitor and use Yahoo's Webmaster's Tools - or do we stick to using the Bing Webmaster Tools?
Reporting & Analytics | | findachristianjob0