Webmaster Tools Indexed pages vs. Sitemap?
-
Looking at Google Webmaster Tools and I'm noticing a few things, most sites I look at the number of indexed pages in the sitemaps report is usually less than 100% (i.e. something like 122 indexed out of 134 submitted or something) and the number of indexed pages in the indexed status report is usually higher. So for example, one site says over 1000 pages indexed in the indexed status report but the sitemap says something like 122 indexed.
My question: Is the sitemap report always a subset of the URLs submitted in the sitemap? Will the number of pages indexed there always be lower than or equal to the URLs referenced in the sitemap?
Also, if there is a big disparity between the sitemap submitted URLs and the indexed URLs (like 10x) is that concerning to anyone else?
-
Unfortunately not, the closest you'll get is selecting a long period of time in Analytics and then exporting all the pages that received organic search traffic. If you could then cross check them with your list of URLs on your site it could provide you with a small list. But I would still check them in Google to make sure they aren't indexed. As I said it's not the best way.
-
Is there a reliable way to determine which pages have not been indexed?
-
Great answer by Tom already, but I want to add that probably images and other types of content whom are mostly not by default included in sitemaps could also be among the indexed 'pages'.
-
There's no golden rule that your sitemap > indexed pages or vice versa.
If you have more URLs in your sitemap than you have indexed pages, you want to look at the pages not indexed to see why that is the case. It could be that those pages have duplicate and/or thin content, and so Google is ignoring them. A canonical tag might be instructing Google to ignore them. Or the pages might be off the site navigation and are more than 4 links/jumps away from the homepage or another page on the site, make them hard to find.
Conversely, if you had lots more pages indexed than in your sitemap, it could be a navigation or URL duplication problem. Check to see if any of the pages are duplicate versions caused by things like dynamic URLs generated through search on the site or the site navigation, for example. If those pages are the only physical pages that you have created and you know every single one has been submitted in a sitemap - and so any other indexed URLs would be unaccounted for, that may well be cause for concern, so check nothing is being indexed multiple times.
Just a couple of scenarios, but I hope it helps.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Will adjusted bounce rate affect avg time on page?
I recently read Rob Beirne's piece on how and why to set up an adjusted bounce rate in Google Analytics (https://moz.com/blog/adjusted-bounce-rate). I am getting myself ready to talk to our site team about why we should set up an adjusted bounce rate and am anticipating some questions I'd like to be able to answer: 1. Will an adjusted bounce rate improve the accuracy of our avg time on page metrics? 2. Are we able to keep the unadjusted bounce rate in GA as well, so we can compare the two metrics if we ever need to? Does anyone know the answers to these questions? Any help would be much appreciated!
Reporting & Analytics | | seoisfun0 -
Webmaster Tools and Geolocation
Our site gets a significant amount of traffic from Venezuela, about 9% based on our Google Analytics (the US being 63%). According to Webmaster Tools we get 48% of our clicks from Venezuela. That discrepancy is massive, considering we are an English site. My theory is that they are not allowed/capable of using secure search in Venezuela for some legal reason and therefore their searches are much more 'visible'. Does anyone have any insight?
Reporting & Analytics | | evansluke0 -
Google Analytics is treating my blog like all the content is just on the home page.
Hello all, I installed Google Analytics on a main website and a blog (blog.travelexinsurance.com) While it appears to be tracking correctly (and when I test it in real time it shows that I'm visiting) but it is treating the entire blog as though it's one page. So I can't see data on blog post X. All I see is that X visitors came to my blog in aggregate. So I see blog.travelex.com has 999 visitors, but it doesn't show that /travel-luggage got 50 visits, while /insurace-tips got 75 and so forth. I assume I screwed up the tracking somehow, but can't figure out where I went wrong. Tracking on the main domain works just fine. It's specific to the blog.
Reporting & Analytics | | Patrick_G0 -
Referral Traffic vs. Campaign Traffic in Google Analytics
I have two sites: a blog and an ecommerce site. The blog funnels people to the ecommerce site. In Analytics I'm seeing declines in referral traffic from the blog to the ecommerce site. During the same time I'm seeing an increase in campaign traffic to the ecommerce site, with most campaign traffic coming from the blog. I believe the increase in campaign traffic is largely a result of simply having installed more tracking links. This leads me to believe that the declines I'm seeing in referral traffic is simply a result of the increase in campaign traffic. In other words, what was once counted and reported as being referral traffic is now being counted and reported as campaign traffic. So my question is this: In Google Analytics is campaign traffic ALSO reported as referral traffic, or is campaign traffic reported separately and not duplicated in referral traffic reports? I'll provide a concrete example to make this more clear in case it isn't: Say site X sends 1000 visits each month to site Y. Say 50 of those visits come from a single link on X. If that link is changed so that campaign Z data info added (via the Google URL Builder), would you expect to then see 950 referral visits each month from site X to site Y plus 50 campaign visits to site Y via new campaign Z, or would you continue to see 1000 referral visits plus the new 50 campaign visits? Many thanks in advance to anyone that can shed some light on this.
Reporting & Analytics | | aaronprimal0 -
How to Setup New Domain in Webmaster Tools? Can Anybody HELP?
I, I have a new domain, and a new account is now created in webmatser tools. Is there anythings I should know about using or setting up webmaster tools? For example, the geographic target, I'm in Canada our web site, a local Pilates studio, is a commercial web site to provides informations about our services. Should I select Canada? One thing about this, I have another domain, http://www.furnacefilterscanada.com/ and in the setiing, I have select geographic target to: Canada. When using google.ca result are better then when using google.com! I think must Canadian user are using google.com Does this ''Geographic Target'' setting has anythings to do with search results in Goolge.ca versus Google.com? Thank you for your help, BigBlaze
Reporting & Analytics | | BigBlaze2050 -
Why are Seemingly Randomly Generated URLs Appearing as Errors in Google Webmaster Tools?
I've been confused by some URLs that are showing up as errors in our GWT account. They seem to just be randomly generated alphanumeric strings that Google is reporting as 404 errors. The pages do 404 because nothing ever existed there or was linked to. Here are some examples that are just off of our root domain: /JEzjLs2wBR0D6wILPy0RCkM/WFRnUK9JrDyRoVCnR8= /MevaBpcKoXnbHJpoTI5P42QPmQpjEPBlYffwY8Mc5I= /YAKM15iU846X/ymikGEPsdq 26PUoIYSwfb8 FBh34= I haven't been able to track down these character strings in any internet index or anywhere in our source code so I have no idea why Google is reporting them. We've been pretty vigilant lately about duplicate content and thin content issues and my concern is that there are an unspecified number of urls like this that Google thinks exist but don't really. Has anyone else seen GWT reporting errors like this for their site? Does anyone have any clue why Google would report them as errors?
Reporting & Analytics | | kimwetter0 -
Google: show all images indexed on a domain
Is there a way to display all images that google has indexed on a domain / subdomain? I'm basically looking for something like a site:-command for google image search.
Reporting & Analytics | | jmueller0 -
Export all links on page with anchor text
As the question says really. Is there a tool that will export all the links on a page and anchor text related to those links into Excel? Thanks.
Reporting & Analytics | | RikkiD220