Webmaster Tools Indexed pages vs. Sitemap?
-
Looking at Google Webmaster Tools, I'm noticing a few things. On most sites I look at, the number of indexed pages in the Sitemaps report is usually less than 100% of the submitted URLs (e.g. 122 indexed out of 134 submitted), and the number of indexed pages in the Index Status report is usually higher. For example, one site shows over 1,000 pages indexed in the Index Status report, but the Sitemaps report shows only about 122 indexed.
My question: Is the indexed count in the Sitemaps report always a subset of the URLs submitted in the sitemap? Will the number of pages indexed there always be lower than or equal to the number of URLs referenced in the sitemap?
Also, if there is a big disparity between the URLs submitted in the sitemap and the indexed URLs (like 10x), is that concerning to anyone else?
-
Unfortunately not. The closest you'll get is to select a long date range in Analytics and export all the pages that received organic search traffic. If you then cross-check that export against the full list of URLs on your site, the pages missing from the export give you a short list of candidates that may not be indexed. I would still check those in Google to confirm they really aren't indexed. As I said, it's not the best way.
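For anyone who wants to script that cross-check, here is a minimal sketch in Python. It assumes you have the sitemap XML as a string and that your Analytics export gives full landing-page URLs (exports often give paths only, so you may need to prepend the domain first). The function names `sitemap_urls` and `pages_without_organic_traffic` and the example data are made up for illustration. A URL in the result only means no organic visit was recorded for it, not that it is definitely unindexed.

```python
import xml.etree.ElementTree as ET

# Namespace used by the standard sitemap protocol.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def sitemap_urls(sitemap_xml):
    """Return the set of <loc> URLs listed in a sitemap XML string."""
    root = ET.fromstring(sitemap_xml)
    return {
        loc.text.strip()
        for loc in root.findall("sm:url/sm:loc", SITEMAP_NS)
        if loc.text
    }


def pages_without_organic_traffic(sitemap_xml, organic_landing_pages):
    """Hypothetical helper: sitemap URLs that never appear as organic landing pages."""
    with_traffic = {url.strip() for url in organic_landing_pages}
    return sorted(sitemap_urls(sitemap_xml) - with_traffic)


# Made-up example data, for illustration only.
EXAMPLE_SITEMAP = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>http://www.example.com/</loc></url>
  <url><loc>http://www.example.com/about</loc></url>
  <url><loc>http://www.example.com/contact</loc></url>
</urlset>"""

EXAMPLE_ORGANIC = ["http://www.example.com/", "http://www.example.com/about"]

candidates = pages_without_organic_traffic(EXAMPLE_SITEMAP, EXAMPLE_ORGANIC)
print(candidates)
```

Each URL in `candidates` is still worth checking by hand in Google before you conclude it is not indexed.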
-
Is there a reliable way to determine which pages have not been indexed?
-
Great answer by Tom already, but I want to add that images and other types of content, which are mostly not included in sitemaps by default, could also be counted among the indexed 'pages'.
-
There's no golden rule that the number of URLs in your sitemap should be greater than your indexed pages, or vice versa.
If you have more URLs in your sitemap than you have indexed pages, look at the pages that are not indexed to see why. It could be that those pages have duplicate and/or thin content, and so Google is ignoring them. A canonical tag might be instructing Google to ignore them. Or the pages might be outside the site navigation and more than 4 links/jumps away from the homepage or another page on the site, making them hard to find.
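The "links/jumps away" check can be roughed out with a breadth-first search over your internal links. This is only a sketch, assuming you already have a crawl of the site as a dictionary mapping each page to the pages it links to. The names `click_depths` and `hard_to_find`, the example graph, and the use of 4 as a cut-off (taken from the rule of thumb above, not from any documented Google limit) are all illustrative.

```python
from collections import deque


def click_depths(links, homepage):
    """Breadth-first search: fewest clicks needed to reach each page from the homepage."""
    depths = {homepage: 0}
    queue = deque([homepage])
    while queue:
        page = queue.popleft()
        for target in links.get(page, []):
            if target not in depths:
                depths[target] = depths[page] + 1
                queue.append(target)
    return depths


def hard_to_find(links, homepage, sitemap_pages, max_depth=4):
    """Hypothetical helper: sitemap pages deeper than max_depth or not linked at all."""
    depths = click_depths(links, homepage)
    return sorted(
        page for page in sitemap_pages
        if depths.get(page, float("inf")) > max_depth
    )


# Made-up link graph: each key links to the pages in its list.
EXAMPLE_LINKS = {
    "/": ["/a"],
    "/a": ["/b"],
    "/b": ["/c"],
    "/c": ["/d"],
    "/d": ["/e"],
}
EXAMPLE_SITEMAP_PAGES = ["/", "/a", "/d", "/e", "/orphan"]

deep_pages = hard_to_find(EXAMPLE_LINKS, "/", EXAMPLE_SITEMAP_PAGES)
print(deep_pages)
```

Pages that come back from this are the ones to link more prominently, or to review for whether they belong in the sitemap at all.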
Conversely, if you have many more pages indexed than are in your sitemap, it could be a navigation or URL duplication problem. Check whether any of the indexed pages are duplicate versions caused by things like dynamic URLs generated through on-site search or the site navigation. If the pages in your sitemap are the only physical pages you have created, and you know every single one has been submitted, then any other indexed URLs are unaccounted for. That may well be cause for concern, so check that nothing is being indexed multiple times.
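One rough way to spot that kind of URL duplication is to strip query strings and trailing slashes and see which URLs collapse to the same address. The sketch below assumes you have some list of URLs you believe are indexed (from a crawl or manual checks, since Webmaster Tools does not hand you the full list). The names `normalize` and `duplicate_groups` and the example URLs are hypothetical. It is a heuristic: some query parameters, such as pagination, do lead to genuinely different pages.

```python
from collections import defaultdict
from urllib.parse import urlsplit, urlunsplit


def normalize(url):
    """Drop the query string and fragment, lowercase scheme/host, trim trailing slash."""
    parts = urlsplit(url)
    path = parts.path.rstrip("/") or "/"
    return urlunsplit((parts.scheme.lower(), parts.netloc.lower(), path, "", ""))


def duplicate_groups(indexed_urls):
    """Group URLs sharing a normalized form; keep only groups with more than one URL."""
    groups = defaultdict(list)
    for url in indexed_urls:
        groups[normalize(url)].append(url)
    return {key: urls for key, urls in groups.items() if len(urls) > 1}


# Made-up example URLs, for illustration only.
EXAMPLE_INDEXED = [
    "http://www.example.com/shoes",
    "http://www.example.com/shoes?sort=price",
    "http://www.example.com/shoes/?page=2",
    "http://www.example.com/about",
]

dupes = duplicate_groups(EXAMPLE_INDEXED)
print(dupes)
```

Any group with several variants is a candidate for a canonical tag or a parameter-handling fix.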
Just a couple of scenarios, but I hope it helps.