Webmaster Tools, why does it show 486 pages submitted to web, and only 40 indexed?
-
I am confused on what a client account shows in WMTs, client account is http://multiview.com. They have a graph showing 486 pages submitted to web, but only 40 are indexed. Also, they recently re-launched, i.e in April 2014, and the new site has about 40 pages indexed.... so I am guessing that the 486 number relates to all the pages that are showing errors in retrieving...i.e. 28 soft 404 errors, 10 access denied errors, 808 not found errors.
Does this make sense to explain why there is such a gap between 486 and 40?
-
If you're looking at the page that shows the sitemap submitted, then it's showing how many pages from the sitemap are indexed. Be sure to look at the Index Status to see the total indexed there. It might be different...
HOW CAN THAT BE? You say....
Here's a problem a client of mine ran into recently. Their sitemap generator was outputting all URLs in lower case. The site itself linked to all URLs in mixed case. TECHNICALLY, /happyhour.html and /HappyHour.html are different URLs....whether or not your webserver is set up to be case-sensitive. But what Google saw was a list of 160,000 URLs (all lowercase) in the sitemap that it never saw itself when it crawled the site. However, it found 160,000 OTHER URLs (all mixed case) when crawling the site, and indexed those just fine.
This could be what you're seeing.
If it's not, then I'd say Google really doesn't like the content on the majority of those pages (or they're 404'ing, 500'ing, or blocked by robots.txt or something like that).
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How to Diagnose "Crawled - Currently Not Indexed" in Google Search Console
The new Google Search Console gives a ton of information about which pages were excluded and why, but one that I'm struggling with is "crawled - currently not indexed". I have some clients that have fallen into this pit and I've identified one reason why it's occurring on some of them - they have multiple websites covering the same information (local businesses) - but others I'm completely flummoxed. Does anyone have any experience figuring this one out?
Reporting & Analytics | | brettmandoes2 -
Switch to www from non www preference negatively hit # pages indexed
I have a client whose site did not use the www preference but rather the non www form of the url. We were having trouble seeing some high quality inlinks and I wondered if the redirect to the non www site from the links was making it hard for us to track. After some reading, it seemed we should be using the www version for better SEO anyway so I made a change on Monday but had a major hit to the number of pages being indexed by Thursday. Freaking me out mildly. What are people's thoughts? I think I should roll back the www change asap - or am I jumping the gun?
Reporting & Analytics | | BrigitteMN0 -
Google Analytics Organic search queries aren't being updated, even though I'm still seeing results in all our typical results pages.
We pushed some new changes to the site and Google Analytics is no longer updating the Organic Search queries listing, even though traffic is consistent and and we're still landing results in all our typical keyword searches. Any ideas?
Reporting & Analytics | | unclekaos0 -
Free Online XMl Site Map Creators up sites over 1000 pages
Does anybody know of a XML site map creator that is free for websites over 1000 pages?
Reporting & Analytics | | AppleCapitalGroup0 -
Indexed URLs in Webmaster Tools
Hi everybody! I've been looking at my Webmaster Tools stats, and it looks like not all the URLs in the sitemap tree have been indexed, according to WMT at least. Is this reliable, and if so, is it worth investigating further? | Sitemap | Status | Type | Downloaded | URLs submitted | URLs in web index |
Reporting & Analytics | | neooptic
| | /ISitemap1.xml | | Sitemap | Dec 15, 2011 | 2,000 | 1,309 |
| | /isitemap.xml | | Index | Dec 15, 2011 | 8,695 | 4,127 |
| | /isitemap2.xml | | Sitemap | Dec 15, 2011 | 2,000 | 998 |
| | /isitemap3.xml | | Sitemap | Dec 15, 2011 | 2,000 | 819 |
| | /isitemap4.xml | | Sitemap | Dec 15, 2011 | 2,000 | 719 |
| | /isitemap5.xml | | Sitemap | Dec 15, 2011 | 695 | 282 | Thanks!0 -
Why are Seemingly Randomly Generated URLs Appearing as Errors in Google Webmaster Tools?
I've been confused by some URLs that are showing up as errors in our GWT account. They seem to just be randomly generated alphanumeric strings that Google is reporting as 404 errors. The pages do 404 because nothing ever existed there or was linked to. Here are some examples that are just off of our root domain: /JEzjLs2wBR0D6wILPy0RCkM/WFRnUK9JrDyRoVCnR8= /MevaBpcKoXnbHJpoTI5P42QPmQpjEPBlYffwY8Mc5I= /YAKM15iU846X/ymikGEPsdq 26PUoIYSwfb8 FBh34= I haven't been able to track down these character strings in any internet index or anywhere in our source code so I have no idea why Google is reporting them. We've been pretty vigilant lately about duplicate content and thin content issues and my concern is that there are an unspecified number of urls like this that Google thinks exist but don't really. Has anyone else seen GWT reporting errors like this for their site? Does anyone have any clue why Google would report them as errors?
Reporting & Analytics | | kimwetter0