What does this mean about my site index?
-
How should I go about fixing this? See image.
-
How do I find out from Google webmaster toolkit all the pages google has indexed of our site?
-
It means your website is creating a lot of different URLs. However, Google is deeming them as low quality (perhaps duplicates or near duplicates) and choosing not to index them.
I would look at these two options first:
- Prevent any unecessary URLs from being created
- Restrict crawl access through robots.txt
You also need to figure out, how many pages does your site actually have? Should you have significantly more or significantly less than 3,400 URLs in the index?
If you should have more than 3,400 URLs, I'd suggest making multiple sitemaps based on site sections. This will allow you to see what sections are having problems with indexation.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
PDF best practices: to get them indexed or not? Do they pass SEO value to the site?
All PDFs have landing pages, and the pages are already indexed. If we allow the PDFs to get indexed, then they'd be downloadable directly from google's results page and we would not get GA events. The PDFs info would somewhat overlap with the landing pages info. Also, if we ever need to move content, we'd now have to redirects the links to the PDFs. What are best practices in this area? To index or not? What do you / your clients do and why? Would a PDF indexed by google and downloaded directly via a link in the SER page pass SEO juice to the domain? What if it's on a subdomain, like when hosted by Pardot? (www1.example.com)
Reporting & Analytics | | hlwebdev1 -
Using Site Maps Correctly
Hello I'm looking to submit a sitemap for a post driven site with over 5000 pages. The site hasn't got a sitemap but it is indexed by google - will submitting a sitemap make a difference at this stage? Also, most free sitemap tools only go up to 5000 pages, and I'm thinking I would try a sitemap using a free version of the tool before I buy one - If my site is 5500 pages but I only submit a sitemap for 5000 (I have no control of which pages get included in the sitemap) would this have a negative effect for the pages that didn't get included? Thanks
Reporting & Analytics | | wearehappymedia0 -
Not many pages being indexed on google
Hi I am putting in to Google: site:www.mysite.com to see the pages listed on Google - the figure Google is coming back with is much lower than the actual pages, I have no crawer warning etc... What could the problem be? Thanks
Reporting & Analytics | | acumenadagency0 -
What is the impact of a panda refresh on a Pandalized site?
When a panda refresh hits and you have a pandalized site, If the site were to de-pandalized, would you see traffic back to pre-panda levels right away? Or any type of movement right away?
Reporting & Analytics | | jessefriedman0 -
Indexed URLs in Webmaster Tools
Hi everybody! I've been looking at my Webmaster Tools stats, and it looks like not all the URLs in the sitemap tree have been indexed, according to WMT at least. Is this reliable, and if so, is it worth investigating further? | Sitemap | Status | Type | Downloaded | URLs submitted | URLs in web index |
Reporting & Analytics | | neooptic
| | /ISitemap1.xml | | Sitemap | Dec 15, 2011 | 2,000 | 1,309 |
| | /isitemap.xml | | Index | Dec 15, 2011 | 8,695 | 4,127 |
| | /isitemap2.xml | | Sitemap | Dec 15, 2011 | 2,000 | 998 |
| | /isitemap3.xml | | Sitemap | Dec 15, 2011 | 2,000 | 819 |
| | /isitemap4.xml | | Sitemap | Dec 15, 2011 | 2,000 | 719 |
| | /isitemap5.xml | | Sitemap | Dec 15, 2011 | 695 | 282 | Thanks!0 -
Should we add the city to our keywords for a site that is only local?
This is one of those things I have done for a long time and all of a sudden asked myself was it necessary: For our local clients, we add the city name (Houston, KC, Birmingham) after each keyword. An example would be TestSite.com/big-tester-houston A Title Tag might be Big Tester Houston | Test Site, etc. Where appropriate we do the same with H1 or H2's and occasionally in the content we will use the city name. The thought being that since the site is only for a given city, it will be deemed more relevant than a site from outside.( I understand there are other factors in SEO; this is a specific question around adding the city). Yes, we also optimize with local directories/citation sites. Is this overkill, is it even worthwhile? Is there any evidence one way or another? I would love some strong opinions backed up with something other than anecdotal evidence where possible.
Reporting & Analytics | | RobertFisher0 -
How to Refesh site comapign?
How to Refesh site comapign? its displaying 3 days old data. now fixed some contents. unable to test it. kindly guide me for howto refresh the report?
Reporting & Analytics | | peanut20100 -
Meta Robots Tag - What's it really mean?
I used on a handful of pages recently and noticed that they're still popping up in the Google search index. I'd like to keep these from appearing, so I figured I needed a directive statement with stronger semantic meaning. From what I understand, is what I'm looking for. Using this will keep Google from not only crawling the page, but indexing the page, as well. I decided to see what the official robotstxt.org website said about it, so I checked (link here): the NOFOLLOW directive only applies to links on this page. It's entirely likely that a robot might find the same links on some other page without a NOFOLLOW (perhaps on some other site), and so still arrives at your undesired page. So, is their explanation saying that the page itself will be indexed, but the content / links on it won't be followed / indexed? Let me hear your thoughts, mozzers.
Reporting & Analytics | | mudbugmedia0