Why does Google say they have more URLs indexed for my site than they really do?
-
When I do a site search with Google (i.e. site:www.mysite.com), Google reports "About 7,500 results" -- but when I click through to the end of the results and choose to include omitted results, Google really has only 210 results for my site.
I had an issue months back with a large # of URLs being indexed because of query strings and some other non-optimized technicalities - at that time I could see that Google really had indexed all of those URLs - but I've since implemented canonical URLs and fixed most (if not all) of my technical issues in order to get our index count down.
At first I thought it would just be a matter of time for them to reconcile this, perhaps they were looking at cached data or something, but it's been months and the "About 7,500 results" just won't change even though the actual pages indexed keeps dropping!
Does anyone know why Google would be still reporting a high index count, which doesn't actually reflect what is currently indexed?
Thanks!
-
It seems like you are taking the correct steps. I'm guessing those pages were tossed in to the supplementary index (as they were most likely were dupes) and I beleive by tweaking your robots.txt files, over time, these should be removed.
Another thing to do is inform Google on what to do with those parameters inside webmaster tools:
Configuration => URL parameters
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How long does google takes to crawl a single site ?
lately i have been thinking , when a crawler visits an already visited site or indexed site, whats the duration of its scanning?
Algorithm Updates | | Sam09schulz0 -
Folders or no folders in url?
What's best for SEO: a folder or no folder? For example: https://domain.com/arizona-dentist/somecontent or just https://domain.com/somecontent. The website has 100+ pages with "dentist" within the content of the somecontent pages, as well as specific pages for /arizona-dentist/. Also, the breadcrumb for the somecontent page would appear something like follows: Arizona Dentist > Some Content ... you can find the somecontent page from the Arizona Dentist page. I didn't include folders in the path because I did not want the url to be too long. In terms of where it is showing up on google search results...it is within the top 3-4 on the first page when searching Arizona dentist come content. The website is pretty organized even without subfolders because it was made using Umbraco. I am wondering if using folders will increase the SEO ranking, or if it really doesn't and could hurt it if paths become too long; especially since it's not doing too bad in the search ranking right now. -Thanks in advance for any help.
Algorithm Updates | | bellezze0 -
Keywords in Paragraphs: How much do they matter at Google?
Hi all, Generally we care a lot about keywords at headings, title tags, URL, etc. always. But I wonder how much impact they have being in paragraphs. How much do they matter at paragraphs? Thanks
Algorithm Updates | | vtmoz0 -
Delay between being indexed and ranking for new pages.
I've noticed with the last few pages i've built that there's a delay between them being indexed and them actually ranking. Anyone else finding that? And why is it like that? Not much of an issue as they tend to pop up after a week or so, but I am curious. Isaac.
Algorithm Updates | | isaac6630 -
New site or subdomain
what are pros and cons of launching a new product site as opposed to placing it under a subdomain of the company site? will the new site be placed in the google sandbox? the main goal is to provide credibility for the product, and by placing it under the company site that has been live for over 10 years. It is not a consumer product - more dealers. So people would be pushed to the site or find it through the brochure.
Algorithm Updates | | bakergraphix_yahoo.com0 -
Site name appended to page title in google search
Hi there, I have a strange problem concerning how the search results for my site appears in Google. The site is Texaspoker.dk and for some strange reason that name is appended at the end of the page title when I search for it in Google. The site name is not added to the page titles on the site. If I search in Google.dk (the relevant search engine for the country I am targeting) for "Unibet Fast Poker" I get the following page title displayed in the search results: Unibet Fast Poker starter i dag - få €10 og prøv ... - Texaspoker.dk If you visit the actual page you can see that there is no site name added to the page title: http://www.texaspoker.dk/unibet-fast-poker It looks like it is only being appended to the pages that contains rich snippets markup and not he forum threads where the rich snippets for some reason doesn't work. If I do a search for "Afstemning: Foretrukne TOPS Events" the title appears as it should without the site name being added: Afstemning: Foretrukne TOPS Events Anybody have any experience regarding this or an idea to why this is happening? Maybe the rich snippets are automatically pulling the publisher name from my Google+ account... edited: It doesn't seem to have anything to do with rich snippets, if I search for "Billeder og stuff v.2" the site name is also appended and if I search for "bedste poker bonus" the site name is not.
Algorithm Updates | | MPO0 -
Google seems to have penalised one section of our site? Is that possible?
We have a page rank 5 website and we launched a new site 6 months ago in February. Initially we had horrible urls with a bunch of numbers and stuff and we since changed them to lovely human readable urls. This had an excellent effect across the site except on one section of the site: http://www.allaboutcareers.com/careers/graduate-employers Although Google has indexed these pages and several have a PR 2 they do not appear in Google when previously they were on page 1 when we had the old urls. We figured we just needed some time for Google to get used to it, but it hasn't done anything. It is also worth mentioning we changed the page titles from: FIRM NAME | DOMAIN NAME then... FIRM NAME | Graduate Scheme, Jobs, Internships & Apprenticeships | DOMAIN NAME then.. FIRM NAME | Graduate Scheme, Jobs, Internships & Apprenticeships Do you think these are being penalised? There are two types of page: Example A: http://www.allaboutcareers.com/careers/graduates/addleshaw-goddard.htm Example B: http://www.allaboutcareers.com/careers/graduates/accenture.htm
Algorithm Updates | | jack860 -
Google results on an Ipad 2
Has anyone else seen different google organic results for a site when viewing on an Ipad compared to computer browser ? I've just checked a site and were no1 on google when searched on the Ipad 2 but when searched on my Macbook we are page 2 ? Could this just be different data centers or do google serve up different results to the 2 devices ? Would be really interested to know if anyone else has seen this. JP
Algorithm Updates | | Prongo0