Google indexing wrong pages
-
We have a variety of issues at the moment, and need some advice.
First off, we have a HUGE indexing issue across our entire website.
Website in question: http://www.localsearch.com.au/
Firstly
In Google.com.au, if you search for 'plumbers gosford' (https://www.google.com.au/#q=plumbers+gosford), the wrong page appears - in this instance, the page ranking should be http://www.localsearch.com.au/Gosford,NSW/PlumbersI can see this across the board, across multiple locations.
Secondly
Recently I've seen Google reporting in 'Crawl Errors' in webmaster tools URLs such as:
http://www.localsearch.com.au/Saunders-Beach,QLD/Electronic-Equipment-Sales-Repairs&Sa=U&Ei=xs-XVJzAA9T_YQSMgIHQCw&Ved=0CIMBEBYwEg&Usg=AFQjCNHXPrZZg0JU3O4yTGjWbijon1Q8OAThis is an invalid URL, and more specifically, those query strings seem to be referrer queries from Google themselves: &Sa=U&Ei=xs-XVJzAA9T_YQSMgIHQCw&Ved=0CIMBEBYwEg&Usg=AFQjCNHXPrZZg0JU3O4yTGjWbijon1Q8OA
Here's the above example indexed in Google: https://www.google.com.au/#q="AFQjCNHXPrZZg0JU3O4yTGjWbijon1Q8OA"
Does anyone have any advice on those 2 errors?
-
Issue 1:
I think your intended ranking page is not indexed.
https://www.google.com/?gws_rd=ssl#q=site:http:%2F%2Fwww.localsearch.com.au
It's probably because, as Donna indicated, you have so many pages. This happens when you have what are essentially search pages that are indexed. Stuff happens like having a page for plumbing and plumbers in the same city, for example.
In the short term, you can make sure that non-indexed pages are linked to across the site. Long-term you're going to want to think of a way to organize your site to make sure Google and users can find the most important pages. For example, add breadcrumbs back to the city page, and have the city page linking to your most important types of pages (even if they're still searches) for the city. Right now your city pages are just more search pages, which is a big wasted opportunity to layout which pages you most want people to find. Also make sure you figure out what's going on between these two "types" of the exact same page. There should only be one for the same results where possible:
http://www.localsearch.com.au/Gosford,NSW
http://www.localsearch.com.au/Search?where=Gosford,NSW
Issue 2:
Look at the "linked from" and figure out where these bad pages are linked to on the site. Google wouldn't make up a URL if someone wasn't linking to them, and my guess is your site is causing them. With a highly-dynamic site like yours it's usually either a crawl trap or a combination of dynamic URLs through a particular path that the server wasn't expecting.
Alternatively, and maybe more likely, Google has been trying to parse Javascript lately, and doing a rather poor job of it. I've seen Google try to find links in Javascript that were never intended to be links. You can either ignore these errors and wait for Google to get better, or you can dig into the JS with a dev and see what's causing Google to interpret something as a link. There's usually another way to put the code together where Google understands.
-
Issue #1:
I think what you're doing is fine with canonicals. The problem (I think) might be all the duplicates. The page you're asking about (http://www.localsearch.com.au/Gosford,NSW/Plumbers) isn't indexed, yet ~5 million others are. Google is probably abandoning the site before all the relevant pages get indexed. You should look into removing duplicates like in the following examples:
-
http://www.localsearch.com.au/Australia
http://www.localsearch.com.au/Australia/ -
http://www.localsearch.com.au/Atherton,QLD
http://www.localsearch.com.au/Atherton,QLD/ -
http://www.localsearch.com.au/Albion-Park,NSW/Body-Ear-Piercing
http://www.localsearch.com.au/Albion-Park-Rail,NSW/Body-Ear-Piercing -
http://www.localsearch.com.au/Airlie-Beach,QLD/Breeze-Bar/profile/tSdO
http://www.localsearch.com.au/Airlie-Beach,QLD/Breeze-Bar/profile/tSdO.vcf
Issue #2:
Sounds like issue #1 and 2 are closely related. I think you're on the right path though. If it doesn't fix it, come back and ask again. You'll have eliminated some possibilities and can get a different perspective 2nd time round.
Good luck!
-
-
Issue #1
I'm not sure how else we would use them. The example given above (Gosford, NSW) is about 40KM (or around 20miles) from the page that is ranking (Wyong, NSW). In our business model, these are 2 separate markets. We wouldn't be able to canonical 1 to the other as they are completely separate.Issue #2
I believe the issue could be because we're displaying "search results" as static pages - this is something that I have my team working towards fixing by having "static" proximity based business listing pages (such as root.com/find/plumbers/state/city/suburb/) and having no-indexed search result pages (such as root.com/search?what=plumbers&where=suburb,state).The above may even fix issue #1, but I wanted to get some more information from a community as 2 minds are better than 1..
-
Issue #1
Neither of the results that Google has indexed when executing the site operator are duplicated pages - we also have canonical URLs setup on all pages to avoid duplicated URLs.You might not be using canonical tags to your advantage though. From what I can see, the canonical tags on pages just point to themselves as opposed to one master page that should be the catch-all for incoming links and social mentions.
With regards to the Title tags; unless there's a crowd of people agreeing with this, nearly everything I have found to try to prove this has fallen through - it seems having slightly similar title tags with brand name / locales included doesn't affect search results.
Some of the title tags you are using on pages are identical to one another, not "slightly similar". That's why I raised it.
Issue #2
_I don't believe this is the issue either as the actual pages still exist. _
Hm. I see. Those pages appear to be dynamically created, indexed, and canonicalized to themselves. Can you tag them as no-index?
-
Hi Donna, thanks for your reply.
Issue #1
Neither of the results that Google has indexed when executing the site operator are duplicated pages - we also have canonical URLs setup on all pages to avoid duplicated URLs.With regards to the Title tags; unless there's a crowd of people agreeing with this, nearly everything I have found to try to prove this has fallen through - it seems having slightly similar title tags with brand name / locales included doesn't affect search results.
Issue #2
I don't believe this is the issue either as the actual pages still exist.Thanks for your help though! Anything else you come up with, I'm open ears.
-
Issue #1:
You're right, you do seem to have a "variety of issues at the moment". The thing that stands out the most to me is duplicate content.
When I did a site search (site:http://www.localsearch.com.au/", Google indicates it has more than 5 million pages indexed on the site. When I did a site search for the specific URL in your example (site:http://www.localsearch.com.au/gosford,NSW/Plumbers), it found 2 results, neither of which the page in question. Yet your keywords were replicated in the page URLs, content, meta tags, and internal links. Google is probably having a heck of time figuring out which page to rank for what.
It also looks like you have your entire site replicated because URLs are indexed with and without a trailing "/".
Many of the title tags for Gosford pages are replicated containing "Gosford, NSW - LocalSearch" for example, www.localsearch.com.au/Gosford,NSW/Carriers-Light-Transport, www.localsearch.com.au/Gosford.../Radio-Communication-Equipment, www.localsearch.com.au/Gosford,NSW/Hair-Treatment-Replacement, www.localsearch.com.au/Gosford,NSW/Hobbies-Models-Accessories, www.localsearch.com.au/Gosford,NSW/Stone-Masons-Monumental, and so on. Can you see why Google might be confused.
That's probably the first thing you need to fix, duplicate content.
Issue #2:
This is a guess. These might be errors caused by pages that have been renamed or removed from the site and not properly redirected. Google can't find them. I'll be interested to hear if anyone else has any ideas.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
HTTP Pages Indexed as HTTPS
My site used to be entirely HTTPS. I switched months ago so that all links in the pages that the public has access to are now http only. But I see now that when I do a site:www.qjamba.com, the results include many pages with https in the beginning (including the home page!), which is not what I want. I can redirect to http but that doesn't remove https from the indexing, right? How do I solve this problem? sample of results: Qjamba: Free Local and Online Coupons, coupon codes ... **<cite class="_Rm">https://www.qjamba.com/</cite>**One and Done savings. Printable coupons and coupon codes for thousands of local and online merchants. No signups, just click and save. Chicnova online coupons and shopping - Qjamba **<cite class="_Rm">https://www.qjamba.com/online-savings/Chicnova</cite>**Online Coupons and Shopping Savings for Chicnova. Coupon codes for online discounts on Apparel & Accessories products. Singlehop online coupons and shopping - Qjamba <cite class="_Rm">https://www.qjamba.com/online-savings/singlehop</cite>Online Coupons and Shopping Savings for Singlehop. Coupon codes for online discounts on Business & Industrial, Service products. Automotix online coupons and shopping - Qjamba <cite class="_Rm">https://www.qjamba.com/online-savings/automotix</cite>Online Coupons and Shopping Savings for Automotix. Coupon codes for online discounts on Vehicles & Parts products. Online Hockey Savings: Free Local Fast | Qjamba **<cite class="_Rm">www.qjamba.com/online-shopping/hockey</cite>**Find big online savings at popular and specialty stores on Hockey, and more. Hitcase online coupons and shopping - Qjamba **<cite class="_Rm">www.qjamba.com/online-savings/hitcase</cite>**Online Coupons and Shopping Savings for Hitcase. Coupon codes for online discounts on Electronics, Cameras & Optics products. Avanquest online coupons and shopping - Qjamba <cite class="_Rm">https://www.qjamba.com/online-savings/avanquest</cite>Online Coupons and Shopping Savings for Avanquest. Coupon codes for online discounts on Software products.
Intermediate & Advanced SEO | | friendoffood0 -
JavaScript Issue? Google not indexing a microsite
We have a microsite that was created on our domain but is not linked to from ANYwhere EXCEPT within some Javascript elements on pages on our site. The link is in one JQuery slide panel. The microsite is not being indexed at all - when i do site:(microsite name) on Google, it doesn't return anything. I think it's because the link's only in a Java element, but my client assures me that if I submit to Google for crawling the problem will be solved. Maybe so, but my point is that if you just create a simple HTML link from at least one of our site pages, it will get indexed no problem. The microsite has been up for months and it's still not being indexed - another newer microsite that's been up for a few weeks and has simple links to it from our pages is indexing fine. I have submitted the URL for crawling but had to use the google.com/webmasters/tools/submit-url/ method as I don't have access to the top level domain WMT account. p.s. when we put the microsite URL into the SEOBook spider-test tool it returns lots of lovely information - but that just tells me the page is findable, does exist, right? That doesn't mean Google's going to necessarily index it, as I am surmising...Moz hasn't found in the 5 months the microsite has been up and running. What's going on here?
Intermediate & Advanced SEO | | Jen_Floyd0 -
How do I know what pages of my site is not inedexed by google ?
Hi I my Google webmaster tools under Crawl->sitemaps it shows 1117 pages submitted but 619 has been indexed. Is there any way I can fined which pages are not indexed and why? it has been like this for a while. I also have a manual action (partial) message. "Unnatural links to your site--impacts links" and under affects says "Some incoming links" is that the reason Google does not index some of my pages? Thank you Sina
Intermediate & Advanced SEO | | SinaKashani0 -
How can I see all the pages google has indexed for my site?
Hi mozers, In WMT google says total indexed pages = 5080. If I do a site:domain.com commard it says 6080 results. But I've only got 2000 pages in my site that should be indexed. So I would like to see all the pages they have indexed so I can consider noindexing them or 404ing them. Many thanks, Julian.
Intermediate & Advanced SEO | | julianhearn0 -
Adding Orphaned Pages to the Google Index
Hey folks, How do you think Google will treat adding 300K orphaned pages to a 4.5 million page site. The URLs would resolve but there would be no on site navigation to those pages, Google would only know about them through sitemap.xmls. These pages are super low competition. The plot thickens, what we are really after is to get 150k real pages back on the site, these pages do have crawlable paths on the site but in order to do that (for technical reasons) we need to push these other 300k orphaned pages live (it's an all or nothing deal) a) Do you think Google will have a problem with this or just decide to not index some or most these pages since they are orphaned. b) If these pages will just fall out of the index or not get included, and have no chance of ever accumulating PR anyway since they are not linked to, would it make sense to just noindex them? c) Should we not submit sitemap.xml files at all, and take our 150k and just ignore these 300k and hope Google ignores them as well since they are orhpaned? d) If Google is OK with this maybe we should submit the sitemap.xmls and keep an eye on the pages, maybe they will rank and bring us a bit of traffic, but we don't want to do that if it could be an issue with Google. Thanks for your opinions and if you have any hard evidence either way especially thanks for that info. 😉
Intermediate & Advanced SEO | | irvingw0 -
How can we get a site reconsidered for Google indexing?
We recently completed a re-design for a site and are having trouble getting it indexed. This site may have been penalized previously. They were having issues getting it ranked and the design was horrible. Any advise on how to get the new site reconsidered to get the rank where it should be? (Yes, Webmaster Tools is all set up with the sitemap linked) Many thanks for any help with this one!
Intermediate & Advanced SEO | | d25kart0 -
404'd pages still in index
I recently launched a site and shortly after performed a URL rewrite (not the greatest idea, i know). The developer 404'd the old pages instead of a permanent 301 redirect. This caused a mess in the index. I have tried to use Google's removal tool to remove these URL's from the index. These pages were being removed but now I am finding them in the index as just URL's to the 404'd page (i.e. no title tag or meta description). Should I wait this out or now go back and 301 redirect the old URL's (that are 404'd now) to the new URL's? I am sure this is the reason for my lack of ranking as the rest of my site is pretty well optimized and I have some quality links.
Intermediate & Advanced SEO | | mj7750 -
Pages un-indexed in my site
My current website www.energyacuity.com has had most pages indexed for more than a year. However, I tried cache a few of the pages, and it looks the only one that is now indexed by Goggle is the homepage. Any thoughts on why this is happening?
Intermediate & Advanced SEO | | abernatj0