Google indexing wrong pages
-
We have a variety of issues at the moment, and need some advice.
First off, we have a HUGE indexing issue across our entire website.
Website in question: http://www.localsearch.com.au/
Firstly
In Google.com.au, if you search for 'plumbers gosford' (https://www.google.com.au/#q=plumbers+gosford), the wrong page appears - in this instance, the page ranking should be http://www.localsearch.com.au/Gosford,NSW/PlumbersI can see this across the board, across multiple locations.
Secondly
Recently I've seen Google reporting in 'Crawl Errors' in webmaster tools URLs such as:
http://www.localsearch.com.au/Saunders-Beach,QLD/Electronic-Equipment-Sales-Repairs&Sa=U&Ei=xs-XVJzAA9T_YQSMgIHQCw&Ved=0CIMBEBYwEg&Usg=AFQjCNHXPrZZg0JU3O4yTGjWbijon1Q8OAThis is an invalid URL, and more specifically, those query strings seem to be referrer queries from Google themselves: &Sa=U&Ei=xs-XVJzAA9T_YQSMgIHQCw&Ved=0CIMBEBYwEg&Usg=AFQjCNHXPrZZg0JU3O4yTGjWbijon1Q8OA
Here's the above example indexed in Google: https://www.google.com.au/#q="AFQjCNHXPrZZg0JU3O4yTGjWbijon1Q8OA"
Does anyone have any advice on those 2 errors?
-
Issue 1:
I think your intended ranking page is not indexed.
https://www.google.com/?gws_rd=ssl#q=site:http:%2F%2Fwww.localsearch.com.au
It's probably because, as Donna indicated, you have so many pages. This happens when you have what are essentially search pages that are indexed. Stuff happens like having a page for plumbing and plumbers in the same city, for example.
In the short term, you can make sure that non-indexed pages are linked to across the site. Long-term you're going to want to think of a way to organize your site to make sure Google and users can find the most important pages. For example, add breadcrumbs back to the city page, and have the city page linking to your most important types of pages (even if they're still searches) for the city. Right now your city pages are just more search pages, which is a big wasted opportunity to layout which pages you most want people to find. Also make sure you figure out what's going on between these two "types" of the exact same page. There should only be one for the same results where possible:
http://www.localsearch.com.au/Gosford,NSW
http://www.localsearch.com.au/Search?where=Gosford,NSW
Issue 2:
Look at the "linked from" and figure out where these bad pages are linked to on the site. Google wouldn't make up a URL if someone wasn't linking to them, and my guess is your site is causing them. With a highly-dynamic site like yours it's usually either a crawl trap or a combination of dynamic URLs through a particular path that the server wasn't expecting.
Alternatively, and maybe more likely, Google has been trying to parse Javascript lately, and doing a rather poor job of it. I've seen Google try to find links in Javascript that were never intended to be links. You can either ignore these errors and wait for Google to get better, or you can dig into the JS with a dev and see what's causing Google to interpret something as a link. There's usually another way to put the code together where Google understands.
-
Issue #1:
I think what you're doing is fine with canonicals. The problem (I think) might be all the duplicates. The page you're asking about (http://www.localsearch.com.au/Gosford,NSW/Plumbers) isn't indexed, yet ~5 million others are. Google is probably abandoning the site before all the relevant pages get indexed. You should look into removing duplicates like in the following examples:
-
http://www.localsearch.com.au/Australia
http://www.localsearch.com.au/Australia/ -
http://www.localsearch.com.au/Atherton,QLD
http://www.localsearch.com.au/Atherton,QLD/ -
http://www.localsearch.com.au/Albion-Park,NSW/Body-Ear-Piercing
http://www.localsearch.com.au/Albion-Park-Rail,NSW/Body-Ear-Piercing -
http://www.localsearch.com.au/Airlie-Beach,QLD/Breeze-Bar/profile/tSdO
http://www.localsearch.com.au/Airlie-Beach,QLD/Breeze-Bar/profile/tSdO.vcf
Issue #2:
Sounds like issue #1 and 2 are closely related. I think you're on the right path though. If it doesn't fix it, come back and ask again. You'll have eliminated some possibilities and can get a different perspective 2nd time round.
Good luck!
-
-
Issue #1
I'm not sure how else we would use them. The example given above (Gosford, NSW) is about 40KM (or around 20miles) from the page that is ranking (Wyong, NSW). In our business model, these are 2 separate markets. We wouldn't be able to canonical 1 to the other as they are completely separate.Issue #2
I believe the issue could be because we're displaying "search results" as static pages - this is something that I have my team working towards fixing by having "static" proximity based business listing pages (such as root.com/find/plumbers/state/city/suburb/) and having no-indexed search result pages (such as root.com/search?what=plumbers&where=suburb,state).The above may even fix issue #1, but I wanted to get some more information from a community as 2 minds are better than 1..
-
Issue #1
Neither of the results that Google has indexed when executing the site operator are duplicated pages - we also have canonical URLs setup on all pages to avoid duplicated URLs.You might not be using canonical tags to your advantage though. From what I can see, the canonical tags on pages just point to themselves as opposed to one master page that should be the catch-all for incoming links and social mentions.
With regards to the Title tags; unless there's a crowd of people agreeing with this, nearly everything I have found to try to prove this has fallen through - it seems having slightly similar title tags with brand name / locales included doesn't affect search results.
Some of the title tags you are using on pages are identical to one another, not "slightly similar". That's why I raised it.
Issue #2
_I don't believe this is the issue either as the actual pages still exist. _
Hm. I see. Those pages appear to be dynamically created, indexed, and canonicalized to themselves. Can you tag them as no-index?
-
Hi Donna, thanks for your reply.
Issue #1
Neither of the results that Google has indexed when executing the site operator are duplicated pages - we also have canonical URLs setup on all pages to avoid duplicated URLs.With regards to the Title tags; unless there's a crowd of people agreeing with this, nearly everything I have found to try to prove this has fallen through - it seems having slightly similar title tags with brand name / locales included doesn't affect search results.
Issue #2
I don't believe this is the issue either as the actual pages still exist.Thanks for your help though! Anything else you come up with, I'm open ears.
-
Issue #1:
You're right, you do seem to have a "variety of issues at the moment". The thing that stands out the most to me is duplicate content.
When I did a site search (site:http://www.localsearch.com.au/", Google indicates it has more than 5 million pages indexed on the site. When I did a site search for the specific URL in your example (site:http://www.localsearch.com.au/gosford,NSW/Plumbers), it found 2 results, neither of which the page in question. Yet your keywords were replicated in the page URLs, content, meta tags, and internal links. Google is probably having a heck of time figuring out which page to rank for what.
It also looks like you have your entire site replicated because URLs are indexed with and without a trailing "/".
Many of the title tags for Gosford pages are replicated containing "Gosford, NSW - LocalSearch" for example, www.localsearch.com.au/Gosford,NSW/Carriers-Light-Transport, www.localsearch.com.au/Gosford.../Radio-Communication-Equipment, www.localsearch.com.au/Gosford,NSW/Hair-Treatment-Replacement, www.localsearch.com.au/Gosford,NSW/Hobbies-Models-Accessories, www.localsearch.com.au/Gosford,NSW/Stone-Masons-Monumental, and so on. Can you see why Google might be confused.
That's probably the first thing you need to fix, duplicate content.
Issue #2:
This is a guess. These might be errors caused by pages that have been renamed or removed from the site and not properly redirected. Google can't find them. I'll be interested to hear if anyone else has any ideas.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Why is page still indexing?
Hi all, I have a few pages that - despite having a robots meta tag and no follow, no index, they are showing up in Google SERPs. In troubleshooting this with my team, it was brought up that another page could be linking to these pages and causing this. Is that plausible? How could I confirm that? Thanks,
Intermediate & Advanced SEO | | SSFCU
Sarah0 -
HTTPS pages - To meta no-index or not to meta no-index?
I am working on a client's site at the moment and I noticed that both HTTP and HTTPS versions of certain pages are indexed by Google and both show in the SERPS when you search for the content of these pages. I just wanted to get various opinions on whether HTTPS pages should have a meta no-index tag through an htaccess rule or whether they should be left as is.
Intermediate & Advanced SEO | | Jamie.Stevens0 -
Better for SEO to No-Index Pages with High Bounce Rates
Greeting MOZ Community: I operate www.nyc-officespace-leader.com, a New York City commercial real estate web site established in 2006. An SEO effort has been ongoing since September 2013 and traffic has dropped about 30% in the last month. The site has about 650 pages. 350 are listing pages, 150 are building pages. The listing and building pages have an average bounce rate of about 75%. The other 150 pages have a bounce rate of about 35%. The building and listing pages are dragging down click through rates for the entire site. My SEO firm believe there might be a benefit to "no-index, follow" these high bounce rate URLs. From an SEO perspective, would it be worthwhile to "no-index-follow" most of the building and listing pages in order to reduce the bounce rate? Would Google view the site as a higher quality site if I had these pages de-indexed and the average bounce rate for the site dropped significantly. If I no-indexed these pages would Google provide bette ranking to the pages that already perform well? As a real estate broker, I will constantly be adding many property listings that do not have much content so it seems that a "no-index, follow" would be good for the listings unless Google penalizes sites that have too many "no-index, follow" pages. Any thoughts??? Thanks,
Intermediate & Advanced SEO | | Kingalan1
Alan0 -
Why are some pages indexed but not cached by Google?
The question is simple but I don't understand the answer. I found a webpage that was linking to my personal site. The page was indexed in Google. However, there was no cache option and I received a 404 from Google when I tried using cache:www.thewebpage.com/link/. What exactly does this mean? Also, does it have any negative implication on the SEO value of the link that points to my personal website?
Intermediate & Advanced SEO | | mRELEVANCE0 -
New Web Page Not Indexed
Quick question with probably a straightforward answer... We created a new page on our site 4 days ago, it was in fact a mini-site page though I don't think that makes a difference... To date, the page is not indexed and when I use 'Fetch as Google' in WT I get a 'Not Found' fetch status... I have also used the'Submit URL' in WT which seemed to work ok... We have even resorted to 'pinging' using Pinglar and Ping-O-Matic though we have done this cautiously! I know social media is probably the answer but we have been trying to hold back on that tactic as the page relates to a product that hasn't quite launched yet and we do not want to cause any issues with the vendor! That said, I think we might have to look at sharing the page socially unless anyone has any other ideas? Many thanks Andy
Intermediate & Advanced SEO | | TomKing0 -
Indexing specified entry pages
Hi,We are currently working on location based info.Basically, when someone searches from Florida they will get specific Florida results and when they search from California they will specific California results.How does this location based info affect crawling and indexing?Lets say we have location info for googlebot, sometimes they crawl from a New York ip address, sometimes they do it from Texas and sometimes from California. In this case google will index 3 different pages with 3 different prices and a bit different text, and I'm afraid they might see these as some kind of cloaking or suspicious movement because we serve different versions of the page. What's the best way to handle this?
Intermediate & Advanced SEO | | SEODinosaur0 -
To index or not to index search pages - (Panda related)
Hi Mozzers I have a WordPress site with Relevanssi the search engine plugin, free version. Questions: Should I let Google index my site's SERPS? I am scared the page quality is to thin, and then Panda bear will get angry. This plugin (or my previous search engine plugin) created many of these "no-results" uris: /?s=no-results%3Ano-results%3Ano-results%3Ano-results%3Ano-results%3Ano-results%3Ano-results%3Akids+wall&cat=no-results&pg=6 I have added a robots.txt rule to disallow these pages and did a GWT URL removal request. But links to these pages are still being displayed in Google's SERPS under "repeat the search with the omitted results included" results. So will this affect me negatively or are these results harmless? What exactly is an omitted result? As I understand it is that Google found a link to a page they but can't display it because I block GoogleBot. Thanx in advance guys.
Intermediate & Advanced SEO | | ClassifiedsKing0 -
How do I index these parameter generated pages?
Hey guys, I've got an issue with a site I'm working on. A big chunk of the content (roughly 500 pages) is delivered using parameters on a dynamically generated page. For example: www.domain.com/specs/product?=example - where "example' is the product name Currently there is no way to get to these pages unless you enter the product name into the search box and access it from there. Correct me if I'm wrong, but unless we find some other way to link to these pages they're basically invisible to search engines, right? What I'm struggling with is a method to get them indexed without doing something like creating a directory map type page of all of the links on it, which I guess wouldn't be a terrible idea as long as it was done well. I've not encountered a situation like this before. Does anyone have any recommendations?
Intermediate & Advanced SEO | | CodyWheeler0