Google Index Issue - Indexing pages that don't exhist
-
Hi All,
I have noticed a weird issue when performing a search on Google to show me all the pages it is indexing of our site.
site:www.one2create.co.uk
It brings up most of our website pages but then is also brings up a few HTTPS urls (our site has not been converted to HTTPS yet) but also the URL path, Title, and Meta Description are from one of our clients websites (an Automotive Job site). When clicked they take you to a generic 404 server error page, not our branded 404 page.
The site that it has taken the url, title and meta description from is on a different server completely so I don't see how it has even managed to get that information and linked it to our site?
Has anyone seen anything like this before? And what is the best way to fix it? We have asked Google to re-index the site but still no luck.
-
Although being hacked seems like the only reason I can think of at the moment, I am still scratching my head! Its only showing the HTTPS urls in the Google search results. Not in Bing or Yahoo etc.
I can't see any of the HTTPS urls when I perform a crawl of the site in programs such as Screaming Frog. I could understand if we were hacked that it would show spammy results with dodgy links but the results aren't spammy, the Titles and Descriptions are from one of our customers websites? Their site has not been hacked though.
-
This would make the most sense. We we're hacked a few weeks ago. We had cleaned up the website completely the day we found the issue (uploaded a completely clean version of the whole site to the server). We then installed Wordfence (similar to Sucuri) to protect us from similar things in the future.
I am unsure if we have fixed the issue and it will just take time to correct itself or if the issue is still there?
I will do the redirects though as hadn't done that yet
Thanks for the info!
-
Hello, I'm sorry to be the bearer of bad news, but it looks like your website has been hacked or compromised. I've done some Google searches and one of the first items found was one of your portfolio items which has a redirect to a deceptive site. The portfolio item in question is titled "Callidus Consulting."
The hacker has likely redirected or altered some of your content without your knowledge and it has caused your listing titles and descriptions to be thrown off.
I recommend you do a full security audit and clean up. To start, install the Sucuri plugin to scan, lock down and harden your site against future hacking attempts.
Also, you should consider redirecting https to http because these URLs are indexed. Setup temporary redirects to your homepage for the broken URLs that are indexed so that you don't lose any traffic. Best of luck.
-
Thanks for the response. Those are not the URLs that I am referring to though. Those URLs in your crawl are from a Instagram Plugin which is pulling in Images from an outsourced URL. Those URLS are working fine, not returning a 404 error and not listed on our One2Create.co.uk domain name.
I have attached a screenshot of what I am seeing in Google which shouldn't be...
The listed Page in Google doesn't exist. We are using a Wordpress CMS system. The page is not in the backend, in the sitemap.xml file etc.
-
Those seem to be Google+ and Instagram CDN sources that are https originally from what I can see doing a quick crawl of your site.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Long list or paginated pages
Hi peeps, I am just interested in this from a usability POV and to see what you would prefer to see when you are met with a page that has multiple options. Lets say that the page looks like a list of services, each clearly marked out in its own segment, but there are 50-60 options that match your requirements. Do you like to keep scrolling, or would you prefer to take what is there and then move on if you feel you want to dig deeper? Would you like to see a long list, of have the options loaded in as you get to them? -Andy
Search Behavior | | Andy.Drinkwater2 -
Is it better to find a page without the desired content, or not find the page?
Are there any studies that show which is best? If you find my page but not the specific thing you want on it, you may still find something of value. But, if you don't you may associate my site with poor results, which can be worse than finding what you want at a competitor site. IOW maybe it is best to have pages that ONLY and ALWAYS have the content desired. What do the studies suggest? I'm asking because I have content that maybe 1/3 of the time exists and 2/3 of the time doesn't...think 'out of stock' products. So, I'm wondering if I should look into removing the page from being indexed during the 2/3 or should keep it. If I remove it then my concern is whether I lose the history/age factor that I've read Google finds important for credibility. Your thoughts?
Search Behavior | | friendoffood0 -
& And + symbols - How does Google read these?
How does/can Google read the plus and ampersand symbols? For instance if I optimise a webpage for 'black & white football' and another page for 'black + white football' would there be any difference in ranking position? Does Google take notice of such symbols or will it only pick up on the keywords 'black white football'?
Search Behavior | | Adam_SEO_Learning0 -
Personalised Geo-targeted results - How does Google pass link juice?
Hello, Many websites now serve specific home page offers based on the location of the customer, my question is, how does link juice flow around a site when the links (this case from the homepage) are served up based on a visitors location? Internal links from your homepage are valuable for ranking that product well in the SERPs so how does Google deal with this? So, for example, a car hire website based in the UK. If you arrive on the care hire website sat in Manchester (Northern UK city), on the homepage the website serves offers of car hire deals in Manchester, Leeds, London and international destinations. If you arrived on this website from London (Southern UK City), you would not see the Manchester link at all but London, and other cities in the South. In this case, when Google crawls the car hire website, it will see internal links but a)which version and b) is there any way of sharing this link value around? Basically, we want to understand if Manchester in this case will get the benefit of an internal homepage link from Google even though we only show Manchester to people FROM Manchester, OR, do Google only give juice based on one version of the website, a generic UK version? Or to put it another way, is there any way of cashing in on both geo-targetting the customer based on their location AND getting link juice from those geo-specific home page links? Perhaps there is some code or way of telling Google that people from Manchester (a certain % of our visitors) will see a homepage internal link for Manchester that will pass some small % link value?
Search Behavior | | xoffie0 -
Blog posts not getting indexed and being outranked by scrapper sites.
Our Google traffic has dropped significantly over the last year and now we're struggling to even get our blog posts indexed. It's been extremely discouraging and we're trying to do what ever we can to fix it. I've included a screenshot of our Google traffic as well as Pages Indexed according to WebmasterTools. http://i.imgur.com/Wu1D8.jpg The Problem Our blog posts are frequently not getting indexed. Many times they are outranked by low authority scraper sites, our Twitter/FB account, etc. Sometimes our homepage will rank instead of the blog post. Sometimes we'll break a news story, get tons of quality backlinks, and still be nowhere in Google. Pretty much the only Google traffic we see is from existing posts. Still 3,200 pages indexed when we have only 1,600 posts. I guess this isn't really a problem... just waiting for the meta noindex to take effect. More details We've seen no duplicate content or other warnings from WebmasterTools. We've been constantly acquiring quality backlinks from credible sites. We deleted the useless content and fixed the canonical issues that were a result of switching servers. History Our site is a news/entertainment blog. The traffic usually has spikes depending on what's going on in the news. Nov 1, 2011 - Site kept maxing out at 30k+ visits so we switched servers. Jan 30, 2012 - Hired a writer so we could focus on other aspects of the site. Apr 19, 2012 - Noticed our posts weren't getting indexed like they used to. Suspected our writer was spinning articles but couldn't find any evidence. 90% of our blog posts were nowhere to be found in Google. Scrapper sites would outrank us for our own stories... even our Twitter account was ranking ahead of us. IF our story would show up in Google it would usually be the home page instead of the blog post. Sep 2012 - Finally got more serious about addressing the problem. Noticed a couple potentially big problems and started making changes. Canonical Issues non-www site didn't redirect to www. It showed 2 different link profiles according to OpenSiteExplorer and 0 backlinks according to Webmaster Tools. Wordpress shortlinks weren't redirecting to the actual permalink. For instance http://www.domain.com/?p=123 and http://www.domain.com/post-example were both getting indexed. For every post there were 4 different versions that Google had to choose from. http://domain.com/?p=123, http://www.domain.com/?p=123, http://domain.com/post-example, and http://www.domain.com/post-example I figured the canonical issues must have happened when we switched servers which was the reason for the drop in WebmasterTools indexed pages and increase in Not Selected pages. FIXED (Sep 15): One we fixed the canonical issue the Indexed Pages went back up however the Not Selected is still the same. Duplicate Content When we first created our site we wanted to have tons of images for each musician/athlete/actor/etc. so we uploaded about 5-10 for each person. We created a blog post for each image with no writing and the exact same post titles. As a result there were TONS of low-quality, similar posts, with virtually identical permalinks. e.g. http://www.domain.com/james-smith1, http://www.domain.com/james-smith2, http://www.domain.com/james-smith3, etc. A crawl on Sep 26 showed over 550 duplicate content warnings. FIXED (Oct 1): We deleted/301 redirected the useless pages (they weren't getting traffic anyways) and by the next crawl the number was almost to 0... which it's at now. We also had TONS of tags (since there're constantly new names in the media) that were getting indexed so we had meta robots noindex them. Questions: Why aren't a majority of our posts getting indexed? Were we penalized or just stuck because of a filter? How long should it take for meta robots to noindex the tags pages? (I did it on Sep 25 but they are still there) If a site is scraping our content (same title, image, excert) but linking to us, should we contact them and tell them to remove it? Is there anything else we need to do start getting our blog posts indexed like they used to? Should we try contacting Google to re-evaluate our site? Sorry, that was a LOT of writing. If anyone wants the URL please let me know so I can PM it to you. Any help would be greatly appreciated! Wu1D8.jpg
Search Behavior | | gfreeman230 -
Forced Page Views and Search Engines?
I have a website that was built for the primary purpose of showing HTML 5 capabilities. With this, we have to create forced page views within analytics in order to receive any data about consumer behavior on the site. Are search engines viewing these forced page views as actual webpages? Does it even effect SEO efforts?
Search Behavior | | HughesDigital0 -
Google Query Contamination.... Are you seeing this??
I have been searching on Google.com with FireFox this morning. I searched for a state name.... like "georgia". Then I search for a product... like "guitars". For the Guitars query the first page of the SERPs include some music stores in georgia. So, Google is contaminating your search results with information from your most recent query. Are you seeing this too? Has Google been doing this for a long time? Or, I am going crazy?
Search Behavior | | EGOL3 -
Interesting keyword ranking issue
Hello Everybody, Thanks for taking the time to read this post. Without further ado, I'll jump straight to it: http://www.dataclinic.co.uk is the web site of a UK based data recovery company. Historically the site has always ranked well for popular data recovery keywords in the UK, with page 1 rankings for most things data recovery related. However, lately things seem to have changed for our most important phrase "data recovery". We noticed several months ago that Google had started to favour the page http://www.dataclinic.co.uk/data-recovery.htm instead of http://www.dataclinic.co.uk/ when a search for "data recovery" (and similar) was performed. This didn't concern us that much as our rankings remained good. However now, neither of these pages seems to be ranking well when a search for "data recovery" is performed (I gave up at Page 5 - who looks past there when searching?). I would appreciate your input on this please - especially about the following points: 1. Why have these two pages now seemingly disappeared from SERPS when a search for "data recovery" is performed ? 2. Why has Google chosen http://www.dataclinic.co.uk/data-recovery.htm rather than http://www.dataclinic.co.uk ? 3. Is this just something to do with UK results ? 4. Other sites I would expect NOT to see in the top results have started appearing - despite their link profiles etc remaining poor - perhaps Google is doing a bit of reorganisation with SERPS related to data recovery at the moment ? 5. And perhaps, most importantly - do you think we need to do anything about our current lack of visibility ?? As I mentioned, we've always ranked well, so these results are puzzling... Should search results revert "back to normal" in a day or so, or am I missing something and need to take action ?? Thanks for any input on this - we would be very grateful indeed for you help ! Kind Regards, Sue
Search Behavior | | 3Amigos0