Blog posts not getting indexed and being outranked by scraper sites.
-
Our Google traffic has dropped significantly over the last year, and now we're struggling to even get our blog posts indexed. It's been extremely discouraging, and we're trying to do whatever we can to fix it.
I've included a screenshot of our Google traffic as well as Pages Indexed according to WebmasterTools.
The Problem
- Our blog posts are frequently not getting indexed.
- Many times they are outranked by low-authority scraper sites, our own Twitter/FB accounts, etc.
- Sometimes our homepage will rank instead of the blog post.
- Sometimes we'll break a news story, get tons of quality backlinks, and still be nowhere in Google.
- Pretty much the only Google traffic we see is from existing posts.
- Still 3,200 pages indexed when we have only 1,600 posts. I guess this isn't really a problem... just waiting for the meta noindex to take effect.
More details
- We've seen no duplicate content or other warnings from WebmasterTools.
- We've been constantly acquiring quality backlinks from credible sites.
- We deleted the useless content and fixed the canonical issues that were a result of switching servers.
History
Our site is a news/entertainment blog. The traffic usually has spikes depending on what's going on in the news.
- Nov 1, 2011 - Site kept maxing out at 30k+ visits so we switched servers.
- Jan 30, 2012 - Hired a writer so we could focus on other aspects of the site.
- Apr 19, 2012 - Noticed our posts weren't getting indexed like they used to. Suspected our writer was spinning articles but couldn't find any evidence. 90% of our blog posts were nowhere to be found in Google. Scraper sites would outrank us for our own stories... even our Twitter account was ranking ahead of us. If our story did show up in Google, it would usually be the home page instead of the blog post.
- Sep 2012 - Finally got more serious about addressing the problem. Noticed a couple potentially big problems and started making changes.
Canonical Issues
- The non-www site didn't redirect to www. It showed 2 different link profiles according to OpenSiteExplorer and 0 backlinks according to Webmaster Tools.
- WordPress shortlinks weren't redirecting to the actual permalink. For instance, http://www.domain.com/?p=123 and http://www.domain.com/post-example were both getting indexed.
For every post there were 4 different versions that Google had to choose from.
http://domain.com/?p=123, http://www.domain.com/?p=123, http://domain.com/post-example, and http://www.domain.com/post-example
I figured the canonical issues must have happened when we switched servers, which was the reason for the drop in WebmasterTools indexed pages and the increase in Not Selected pages.
FIXED (Sep 15): Once we fixed the canonical issue, the Indexed Pages count went back up; however, the Not Selected count is still the same.
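For anyone checking a similar setup, here is a minimal Python sketch of the mapping the redirects are meant to enforce: all four variants of a post collapse to one preferred URL. The `CANONICAL_HOST` value, the `PERMALINKS` lookup, and the `canonical_url` helper are hypothetical names made up for illustration; on a real WordPress site the 301 redirects would be handled by the server or WordPress itself, not by a script like this.

```python
from urllib.parse import parse_qs, urlsplit, urlunsplit

# Assumption: the www host is the preferred (canonical) one.
CANONICAL_HOST = "www.domain.com"

# Hypothetical lookup from WordPress post ID to permalink slug.
PERMALINKS = {"123": "post-example"}


def canonical_url(url: str) -> str:
    """Return the single preferred URL for any variant of a post URL."""
    parts = urlsplit(url)
    path = parts.path or "/"
    query = parts.query

    # Shortlinks look like /?p=123; swap them for the permalink if we know it.
    post_id = parse_qs(query).get("p", [None])[0]
    if post_id in PERMALINKS:
        path = "/" + PERMALINKS[post_id]
        query = ""

    # Always force the preferred host, whatever host was requested.
    return urlunsplit((parts.scheme, CANONICAL_HOST, path, query, ""))


variants = [
    "http://domain.com/?p=123",
    "http://www.domain.com/?p=123",
    "http://domain.com/post-example",
    "http://www.domain.com/post-example",
]
for variant in variants:
    print(variant, "->", canonical_url(variant))
```

Each of the four variants should come out as the same www permalink; any variant that doesn't is one that still needs a redirect.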
Duplicate Content
When we first created our site we wanted to have tons of images for each musician/athlete/actor/etc. so we uploaded about 5-10 for each person. We created a blog post for each image with no writing and the exact same post titles. As a result there were TONS of low-quality, similar posts, with virtually identical permalinks. e.g. http://www.domain.com/james-smith1, http://www.domain.com/james-smith2, http://www.domain.com/james-smith3, etc.
A crawl on Sep 26 showed over 550 duplicate content warnings.
FIXED (Oct 1): We deleted/301 redirected the useless pages (they weren't getting traffic anyways) and by the next crawl the number was almost to 0... which it's at now.
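A rough Python sketch of how numbered near-duplicates like these could be spotted from a list of URLs (for example, an export from a crawl). The `group_numbered_slugs` helper is a hypothetical name, and it only catches slugs that differ by a trailing number; it is not how any crawler actually detects duplicate content.

```python
import re
from collections import defaultdict
from urllib.parse import urlsplit


def group_numbered_slugs(urls):
    """Group URLs whose slugs differ only by a trailing number."""
    groups = defaultdict(list)
    for url in urls:
        slug = urlsplit(url).path.strip("/")
        # Drop a trailing number (and an optional hyphen before it).
        base = re.sub(r"-?\d+$", "", slug)
        groups[base].append(url)
    # Keep only the bases shared by more than one URL.
    return {base: found for base, found in groups.items() if len(found) > 1}


urls = [
    "http://www.domain.com/james-smith1",
    "http://www.domain.com/james-smith2",
    "http://www.domain.com/james-smith3",
    "http://www.domain.com/post-example",
]
for base, found in group_numbered_slugs(urls).items():
    print(base, len(found))
```

Each group is a candidate set of pages to delete or 301 redirect to a single URL.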
We also had TONS of tags (since there are constantly new names in the media) that were getting indexed, so we added a meta robots noindex tag to the tag pages.
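To confirm the noindex tag is actually being served on the tag pages, something like the following Python sketch could be run against the page HTML. The `RobotsMetaParser` class and `is_noindexed` function are hypothetical names, and the sample markup is made up; it only checks for the tag and says nothing about when Google will recrawl the page and act on it.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collect the directives from any <meta name="robots"> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        if (attr.get("name") or "").lower() != "robots":
            return
        content = attr.get("content") or ""
        # Directives are comma-separated, e.g. "noindex, follow".
        for directive in content.split(","):
            if directive.strip():
                self.directives.add(directive.strip().lower())


def is_noindexed(html: str) -> bool:
    """True if the page carries a meta robots noindex directive."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return "noindex" in parser.directives


tag_page = '<html><head><meta name="robots" content="noindex, follow"></head></html>'
post_page = "<html><head><title>Post example</title></head></html>"
print(is_noindexed(tag_page), is_noindexed(post_page))
```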
Questions:
- Why aren't a majority of our posts getting indexed?
- Were we penalized or just stuck because of a filter?
- How long should it take for meta robots to noindex the tags pages? (I did it on Sep 25 but they are still there)
- If a site is scraping our content (same title, image, excerpt) but linking to us, should we contact them and tell them to remove it?
- Is there anything else we need to do to start getting our blog posts indexed like they used to?
- Should we try contacting Google to re-evaluate our site?
Sorry, that was a LOT of writing. If anyone wants the URL please let me know so I can PM it to you. Any help would be greatly appreciated!
-
Thanks for the quick response Dana!
They are sourcing us but at the moment they're constantly showing up instead of our site. There's no reason they should outrank us, but we figured they couldn't really be hurting us since they are linking to us.
It's weird that the site is able to scrape content and still get indexed so well... maybe Google just hasn't picked up on it yet.
We are hoping that we don't have to worry about scrapers once our blog posts start getting indexed like they used to.
I think we'll send them a friendly email like you suggested.
-
I sympathize with your frustration. I know what it's like to come into a situation after years and years of technical missteps made by folks who never took SEO into consideration and then have to start cleaning up the mess.
You have a lot of hard technical problems and I'm not a developer so I will let those more technically gifted than myself address some of those.
There is one question I felt I could answer, and that is #4: "If a site is scraping our content (same title, image, excerpt) but linking to us, should we contact them and tell them to remove it?"
You could go that way. But in the event that it is a decent site (despite the fact that they scraped your content), first make sure you have a canonical tag properly implemented on your page. Then contact them and say, "I see you found my content interesting enough to share on your site. Instead of me asking you to remove it, would you mind adding an attribution line, giving credit to my site as the source of the content and including a link back to me, please? I think we can both agree this would be better than having me file a DMCA request with Google."
Just a thought
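As a rough way to check the canonical tag mentioned above, this Python sketch parses a page's HTML and confirms there is exactly one rel="canonical" link pointing at the expected URL. The `CanonicalLinkParser` class and `canonical_is_correct` function are hypothetical names, and the sample markup is made up for illustration.

```python
from html.parser import HTMLParser


class CanonicalLinkParser(HTMLParser):
    """Collect the href of every <link rel="canonical"> tag."""

    def __init__(self):
        super().__init__()
        self.canonicals = []

    def handle_starttag(self, tag, attrs):
        if tag != "link":
            return
        attr = dict(attrs)
        rels = (attr.get("rel") or "").lower().split()
        if "canonical" in rels and attr.get("href"):
            self.canonicals.append(attr["href"])


def canonical_is_correct(html: str, expected_url: str) -> bool:
    """True only if the page has exactly one canonical link, and it matches."""
    parser = CanonicalLinkParser()
    parser.feed(html)
    return parser.canonicals == [expected_url]


expected = "http://www.domain.com/post-example"
page = '<head><link rel="canonical" href="http://www.domain.com/post-example"></head>'
print(canonical_is_correct(page, expected))
```

A page with no canonical link, or with two conflicting ones, fails the check.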