A doorway-page vendor has made my SEO life a nightmare! Advice anyone!?
-
Hey Everyone,
So I am the SEO at a mid-sized nationwide retailer and have been working there for almost a year and a half. This retailer is an SEO nightmare. Imagine the worst possible SEO nightmare, and that is my unfortunate yet challenging everyday reality.
In light of the new algorithm update that seems to be on the horizon from Google to further crack down on the usage of doorway pages, I am coming to the Moz community for some desperately needed help.
Before I was employed here, the eCommerce director and SEM Manager connected with a vendor that told them basically that they can do a PPC version of SEO for long-tail keywords. This vendor sold them on the idea that they will never compete with our own organic content and can bring in incremental traffic and revenue due to all of this wonderful technology they have that is essentially just a scraper.
So for the past three years, this vendor has been creating thousands of doorway pages that are hosted on their own server but are masked as our own pages. They have a massive HTML index / directory attached to our website and even upload their own XML sitemaps to our Google Webmaster Tools. So even though they “own” the pages, the pages masquerade as our own organic pages.
So what we have today is thousands upon thousands of product and category pages that are essentially built dynamically and regurgitated through their scraper / platform, whatever.
ALL of these pages are incredibly thin in content and it’s beyond me how Panda has not exterminated them.
ALL of these pages are built entirely for search engines, to the point that you would feel like the year was 1998.
ALL of these pages are incredibly over-optimized with spam that really is equivalent to just stuffing in a ton of meta keywords. (like I said – 1998)
Almost ALL of these scraped doorway pages cause an incredible amount of duplicate content issues even though the “account rep” swears up and down to the SEM Manager (who oversees all paid programs) that they do not.
Many of the pages use other shady tactics such as meta refresh style bait and switching.
For example:
The page title in the SERP shows as: Personalized Watch Boxes
When you click the SERP and land on the doorway page the title changes to:
Personalized Wrist Watches. Not one actual watch box is listed.
They are ALL simply the most god-awful pages in terms of UX that you will ever come across, BUT because of the sheer volume of these pages spammed deep within the site, they create revenue just by playing the odds game.
Executives LOVE revenue.
Also, one of this vendor’s tactics when our budget spend is reduced for this program is to randomly pull a certain amount of their pages and return numerous 404 server errors until spend bumps back up. This causes a massive nightmare for me.
I can go on and on but I think you get where I am going.
I have spent a year and a half campaigning to get rid of this black-hat vendor and I am finally right on the brink of making it happen. The only problem is, it will be almost impossible to avoid a drop in revenue for quite some time when these pages are pulled. Even though I have helped create several organic pages and product categories that will pick up the slack when these are pulled, it will still be a while before the dust settles and things stabilize.
I am going to stop here because I could write a novel on the millions of issues I have with this vendor and what they have done. I know this was a very long and open-ended essay on the problem I have presented to you guys in the Moz community. I apologize and would love to clarify anything I can.
My actual questions would be:
Has anyone gone through a similar situation as this or have experience dealing with a vendor that employs this type of black-hat tactic?
Is there any advice at all that you can offer me, or experiences that you can share, that can help me be as well armed as I can be when I eventually convince the higher-ups they need to pull the plug?
How can I limit the bleeding and can I even remotely rely on Google LSI to serve my organic pages for the related terms of the pages that are now gone?
Thank you guys so much in advance,
-Ben
-
glad to help
-
You are a genius.
-
Glad I could be of some help.
If I were you, I'd definitely grab copies of the pages while they're still live. You could even do this from home using a free tool like
http://phpcrawl.cuab.de/about.html
Add a bit of curl or wget and you've got the pages plus the links and meta. Then if they do disappear suddenly and the business is stuck, you can hand this to your web people at Oracle and they'll probably try to hire you. Having said that, I'd imagine they've got a decent contingency plan because they're Oracle, but you never know. It could save the day.
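As a rough illustration of the "pages plus the links and meta" idea, here is a minimal Python sketch using only the standard library. It is not the tool linked above; the names `PageSnapshot`, `snapshot`, and `fetch` are made up for this example, and the list of URLs to fetch is assumed to come from the vendor's XML sitemaps.

```python
from html.parser import HTMLParser
from urllib.request import urlopen


class PageSnapshot(HTMLParser):
    """Collects the title, named meta tags, and link targets of one page."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.meta = {}
        self.links = []
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "meta" and attrs.get("name"):
            # Store meta tags by lowercased name, e.g. "keywords"
            self.meta[attrs["name"].lower()] = attrs.get("content") or ""
        elif tag == "a" and attrs.get("href"):
            self.links.append(attrs["href"])

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def snapshot(html):
    """Return the pieces worth keeping from one page's HTML."""
    parser = PageSnapshot()
    parser.feed(html)
    return {
        "title": parser.title.strip(),
        "meta": parser.meta,
        "links": parser.links,
    }


def fetch(url):
    """Download one page (hypothetical helper; add throttling for a real crawl)."""
    with urlopen(url, timeout=30) as response:
        return response.read().decode("utf-8", errors="replace")
```

Saving `snapshot(fetch(url))` alongside the raw HTML for each URL would leave a record of what every doorway page targeted, even if the pages vanish overnight.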
-
Thanks Jamie!
Yeah, we actually partner with Oracle for our web design, engineering, implementation and so on. So when it comes to server-side issues, we would have to go through them and there is always red tape involved.
Really, I cannot understand how a vendor that does this is even in business, and it is beyond me how they get away with it. The WordPress 404 plug-in is a great idea though, and that will definitely help me in the future with freelancing while I am here full-time.
-
We do self-canonicalize, and that is a very good question. What they will do is just keep spitting out dynamically generated URLs. They have absolutely no restrictions on page quality or content; they literally have no rules. This gives them immense flexibility.
And for the contract portion: once the contract ends, all of these pages will in fact disappear, and that is why they house them on their own servers. So that is what we want in the end.
It is dealing with the massive number of 404s that will be an issue for a while.
-
Thanks again!
Yes, that is the conundrum I am in here when it comes to "who actually owns the pages" and honestly, this vendor covered their bases. They actually house all of the pages on their own servers and basically scrape our site, then shoot the pages out through our CDN via a proxy or something like that. So they made sure we are at their mercy; they can pull them anytime they want.
So technically, if we were to redirect all of their pages and acquired links, it would actually not be too hard because each page is so unbelievably identical to our own organic pages. The problem is, I believe we would have to access their server, and that will not happen.
It will also be one hell of a mess with 301s if we were to do that, and I know someone on our site team I am planning with fears the length of the 301 chain this would cause in our .htaccess file.
But we are thinking in the same ballpark as you mentioned - trying to find ways to somehow limit the 404 tsunami this would cause and see if we can "take back" some of the value they took from us in link juice.
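On the 301 chain worry, a minimal sketch of one way to keep things flat: collapse every hop in a redirect map before writing any rules, so each old path points straight at its final destination. The function names and the `https://www.example.com` host are placeholders invented for this example. The output uses Apache's `Redirect 301` directive from mod_alias, which matches on path prefixes, so an exact-match rule may suit some URLs better.

```python
def flatten_redirects(redirects):
    """Point every old path straight at its final destination.

    `redirects` maps an old path to its target path. If a target is itself
    redirected, the hops are collapsed so no visitor follows a chain.
    """
    flat = {}
    for source in redirects:
        seen = {source}
        target = redirects[source]
        while target in redirects:
            if target in seen:
                raise ValueError(f"redirect loop involving {target}")
            seen.add(target)
            target = redirects[target]
        flat[source] = target
    return flat


def render_rules(flat, host="https://www.example.com"):
    """Render one `Redirect 301` line per old path, sorted for easy diffing."""
    return [f"Redirect 301 {src} {host}{dst}" for src, dst in sorted(flat.items())]
```

With thousands of entries, the site team may prefer to keep the map out of .htaccess entirely and load it in the main server configuration, but that is a call for whoever owns the server.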
-
Yes, redirections are 100% necessary. I agree wholeheartedly.
-
Surely you can block them once the contract has been ended? I don't know how the law works where you are, but in the UK if you sever a contract you are no longer bound by it. But then again, I'm not a lawyer!!! LOL I'd be earning twice as much if I was!!! I'd look into this or get your legal team (assuming you have one) to look into it for after the contract has ended.
If they're scraping, could you put a canonical tag on your pages to self canonicalise? Only just thought of this!!! Might help, if you've not already done it.
-
Hey thanks so much for the response!
And there are no stupid questions!
Before I was hired here, the company was incredibly aggressive with PPC and CSEs and spent exorbitant amounts on paid traffic.
The company literally drove 2x more traffic through paid than through organic. That has changed now even though we still spend pretty aggressively. We have an excellent SEM Digital Marketing Manager that handles all paid campaigns and affiliate programs and she is run ragged on a daily basis.
I really do think it would be worth taking a look at how we can compensate with PPC on the black-hat vendor's best performing URLs and thank you so much because it is an excellent idea.
To your robot blocking question:
I would love nothing more than to add robots.txt rules that disallow Googlebot from crawling the three subfolders that contain all of their doorway pages. Unfortunately, the company entered into a legally binding contract with them and this would be like an act of war against them. I actually dream about doing this to them every night, so that is an awesome point you bring up!
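For whenever the contract allows it, here is a minimal sketch of how such rules could be tested before they go live, using Python's standard `urllib.robotparser`. The three `/vendor-index-N/` folder names and the `example.com` URLs are placeholders, not the real paths. Keep in mind that a Disallow rule stops crawling; it does not by itself remove pages that are already indexed.

```python
from urllib.robotparser import RobotFileParser

# Placeholder rules: the real subfolder names would replace these.
RULES = """\
User-agent: *
Disallow: /vendor-index-1/
Disallow: /vendor-index-2/
Disallow: /vendor-index-3/
"""


def blocked_urls(robots_txt, urls, agent="Googlebot"):
    """Return the URLs that the given robots.txt text blocks for `agent`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [url for url in urls if not parser.can_fetch(agent, url)]
```

Running `blocked_urls` over a sample of both vendor URLs and your own organic URLs is a cheap way to confirm the rules catch the doorway folders and nothing else.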
-
Thanks so much for the response.
Your advice on having a battle plan is perfect and is something that I have had to try, try try try and once I am done trying, I try again to find more creative ways to present SEO needs, site fixes and strategies.
I even went so far as to show them what their page titles look like in search when they are 90 characters long, and compared them to that shady gas station on an isolated highway, when we could be optimizing the titles, increasing CTR and adding some schema to product page SERPs to make them look like Sheetz!
Full PowerPoint pictures of gas stations!
That is the enigma of pushing SEO: nothing is "guaranteed," but the numbers they are seeing from this black-hat vendor are.
Yesterday, digging deeper and deeper using Screaming Frog, I dug into one of this vendor's subfolders that is a giant index. (They have three of these subfolders they upload to our site.)
I actually found that they are literally making exact copies of our product pages. They then insert what are basically meta spam links on the copies, which ensures that their copies will almost always outrank our original content that we have three writers working on.
Unbelievable, I know. So with your awesome advice and the internal reminder of how much more I need to think outside the box when presenting, I am going to make an entire roster of these plagiarized pages and show them that if all of these copied product pages were removed, our own organic product pages would show as they are meant to.
I cannot believe vendors can still get away with this. No one monitored them or had any idea what they were doing until I was hired. It is just beyond belief.
Thank you so much for the advice and inspiration.
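A minimal sketch of how a roster like that could be built from crawl exports, assuming the body text of each page has already been extracted into two dictionaries keyed by URL. The function name `copied_pages` and the 0.9 threshold are assumptions for illustration, not anything Screaming Frog provides.

```python
from difflib import SequenceMatcher


def copied_pages(originals, vendor_pages, threshold=0.9):
    """List vendor pages whose text closely matches one of our own pages.

    Both arguments map a URL to that page's extracted body text. Each
    roster entry is (vendor_url, our_url, similarity rounded to 2 places).
    """
    roster = []
    for vendor_url, vendor_text in vendor_pages.items():
        best_url, best_score = None, 0.0
        for url, text in originals.items():
            score = SequenceMatcher(None, text, vendor_text).ratio()
            if score > best_score:
                best_url, best_score = url, score
        if best_score >= threshold:
            roster.append((vendor_url, best_url, round(best_score, 2)))
    return roster
```

Comparing every vendor page against every original gets slow with thousands of pages, so in practice it may be worth comparing only pages with the same product ID or title first.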
-
Nicely put, Amelia!
PPC would definitely be a great alternative to make up any losses from organic search. And, PPC Hero is indeed a great resource as is the AdWords help center.
From a technical standpoint, one would still want to have all of those crappy vendor pages redirected somewhere, which would be a pain to do manually, but a necessary pain. If not, they would return a huge number of 404 errors, and that's not going to be a good sign for Google. The pages are already indexed since they are getting traffic, and you'd want to send that traffic, and any links associated with each page, somewhere: ideally, in your situation, a much better (more relevant) page from a user and search engine perspective.
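To make sure no indexed vendor URL is left without a destination, a minimal sketch along these lines could compare the vendor's XML sitemap against the redirect map being built. It assumes the sitemap follows the standard sitemaps.org schema; `sitemap_paths` and `missing_redirects` are names invented for this example.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlparse

# Namespace used by standard XML sitemaps.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def sitemap_paths(sitemap_xml):
    """Return the URL path of every <loc> entry in a sitemap document."""
    root = ET.fromstring(sitemap_xml)
    return [
        urlparse(loc.text.strip()).path
        for loc in root.findall("sm:url/sm:loc", SITEMAP_NS)
    ]


def missing_redirects(sitemap_xml, redirects):
    """Return sitemap paths that have no entry in the redirect map yet."""
    return [path for path in sitemap_paths(sitemap_xml) if path not in redirects]
```

Anything `missing_redirects` returns is a URL that would 404 once the vendor pulls the pages, so the list doubles as a to-do list for the manual mapping work.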
-
What a pig awful situation to be in. I feel for you.
The previous poster has some great suggestions which I would follow.
May I also suggest that you start a PPC campaign of your own to 'pick up the slack' as you put it? Assuming the budget previously allocated to this vendor would cover it? If the vendor was using PPC as the revenue driver to these horrible UX pages, imagine how much better the conversion would be from one of your 'good' pages?
If you've never used AdWords before, then I would look at the AdWords education center for a bit (sorry, I can't remember what it's called). A good site I used when first learning AdWords is PPC Hero. They had some good tips a few years ago, and I have no reason to believe they've gone downhill! I think (and hope I don't inadvertently offend anyone here, but it's my experience) that if you can do SEO then running PPC, though time consuming, should be easy enough for you to get your head around.
I don't know if this is a stupid suggestion or not, as I'm not very technical (I rely on brilliant developers in my team), but could the vendor's dodgy pages be disallowed by your robots file? Could you also remove them from the index via Webmaster Tools (especially if the pages are just PPC landing pages and not built for organic search, which I understand is the case from your post)? Like I say, this may be a stupid suggestion... Please go easy on me if it is!!!
Good luck - and remember, 'what doesn't kill us, makes us stronger'. I bet you're a much better SEO now than you were a year and a half ago!
-
I feel like your best plan of attack will be two-sided:
1.) Education - Which is a definite struggle, but helping your higher-ups really understand WHY these practices are an issue and how it could and eventually will impact their bottom line might resonate more than just saying there are issues present (which I am sure you have been doing anyway). Perhaps reiterating the amount of revenue that is a result of natural search and how much would be lost if the site were penalized would paint a more clear picture. Having data to support your arguments is always helpful. Maybe you can even do some research and present a few summarized case studies on other sites that have been penalized and how it impacted their natural search metrics.
2.) Plan - Have a plan of attack ready. Ok, so you get rid of these pages... Now what? Preparing a very clear, step-by-step plan on what changes need to be made, what these changes will accomplish and what issues they will address, how you will make them and how long it will take, and what the expected outcome will be will help them better understand the process and how it will help save and possibly even improve revenue.
Hope this is helpful - good luck!