Duplicate content on recruitment website
-
Hi everyone,
It seems that Panda 4.2 has hit some industries more than others. I just started working on a website, that has no manual action, but the organic traffic has dropped massively in the last few months. Their external linking profile seems to be fine, but I suspect usability issues, especially the duplication may be the reason.
The website is a recruitment website in a specific industry only. However, they posts jobs for their clients, that can be very similar, and in the same time they can have 20 jobs with the same title and very similar job descriptions. The website currently have over 200 pages with potential duplicate content.
Additionally, these jobs get posted on job portals, with the same content (Happens automatically through a feed).
The questions here are:
- How bad would this be for the website usability, and would it be the reason the traffic went down?
- Is this the affect of Panda 4.2 that is still rolling
- What can be done to resolve these issues?
Thank you in advance.
-
Hi Issa,
You're right, duplicate content and bad usability could be triggering the slow rolling Panda 4.2, but I'd dig in a little more (apologies if you already did this research):
-
You mentioned 200 pages are potentially duplicate; how many are on the site in total? If you have thousands of pages indexed, 200 duplicates probably aren't going to cause a Panda penalty.
-
How similar are these postings? Just the page title? Or is the entire page extremely similar in content? (To answer this: if you made a keyword cloud for these similar job descriptions, would they show roughly the same mapping?)
-
If it's just the page title that's similar, make sure to set the pages apart by including the name of the hiring company (which I assume makes the different positions unique) towards the beginning of the page title
-
If the entire page is similar, then add more content to make the pages more unique, like a blurb about the hiring company, how long the job has been up, how many applicants the job has (if available), etc.
-
Either way, make sure you don't have any old jobs that still have live pages! If possible, I'd redirect them to a similar job posting.
-
Like John asked, did your traffic drop dramatically one day, or has it been tapering off? If it's tapering off, I'd guess it's not Panda.
-
And, last, which pages lost traffic and rankings? Which keywords dropped in rankings? You may be able to tell how you were penalized by which keywords were most affected.
Hope this helps,
Kristina
-
-
Hi Issa -
Great question here. Seems your client is potentially in a tough spot with this!
There is a ton to unpack here and it is hard to know specifics without the site (feel free to private message it to me), but to your specific questions:
- Re: if it is a problem that the jobs have the same title, that is only something you can answer with the analytics data you have access to. It usually is not a problem, but when you have this sort of situation I'd also ask if you have category pages for those terms (eg 20 Growth Hacker jobs in SF a day, but also a "Growth Hacker Jobs in SF" category where all those individual jobs link back up to
- Regarding syndication of content, this can cause an issue if not done correctly. You'd have to see where they lost traffic (you hopefully already know), but if it's the case with syndicated listings losing traffic and non-syndicated not, this is an issue. What I've often done is either get the site we are syndicating to to implement a canonical back to my listing, or get a followed link from their version back to yours. Also, you can be selective about what you syndicate so that it's a small duplication vs complete. Also, make your pages more robust and only syndicate the necessary info if possible.
- Website usability can be bad for Panda, especially if bounce rates are really high. Check those and see if they are high. If they are, you should fix it anyways because you'll get better conversions. I've also heard of cases where they made their site "stickier" and they bounced back from Panda.
I guess it's hard to know if Panda is still rolling out, but from everything I have heard it is. I assume this was not just a one-time drop on one day, but rather a slow leak of traffic? That makes it harder to investigate if the second.
Good luck!
John
-
Great thank you.
Will have a read.
Still though, with the situation above, is it OK for this industry to have such duplicate content and what to do about it if its not.
Thanks
-
I was reading an article earlier from SEO RoundTable, where it details that Duplicate content is a side issue and not necessarily related to the Panda Update - read more here - https://www.seroundtable.com/google-duplicate-content-panda-issues-different-21039.html
John Mueller stated that sites with low quality content are hit by Panda and that duplicate content is a separate side issue.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
SEM Rush & Duplicate content
Hi SEMRush is flagging these pages as having duplicate content, but we have rel = next etc implemented: https://www.key.co.uk/en/key/brand/bott https://www.key.co.uk/en/key/brand/bott?page=2 Or is it being flagged as they're just really similar pages?
Intermediate & Advanced SEO | | BeckyKey0 -
Duplicate Content Pages - A Few Queries..
I am working through the latest Moz Crawl Report and focusing on the 'high priority' issues of Duplicate Page Content. There are some strange instances being flagged and so wondered whether anyone has any knowledge as to why this may be happening... Here is an example; This page; http://www.bolsovercruiseclub.com/destinations/cruise-breaks-&-british-isles/bruges/ ...is apparently duplicated with these pages; http://www.bolsovercruiseclub.com/guides/excursions http://www.bolsovercruiseclub.com/guides/cruises-from-the-uk http://www.bolsovercruiseclub.com/cruise-deals/norwegian-star-europe-cruise-deals Not sure why...? Also, pages that are on our 'Cruise Reviews' section such as this page; http://www.bolsovercruiseclub.com/cruise-reviews/p&o-cruises/adonia/cruising/931 ...are being flagged as duplicated content with a page like this; http://www.bolsovercruiseclub.com/destinations/cruise-breaks-&-british-isles/bilbao/ Is this a 'thin content' issue i.e. 2 pages have 'thin content' and are therefore duplicated? If so, the 'destinations' page can (and will be) rewritten with more content (and images) but the 'cruise reviews' are written by customers and so we are unable to do anything there... Hope that all makes sense?! Andy
Intermediate & Advanced SEO | | TomKing0 -
Noindex Valuable duplicate content?
How could duplicate content be valuable and why question no indexing it? My new client has a clever african safari route builder that you can use to plan your safari. The result is 100's of pages that have different routes. Each page inevitably has overlapping content / destination descriptions. see link examples. To the point - I think it is foolish to noindex something like this. But is Google's algo sophisticated enough to not get triggered by something like this? http://isafari.nathab.com/routes/ultimate-tanzania-kenya-uganda-safari-july-november
Intermediate & Advanced SEO | | Rich_Coffman
http://isafari.nathab.com/routes/ultimate-tanzania-kenya-uganda-safari-december-june0 -
Can a website be punished by panda if content scrapers have duplicated content?
I've noticed recently that a number of content scrapers are linking to one of our websites and have the duplicate content on their web pages. Can content scrapers affect the original website's ranking? I'm concerned that having duplicated content, even if hosted by scrapers, could be a bad signal to Google. What are the best ways to prevent this happening? I'd really appreciate any help as I can't find the answer online!
Intermediate & Advanced SEO | | RG_SEO0 -
Wordpress and duplicate content
Hi, I have recently installed wordpress and started a blog but now loads of duplicate pages are cropping up for tags and authors and dates etc. How do I do the canonical thing in wordpress? Thanks Ian
Intermediate & Advanced SEO | | jwdl0 -
Duplicate Content | eBay
My client is generating templates for his eBay template based on content he has on his eCommerce platform. I'm 100% sure this will cause duplicate content issues. My question is this.. and I'm not sure where eBay policy stands with this but adding the canonical tag to the template.. will this work if it's coming from a different page i.e. eBay? Update: I'm not finding any information regarding this on the eBay policy's: http://ocs.ebay.com/ws/eBayISAPI.dll?CustomerSupport&action=0&searchstring=canonical So it does look like I can have rel="canonical" tag in custom eBay templates but I'm concern this can be considered: "cheating" since rel="canonical is actually a 301 but as this says: http://googlewebmastercentral.blogspot.com/2009/12/handling-legitimate-cross-domain.html it's legitimately duplicate content. The question is now: should I add it or not? UPDATE seems eBay templates are embedded in a iframe but the snap shot on google actually shows the template. This makes me wonder how they are handling iframes now. looking at http://www.webmaster-toolkit.com/search-engine-simulator.shtml does shows the content inside the iframe. Interesting. Anyone else have feedback?
Intermediate & Advanced SEO | | joseph.chambers1 -
How to manage duplicate content?
I have a real estate site that contains a large amount of duplicate content. The site contains listings that appear both on my clients website and on my competitors websites(who have better domain authority). It is critical that the content is there because buyers need to be able to find these listings to make enquiries. The result is that I have a large number pages that contain duplicate content in some way, shape or form. My search results pages are really the most important ones because these are the ones targeting my keywords. I can differentiate these to some degree but the actual listings themselves are duplicate. What strategies exist to ensure that I'm not suffereing as a result of this content? Should I : Make the duplicate content noindex. Yes my results pages will have some degree of duplicate content but each result only displays a 200 character summary of the advert text so not sure if that counts. Would reducing the amount of visible duplicate content improve my rankings as a whole? Link back to the clients site to indicate that they are the original source Any suggestions?
Intermediate & Advanced SEO | | Mulith0 -
Duplicate Content from Article Directories
I have a small client with a website PR2, 268 links from 21 root domains with mozTrusts 5.5, MozRank 4.5 However whenever I check in google for the amount of link: Google always give the response none. My client has a blog and many articles on the blog. However they have submitted their blog article every time to article directories as well, plain and simle creating duplicate and content. Is this the reason why their link: is coming up as none? Is there something to correct the situation?
Intermediate & Advanced SEO | | danielkamen0