Could this be seen as duplicate content in Google's eyes?
-
Hi
I'm an in-house SEO and we've recently seen Panda related traffic loss along with some of our main keywords slipping down the SERPs.
Looking for possible Panda related issues I was wondering if the following could be seen as duplicate content. We've got some very similar holidays (travel company) on our website. While they are different I'm concerned it may be seen as creating content that is too similar:
They do all have unique text but as you can see from the titles, they are very similar (note from an SEO point of view the tabbed content is all within the same page at source level).
At the top level of the holiday pages we have a filtered search:
http://www.naturalworldsafaris.com/destinations/africa-and-the-indian-ocean/kenya/suggested-holidays.aspxThese pages have a unique introduction but the content snippets being pulled into the boxes is drawn from each of the individual holiday pages.
I'm just concerned that these could be introducing some duplicating issues. Any thoughts?
-
Hi Cyrus,
Thanks for taking the time to answer.
It seems that there is no firm answer on this one - interesting to see you felt there wasn't necessarily an issue of duplicated content but that grouping these pages into themes with a hub page would be of benefit (assuming I've understood your suggestions).
The issue is that in some ways the pages and content is similar, so the trips are focused on the beaches and wildlife of Kenya - a lot of the difference is in the accommodation and level of luxury, which is dealt with in the on page copy. I think we will have to revisit how we handle page titles.
We only fairly recently changed those pages to ensure that all content in the individual tabs is visible to search engines (previously they were only able to crawl the content in the overview tabs, the content of other tabs was effectively hidden). I have checked this in Google Webmaster Tools and it all displays fine / all the tabbed content is found within the html.
Many thanks
Kate -
I'm going to go against the grain and say this doesn't look like a duplicate content issue to me - at least based on text. There's enough unique content on those pages that you shouldn't be falling into those filters. No one can say for sure - that's simply based on my experience.
That said, there are other signals around these pages that are very similar. Namely things like title tags and anchor text.
Title Tags:
- The Wildlife & Beaches of Kenya - Natural World Safaris
- Ultimate Kenya Wildlife and Beaches Safari - Natural World Safaris
- Wildlife & Beach Family Safari - Natural World Safaris
From a topic perspective, are these differentiated enough? They seem to target very similar topics and keywords. ... and the anchor text to these pages follows similar patterns, mostly internal links from the sidebar.
So long story short, these pages may not be differentiated enough that they may be interpreted as dupe content (or thin content topics, as it were) and there simply aren't enough external signals to keep these pages afloat.
The solution may be to consolidate or group these pages into themes. Make sure you have strong "hub" pages that link everything together (think Trip Advisor)
One other thing of note - I notice the page is JavaScript dependent. Because of this, make sure to perform a "Fetch and Render" in Google Webmaster Tools, and make sure the page displays correctly. If it doesn't, be sure to address any issues.
-
Thanks for the replies Andy and Amelia
We cover around 30 destinations and each one has a suggested-holidays page and then maybe 5-15 individual itineraries. Using the copy from any of those itinerary pages will show multiple results in Google as the opening text is being pulled into several other areas on the site.
However, individually a lot of these itinerary pages and overview suggested-holiday main pages rank reasonably well and account for quite a lot of traffic to the site. We can't no-index or use canonicalisation really as each page does have unique content and is different - there is just quite a bit of cross over. At the same time we saw a significant drop with Panda 4.0 and see smaller drops every month with each subsequent update.
Has anyone got any suggestions on how else we can handle this content?
Thanks
Kate -
Hi Kate,
Your assumption about duplicate / similar content appears to be well founded. Just to test a sample, I took the following snippet from this page, and searched in Google:
"Acacia House sits in Ol Chorro Losoit Valley, within the Lemak Hills"
Google returns 4 pages, so yes, there are issues here - and it isn't as straight forward as canonicalisation to fix as this can mean other pages could miss out on a chance to be indexed and returned. However, what you can't tell, is to what degree Google is objecting to these kids of issues. Some say that Google is smart enough to understand what a snippet is, and won't penalise based on this - others disagree. Myself, I try to ensure my clients have unique content on each page and always err on the side of caution.
I also took a snippet from itinerary here and did the same - this time it came back with 5 different pages.
My opinion is that yes, you do have problems that need to be rectified. I know this was only a very quick look, but I shouldn't be seeing so many pages with the same snippets of content in Google. The odd one you can get away with, but I bet I would find lots.
How many unique pages with content like this do you think you have?
-Andy
-
If you're aggregating content from different pages into one, then you may want to look at canonical tags. I'm sure someone much smarter than me will tell you how to do it
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How to get a large number of urls out of Google's Index when there are no pages to noindex tag?
Hi, I'm working with a site that has created a large group of urls (150,000) that have crept into Google's index. If these urls actually existed as pages, which they don't, I'd just noindex tag them and over time the number would drift down. The thing is, they created them through a complicated internal linking arrangement that adds affiliate code to the links and forwards them to the affiliate. GoogleBot would crawl a link that looks like it's to the client's same domain and wind up on Amazon or somewhere else with some affiiiate code. GoogleBot would then grab the original link on the clients domain and index it... even though the page served is on Amazon or somewhere else. Ergo, I don't have a page to noindex tag. I have to get this 150K block of cruft out of Google's index, but without actual pages to noindex tag, it's a bit of a puzzler. Any ideas? Thanks! Best... Michael P.S., All 150K urls seem to share the same url pattern... exmpledomain.com/item/... so /item/ is common to all of them, if that helps.
Intermediate & Advanced SEO | | 945010 -
How to Evaluate Original Domain Authority vs. Recent 'HTTPS' Duplicate for Potential Domain Migration?
Hello Everyone, So our site has used ‘http’ for the domain since the start. Everything has been set up for this structure and Google is only indexing these pages. Just recently a second version was created on ‘httpS’. We know having both up is the worst case scenario but now that both are up is it worth just switching over or would the original domain authority warrant just keeping it on ‘http’ and redirecting the ‘httpS’ version? Assuming speed and other elements wouldn’t be an issue and it's done correctly. Our thought was if we could do this quickly it would be easier to just redirect the ‘httpS’ version but was not sure if the Pros of ‘httpS’ would be worth the resources. Any help or insight would be appreciated. Please let us know if there are any further details we could provide that might help. Looking forward to hearing from all of you! Thank you in advance for the help. Best,
Intermediate & Advanced SEO | | Ben-R1 -
The images on site are not found/indexed, it's been recommended we change their presentation to Google Bot - could this create a cloaking issue?
Hi We have an issue with images on our site not being found or indexed by Google. We have an image sitemap but the images are served on the Sitecore powered site within <divs>which Google can't read. The developers have suggested the below solution:</divs> Googlebot class="header-banner__image" _src="/~/media/images/accommodation/arctic-canada/arctic-safari-camp/arctic-cafari-camp-david-briggs.ashx"/>_Non Googlebot <noscript class="noscript-image"><br /></span></em><em><span><div role="img"<br /></span></em><em><span>aria-label="Arctic Safari Camp, Arctic Canada"<br /></span></em><em><span>title="Arctic Safari Camp, Arctic Canada"<br /></span></em><em><span>class="header-banner__image"<br /></span></em><em><span>style="background-image: url('/~/media/images/accommodation/arctic-canada/arctic-safari-camp/arctic-cafari-camp-david-briggs.ashx?mw=1024&hash=D65B0DE9B311166B0FB767201DAADA9A4ADA4AC4');"></div><br /></span></em><em><span></noscript> aria-label="Arctic Safari Camp, Arctic Canada" title="Arctic Safari Camp, Arctic Canada" class="header-banner__image image" data-src="/~/media/images/accommodation/arctic-canada/arctic-safari-camp/arctic-cafari-camp-david-briggs.ashx" data-max-width="1919" data-viewport="0.80" data-aspect="1.78" data-aspect-target="1.00" > Is this something that could be flagged as potential cloaking though, as we are effectively then showing code looking just for the user agent Googlebot?The devs have said that via their contacts Google has advised them that the original way we set up the site is the most efficient and considered way for the end user. However they have acknowledged the Googlebot software is not sophisticated enough to recognise this. Is the above solution the most suitable?Many thanksKate
Intermediate & Advanced SEO | | KateWaite0 -
How important is the user experience for SEO in google's eyes?
So far I've gathered that backlinks are really king, however you can't get good backlinks without well written content that serves a purpose. As well you can't do a great job with that content and not keep a good user experience, since why would anyone want to backlink to content that can be helpful if you squint an eye and suffer a few scrolling cramps. So how would you rank user experience in the everlasting war of SEO for Google? With this in mind, why would using bootstrap resources pose a problem? I've seen it could add issue to pageload times, however seems minifying could easily solve that. I personally enjoy the use of Bootstrap since it's very easy on the eyes and can have real positive effects when a user looks at content on such a framework.
Intermediate & Advanced SEO | | Deacyde0 -
How to handle duplicate content with Bible verses
Have a friend that does a site with bible verses and different peoples thoughts or feelings on them. Since I'm an SEO he came to me with questions and duplicate content red flag popped up in my head. My clients all generate their own content so not familiar with this world. Since Bible verses appear all over the place, is there a way to address this from an SEO standpoint to avoid duplicate content issues? Thanks in advance.
Intermediate & Advanced SEO | | jeremyskillings0 -
Duplicate Content... Really?
Hi all, My site is www.actronics.eu Moz reports virtually every product page as duplicate content, flagged as HIGH PRIORITY!. I know why. Moz classes a page as duplicate if >95% content/code similar. There's very little I can do about this as although our products are different, the content is very similar, albeit a few part numbers and vehicle make/model. Here's an example:
Intermediate & Advanced SEO | | seowoody
http://www.actronics.eu/en/shop/audi-a4-8d-b5-1994-2000-abs-ecu-en/bosch-5-3
http://www.actronics.eu/en/shop/bmw-3-series-e36-1990-1998-abs-ecu-en/ate-34-51 Now, multiply this by ~2,000 products X 7 different languages and you'll see we have a big dupe content issue (according to Moz's Crawl Diagnostics report). I say "according to Moz..." as I do not know if this is actually an issue for Google? 90% of our products pages rank, albeit some much better than others? So what is the solution? We're not trying to deceive Google in any way so it would seem unfair to be hit with a dupe content penalty, this is a legit dilemma where our product differ by as little as a part number. One ugly solution would be to remove header / sidebar / footer on our product pages as I've demonstrated here - http://woodberry.me.uk/test-page2-minimal-v2.html since this removes A LOT of page bloat (code) and would bring the page difference down to 80% duplicate.
(This is the tool I'm using for checking http://www.webconfs.com/similar-page-checker.php) Other "prettier" solutions would greatly appreciated. I look forward to hearing your thoughts. Thanks,
Woody 🙂1 -
Google's Stance on "Hidden" Content
Hi, I'm aware Google doesn't care if you have helpful content you can hide/unhide by user interaction. I am also aware that Google frowns upon hiding content from the user for SEO purposes. We're not considering anything similar to this. The issue is, we will be displaying only a part of our content to the user at a time. We'll load 3 results on each page initially. These first 3 results are static, meaning on each initial page load/refresh, the same 3 results will display. However, we'll have a "Show Next 3" button which replaces the initial results with the next 3 results. This content will be preloaded in the source code so Google will know about it. I feel like Google shouldn't have an issue with this since we're allowing the user action to cycle through all results. But I'm curious, is it an issue that the user action does NOT allow them to see all results on the page at once? I am leaning towards no, this doesn't matter, but would like some input if possible. Thanks a lot!
Intermediate & Advanced SEO | | kirmeliux0 -
Restructuring Menu's
Hi all I am running my site on Wordpress using a slightly modified them from Studiopress on the Genisis frame work. I am extremely over my head but alas until I get some revenue SEO and Design are all on me. I do not know HTML or CSS but I do follow directions well (unless you ask my wife). Disclaimer out of the way I have some questions. I would like to change up my menu's to be more on the line of Products | Services | About Us | Contact Us | Blog Listing various direct mail pieces under Products, Sevices and so on and so forth. I wonder does this mean I will have to figure out how to write 301's and other complicated things or can I just make the changes. I think but might be wrong that this will change the URL's. Any advice before I mess this up would be greatly helpful. My site is http://www.roiautosolutions.com. If you want a few laughs about the car business read the 2 most recent blog post, anything before that and my writing style is pretty boring. Thanks, Mark Hilger
Intermediate & Advanced SEO | | mhilger0