Could this be seen as duplicate content in Google's eyes?
-
Hi
I'm an in-house SEO and we've recently seen Panda related traffic loss along with some of our main keywords slipping down the SERPs.
Looking for possible Panda related issues I was wondering if the following could be seen as duplicate content. We've got some very similar holidays (travel company) on our website. While they are different I'm concerned it may be seen as creating content that is too similar:
They do all have unique text but as you can see from the titles, they are very similar (note from an SEO point of view the tabbed content is all within the same page at source level).
At the top level of the holiday pages we have a filtered search:
http://www.naturalworldsafaris.com/destinations/africa-and-the-indian-ocean/kenya/suggested-holidays.aspxThese pages have a unique introduction but the content snippets being pulled into the boxes is drawn from each of the individual holiday pages.
I'm just concerned that these could be introducing some duplicating issues. Any thoughts?
-
Hi Cyrus,
Thanks for taking the time to answer.
It seems that there is no firm answer on this one - interesting to see you felt there wasn't necessarily an issue of duplicated content but that grouping these pages into themes with a hub page would be of benefit (assuming I've understood your suggestions).
The issue is that in some ways the pages and content is similar, so the trips are focused on the beaches and wildlife of Kenya - a lot of the difference is in the accommodation and level of luxury, which is dealt with in the on page copy. I think we will have to revisit how we handle page titles.
We only fairly recently changed those pages to ensure that all content in the individual tabs is visible to search engines (previously they were only able to crawl the content in the overview tabs, the content of other tabs was effectively hidden). I have checked this in Google Webmaster Tools and it all displays fine / all the tabbed content is found within the html.
Many thanks
Kate -
I'm going to go against the grain and say this doesn't look like a duplicate content issue to me - at least based on text. There's enough unique content on those pages that you shouldn't be falling into those filters. No one can say for sure - that's simply based on my experience.
That said, there are other signals around these pages that are very similar. Namely things like title tags and anchor text.
Title Tags:
- The Wildlife & Beaches of Kenya - Natural World Safaris
- Ultimate Kenya Wildlife and Beaches Safari - Natural World Safaris
- Wildlife & Beach Family Safari - Natural World Safaris
From a topic perspective, are these differentiated enough? They seem to target very similar topics and keywords. ... and the anchor text to these pages follows similar patterns, mostly internal links from the sidebar.
So long story short, these pages may not be differentiated enough that they may be interpreted as dupe content (or thin content topics, as it were) and there simply aren't enough external signals to keep these pages afloat.
The solution may be to consolidate or group these pages into themes. Make sure you have strong "hub" pages that link everything together (think Trip Advisor)
One other thing of note - I notice the page is JavaScript dependent. Because of this, make sure to perform a "Fetch and Render" in Google Webmaster Tools, and make sure the page displays correctly. If it doesn't, be sure to address any issues.
-
Thanks for the replies Andy and Amelia
We cover around 30 destinations and each one has a suggested-holidays page and then maybe 5-15 individual itineraries. Using the copy from any of those itinerary pages will show multiple results in Google as the opening text is being pulled into several other areas on the site.
However, individually a lot of these itinerary pages and overview suggested-holiday main pages rank reasonably well and account for quite a lot of traffic to the site. We can't no-index or use canonicalisation really as each page does have unique content and is different - there is just quite a bit of cross over. At the same time we saw a significant drop with Panda 4.0 and see smaller drops every month with each subsequent update.
Has anyone got any suggestions on how else we can handle this content?
Thanks
Kate -
Hi Kate,
Your assumption about duplicate / similar content appears to be well founded. Just to test a sample, I took the following snippet from this page, and searched in Google:
"Acacia House sits in Ol Chorro Losoit Valley, within the Lemak Hills"
Google returns 4 pages, so yes, there are issues here - and it isn't as straight forward as canonicalisation to fix as this can mean other pages could miss out on a chance to be indexed and returned. However, what you can't tell, is to what degree Google is objecting to these kids of issues. Some say that Google is smart enough to understand what a snippet is, and won't penalise based on this - others disagree. Myself, I try to ensure my clients have unique content on each page and always err on the side of caution.
I also took a snippet from itinerary here and did the same - this time it came back with 5 different pages.
My opinion is that yes, you do have problems that need to be rectified. I know this was only a very quick look, but I shouldn't be seeing so many pages with the same snippets of content in Google. The odd one you can get away with, but I bet I would find lots.
How many unique pages with content like this do you think you have?
-Andy
-
If you're aggregating content from different pages into one, then you may want to look at canonical tags. I'm sure someone much smarter than me will tell you how to do it
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate content across domains?
Does anyone have suggestions for managing duplicate product/solution website content across domains? (specifically parent/child company domains) Is it advisable to do this? Will it hurt either domain? Any best practices when going down this path?
Intermediate & Advanced SEO | | pilgrimquality0 -
What's the best way to check Google search results for all pages NOT linking to a domain?
I need to do a bit of link reclamation for some brand terms. From the little bit of searching I've done, there appear to be several thousand pages that meet the criteria, but I can already tell it's going to be impossible or extremely inefficient to save them all manually. Ideally, I need an exported list of all the pages mentioning brand terms not linking to my domain, and then I'll import them into BuzzStream for a link campaign. Anybody have any ideas about how to do that? Thanks! Jon
Intermediate & Advanced SEO | | JonMorrow0 -
Can I, in Google's good graces, check for Googlebot to turn on/off tracking parameters in URLs?
Basically, we use a number of parameters in our URLs for event tracking. Google could be crawling an infinite number of these URLs. I'm already using the canonical tag to point at the non-tracking versions of those URLs....that doesn't stop the crawling tho. I want to know if I can do conditional 301s or just detect the user agent as a way to know when to NOT append those parameters. Just trying to follow their guidelines about allowing bots to crawl w/out things like sessionID...but they don't tell you HOW to do this. Thanks!
Intermediate & Advanced SEO | | KenShafer0 -
What NAP format do I use if the USPS can't even find my client's address?
My client has a site already listed on Google+Local under "5208 N 1st St". He has some other NAPs, e.g., YellowPages, under "5208 N First Street". The USPS finds neither of these, nor any variation that I can possibly think of! Which is better? Do I just take the one that Google has accepted and make all the others like it as best I can? And doesn't it matter that the USPS doesn't even recognize the thing? Or no? Local SEO wizards, thanks in advance for your guidance!
Intermediate & Advanced SEO | | rayvensoft0 -
Duplicate content clarity required
Hi, I have access to a masive resource of journals that we have been given the all clear to use the abstract on our site and link back to the journal. These will be really useful links for our visitors. E.g. http://www.springerlink.com/content/59210832213382K2 Simply, if we copy the abstract and then link back to the journal source will this be treated as duplicate content and damage the site or is the link to the source enough for search engines to realise that we aren't trying anything untoward. Would it help if we added an introduction so in effect we are sort of following the curating content model? We are thinking of linking back internally to a relevant page using a keyword too. Will this approach give any benefit to our site at all or will the content be ignored due to it being duplicate and thus render the internal links useless? Thanks Jason
Intermediate & Advanced SEO | | jayderby0 -
Duplicate Content Help
seomoz tool gives me back duplicate content on both these URL's http://www.mydomain.com/football-teams/ http://www.mydomain.com/football-teams/index.php I want to use http://www.mydomain.com/football-teams/ as this just look nice & clean. What would be best practice to fix this issue? Kind Regards Eddie
Intermediate & Advanced SEO | | Paul780 -
Tool to calculate the number of pages in Google's index?
When working with a very large site, are there any tools that will help you calculate the number of links in the Google index? I know you can use site:www.domain.com to see all the links indexed for a particular url. But what if you want to see the number of pages indexed for 100 different subdirectories (i.e. www.domain.com/a, www.domain.com/b)? is there a tool to help automate the process of finding the number of pages from each subdirectory in Google's index?
Intermediate & Advanced SEO | | nicole.healthline0