Could this be seen as duplicate content in Google's eyes?
-
Hi
I'm an in-house SEO and we've recently seen Panda related traffic loss along with some of our main keywords slipping down the SERPs.
Looking for possible Panda related issues I was wondering if the following could be seen as duplicate content. We've got some very similar holidays (travel company) on our website. While they are different I'm concerned it may be seen as creating content that is too similar:
They do all have unique text but as you can see from the titles, they are very similar (note from an SEO point of view the tabbed content is all within the same page at source level).
At the top level of the holiday pages we have a filtered search:
http://www.naturalworldsafaris.com/destinations/africa-and-the-indian-ocean/kenya/suggested-holidays.aspxThese pages have a unique introduction but the content snippets being pulled into the boxes is drawn from each of the individual holiday pages.
I'm just concerned that these could be introducing some duplicating issues. Any thoughts?
-
Hi Cyrus,
Thanks for taking the time to answer.
It seems that there is no firm answer on this one - interesting to see you felt there wasn't necessarily an issue of duplicated content but that grouping these pages into themes with a hub page would be of benefit (assuming I've understood your suggestions).
The issue is that in some ways the pages and content is similar, so the trips are focused on the beaches and wildlife of Kenya - a lot of the difference is in the accommodation and level of luxury, which is dealt with in the on page copy. I think we will have to revisit how we handle page titles.
We only fairly recently changed those pages to ensure that all content in the individual tabs is visible to search engines (previously they were only able to crawl the content in the overview tabs, the content of other tabs was effectively hidden). I have checked this in Google Webmaster Tools and it all displays fine / all the tabbed content is found within the html.
Many thanks
Kate -
I'm going to go against the grain and say this doesn't look like a duplicate content issue to me - at least based on text. There's enough unique content on those pages that you shouldn't be falling into those filters. No one can say for sure - that's simply based on my experience.
That said, there are other signals around these pages that are very similar. Namely things like title tags and anchor text.
Title Tags:
- The Wildlife & Beaches of Kenya - Natural World Safaris
- Ultimate Kenya Wildlife and Beaches Safari - Natural World Safaris
- Wildlife & Beach Family Safari - Natural World Safaris
From a topic perspective, are these differentiated enough? They seem to target very similar topics and keywords. ... and the anchor text to these pages follows similar patterns, mostly internal links from the sidebar.
So long story short, these pages may not be differentiated enough that they may be interpreted as dupe content (or thin content topics, as it were) and there simply aren't enough external signals to keep these pages afloat.
The solution may be to consolidate or group these pages into themes. Make sure you have strong "hub" pages that link everything together (think Trip Advisor)
One other thing of note - I notice the page is JavaScript dependent. Because of this, make sure to perform a "Fetch and Render" in Google Webmaster Tools, and make sure the page displays correctly. If it doesn't, be sure to address any issues.
-
Thanks for the replies Andy and Amelia
We cover around 30 destinations and each one has a suggested-holidays page and then maybe 5-15 individual itineraries. Using the copy from any of those itinerary pages will show multiple results in Google as the opening text is being pulled into several other areas on the site.
However, individually a lot of these itinerary pages and overview suggested-holiday main pages rank reasonably well and account for quite a lot of traffic to the site. We can't no-index or use canonicalisation really as each page does have unique content and is different - there is just quite a bit of cross over. At the same time we saw a significant drop with Panda 4.0 and see smaller drops every month with each subsequent update.
Has anyone got any suggestions on how else we can handle this content?
Thanks
Kate -
Hi Kate,
Your assumption about duplicate / similar content appears to be well founded. Just to test a sample, I took the following snippet from this page, and searched in Google:
"Acacia House sits in Ol Chorro Losoit Valley, within the Lemak Hills"
Google returns 4 pages, so yes, there are issues here - and it isn't as straight forward as canonicalisation to fix as this can mean other pages could miss out on a chance to be indexed and returned. However, what you can't tell, is to what degree Google is objecting to these kids of issues. Some say that Google is smart enough to understand what a snippet is, and won't penalise based on this - others disagree. Myself, I try to ensure my clients have unique content on each page and always err on the side of caution.
I also took a snippet from itinerary here and did the same - this time it came back with 5 different pages.
My opinion is that yes, you do have problems that need to be rectified. I know this was only a very quick look, but I shouldn't be seeing so many pages with the same snippets of content in Google. The odd one you can get away with, but I bet I would find lots.
How many unique pages with content like this do you think you have?
-Andy
-
If you're aggregating content from different pages into one, then you may want to look at canonical tags. I'm sure someone much smarter than me will tell you how to do it
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
After hack and remediation, thousands of URL's still appearing as 'Valid' in google search console. How to remedy?
I'm working on a site that was hacked in March 2019 and in the process, nearly 900,000 spam links were generated and indexed. After remediation of the hack in April 2019, the spammy URLs began dropping out of the index until last week, when Search Console showed around 8,000 as "Indexed, not submitted in sitemap" but listed as "Valid" in the coverage report and many of them are still hack-related URLs that are listed as being indexed in March 2019, despite the fact that clicking on them leads to a 404. As of this Saturday, the number jumped up to 18,000, but I have no way of finding out using the search console reports why the jump happened or what are the new URLs that were added, the only sort mechanism is last crawled and they don't show up there. How long can I expect it to take for these remaining urls to also be removed from the index? Is there any way to expedite the process? I've submitted a 'new' sitemap several times, which (so far) has not helped. Is there any way to see inside the new GSC view why/how the number of valid URLs in the indexed doubled over one weekend?
Intermediate & Advanced SEO | | rickyporco0 -
Duplicate content on product pages
Hi, We are considering the impact when you want to deliver content directly on the product pages. If the products were manufactured in a specific way and its the same process across 100 other products you might want to tell your readers about it. If you were to believe the product page was the best place to deliver this information for your readers then you could potentially be creating mass content duplication. Especially as the storytelling of the product could equate to 60% of the page content this could really flag as duplication. Our options would appear to be:1. Instead add the content as a link on each product page to one centralised URL and risk taking users away from the product page (not going to help with conversion rate or designers plans)2. Put the content behind some javascript which requires interaction hopefully deterring the search engine from crawling the content (doesn't fit the designers plans & users have to interact which is a big ask)3. Assign one product as a canonical and risk the other products not appearing in search for relevant searches4. Leave the copy as crawlable and risk being marked down or de-indexed for duplicated contentIts seems the search engines do not offer a way for us to serve this great content to our readers with out being at risk of going against guidelines or the search engines not being able to crawl it.How would you suggest a site should go about this for optimal results?
Intermediate & Advanced SEO | | FashionLux2 -
If a website trades internationally and simply translates its online content from English to French, German, etc how can we ensure no duplicate content penalisations and still maintain SEO performance in each territory?
Most of the international sites are as below: example.com example.de example.fr But some countries are on unique domains such example123.rsa
Intermediate & Advanced SEO | | Dave_Schulhof0 -
What to do when all products are one of a kind WYSIWYG and url's are continuously changing. Lots of 404's
Hey Guys, I'm working on a website with WYSIWYG one of a kind products and the url's are continuously changing. There are allot of duplicate page titles (56 currently) but that number is always changing too. Let me give you guys a little background on the website. The site sells different types of live coral. So there may be anywhere from 20 - 150 corals of the same species. Each coral is a unique size, color etc. When the coral gets sold the site owner trashes the product creating a new 404. Sometimes the url gets indexed, other times they don't since the corals get sold within hours/days. I was thinking of optimizing each product with a keyword and re-using the url by having the client update the picture and price but that still leaves allot more products than keywords. Here is an example of the corals with the same title http://austinaquafarms.com/product-category/acans/ Thanks for the help guys. I'm not really sure what to do.
Intermediate & Advanced SEO | | aronwp0 -
How to Avoid Duplicate Content Issues with Google?
We have 1000s of audio book titles at our Web store. Google's Panda de-valued our site some time ago because, I believe, of duplicate content. We get our descriptions from the publishers which means a good
Intermediate & Advanced SEO | | lbohen
deal of our description pages are the same as the publishers = duplicate content according to Google. Although re-writing each description of the products we offer is a daunting, almost impossible task, I am thinking of re-writing publishers' descriptions using The Best Spinner software which allows me to replace some of the publishers' words with synonyms. I have re-written one audio book title's description resulting in 8% unique content from the original in 520 words. I did a CopyScape Check and it reported "65 duplicates." CopyScape appears to be reporting duplicates of words and phrases within sentences and paragraphs. I see very little duplicate content of full sentences
or paragraphs. Does anyone know whether Google's duplicate content algorithm is the same or similar to CopyScape's? How much of an audio book's description would I have to change to stay away from CopyScape's duplicate content algorithm? How much of an audio book's description would I have to change to stay away from Google's duplicate content algorithm?0 -
301 redirect for duplicate content
Hey, I have just started working on a site which is a video based city guide, with promotional videos for restaurants, bars, activities,etc. The first thing that I have noticed is that every video on the site has two possible urls:- http://www.domain.com/venue.php?url=rosemarino
Intermediate & Advanced SEO | | AdeLewis
http://www.domain.com/venue/rosemarino I know that I can write a .htaccess line to redirect one to the other:- redirect 301 /venue.php?url=rosemarino http://www.domain.com/venue/rosemarino but this would involve creating a .htaccess line for every video on the site and new videos that get added may get missed. Does anyone know a way of creating a rule to rewrite these urls? Any help would be most gratefully received. Thanks. Ade.0 -
Duplicate content - canonical vs link to original and Flash duplication
Here's the situation for the website in question: The company produces printed publications which go online as a page turning Flash version, and as a separate HTML version. To complicate matters, some of the articles from the publications get added to a separate news section of the website. We want to promote the news section of the site over the publications section. If we were to forget the Flash version completely, would you: a) add a canonical in the publication version pointing to the version in the news section? b) add a link in the footer of the publication version pointing to the version in the news section? c) both of the above? d) something else? What if we add the Flash version into the mix? As Flash still isn't as crawlable as HTML should we noindex them? Is HTML content duplicated in Flash as big an issue as HTML to HTML duplication?
Intermediate & Advanced SEO | | Alex-Harford0 -
How do you rank in the "brands for:" section in Google's search results ?
There's a "brands for:" section that appears above the first organic listing for certain search queries. For example, if you search for "dedicated servers" in Google, you will see that a "brands for:" appears. How do you get listed there? Thanks, Brian
Intermediate & Advanced SEO | | InMotionHosting0