Resolving duplicate text issues with a duplicate image?
-
We are a listing site for programs overseas. Many of our listings are inherently the same content, because in many cases the same exact information applies. We have resolved duplicate content issues to some extent by making some of the content in these listings unique. However, for the rest of the content which is going to be the same for about 100 pages, we were wondering if its better to have an image in place instead of duplicate text content (this would basically be an image of the text in question). We know this is a problem, because this is inherently duplicate content as well (only its a duplicate image instead of duplicate text). However, what's the best solution to this problem, and is a duplicate image just asking for trouble, or might this actually be a good idea?
-
Google won't index image-embedded text on a webpage (currently only .pdf documents)
If you want a little more insurance, which you won't really need, use your handy robot.txt or rel="canonical"
As usual, keep your eyes forward:
"While search engines may not use OCR for indexing the content of web pages now, that doesn’t mean that they might not in the future, and there are some indications that the search engines are developing a much greater proficiency in the use of optical character recognition."
Here's that article, including some great references.
Good luck.
-
Could you point me to a valid reference on that OCR issue?
-
You should use rel=canonical tag on duplicate content pages. Google can read text embedded as an image through OCR algorithm. So duplicate image is not a good option. Moreover think how these images will increase the load time of the web pages.
-
To directly answer your question, there are a few ways you can present content in a manner that is not readily crawlable for search engines: flash, iframe and images.
As far as good ideas, I much prefer to offer real content which is unique to the given area. Let's say you are a US-based site offering programs for attending universities overseas. Add some content specific to each country's page to make it unique.
If you present Malaysia as a country, talk about their universities by name, awards they have won, landmarks and other items of interest such as their incredibly diverse forests. You can also provide testimonials from satisfied clients. Testimonials can help establish a lot of relevancy as clients will often mention specifics about where they are from "John from Miami, FL" and where they visited.
In short, you will achieve better results if you work within Google's system then by trying to work around it.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Product descriptions, when do they become classed as duplicate content, how different do they have to be?
I look after 3 sites which have a lot of crossover on products. We have 1000s of products and I've made it a requirement that we give each it's on description on each of the sites. This sounds like the right thing to but it's very hard for our content writers to write three different versions descriptions, especially when we have variations on the products so potentially writing unique product descriptions for 4-5 very similar products on three separate sites. We've worked very hard to create unique content deep through the site on all categories, subcategories and tag combinations and along with the other SEO work we've done over the last couple of years is producing great results. My question is now far do we have to go? I'm busy writing some product descriptions for a 3rd party site for some of our products, the easy thing to do is just copy and paste but I want Google to see the descriptions as unique. Whilst all SEO advice will say 'write unique descriptions' from a practical point of view this isn't especially useful as there doesn't really seem to be much guidance on how different they need to be. I gather we can't just move around the paragraphs or jumble up sentences a bit but it is easier to work from a description and change it than it is to start from a blank slate (our products range form being very interesting and unique, to quite everyday so sometimes tough to create varied unique content for). Does anyone know of any guidance or evidence of just how clever the Google algorithm is and how close content has to be before it becomes classed as the same or similar? Thanks Pete
Content Development | | PeterLeatherland0 -
Are Duplicate Bio's Duplicate Content?
I'm wondering if I need to go through all the various bio's our firm has on all the various legal directory and client review sites to make sure they have unique bio's? I really, really don't want to do that, but if they are going to flagged as duplicate content, I will. I'm hoping sites like "findlaw," "avvo," etc, have some built in rel:cannonical or something that says these bio's aren't to be seen as unique content and therefore conflict with what's on our site. Anybody know? Just to clarify, any of the sties that have asked for unique content/bio/about us, I have complied with that. However, a lot don't specifically state it has to be unique, so I've just copy and pasted from our site in those cases. Thanks, Ruben
Content Development | | KempRugeLawGroup0 -
Content Curation & Duplicate Content
Hi, I have a client that wants to do content curation but it has been my understanding that adding external content that is already live on another website to your website, you get penalized for duplicate content. I have read that you can create an excerpt and then Google won't penalizes you for duplicate content. Can anyone shed more light on this topic. Thanks
Content Development | | M_80 -
Content Duplication for Job Posting
Hello! I am responsible for SEO of a job portal of a recruitment agency. We get 30-40 jobs every month which we post on a) our job portal b) other job portals (like monster, career builder, Naukri , etc). How do avoid content duplication? We can only post the same job descriptions everywhere. We always post the job on our site first and then the other job portals**. How to ensure that Google knows our portal is the original job posters and not other job portals.** Thank you.
Content Development | | peoplesutra0 -
Blog Anchor Text SEO Query
Apologies if the following is a basic question but I am just starting out with SEO and I have seen this quite a lot recently on sites I have been on. My question is if you have a self hosted blog e.g. blog.site.com or site.com/blog and you use keyword anchor text in your blog post does that bring SEO benefits in itself or does the blog post have to be shared/commented by readers to have an effect? I have seen/heard of many sites spending a lot of money getting copy done or spending their own time and resources starting a blog but the blogs remain unshared or comment-less. I am starting my own blog to go with my social media and website so I wanted to establish the basics. Thanks in advance guys.
Content Development | | jannkuzel0 -
Is it considered as duplicate content ?
Hello, I see a lot of errors on my webmaster tools because of this ajax code on my questions pages of the site (screen) : www.dismoicomment.fr The code : | / ADD ANSWER FORM |
Content Development | | elitepronostic
| | $("#answer-add-button").click(function () { |
| | $.ajax({ |
| | type: 'POST', |
| | url: '/answers/quelle-assurance-choisir-pour-un-scooter/', |
| | data: $("form#answer-add").serialize(), |
| | dataType: 'html', |
| | success: function(data) { |
| | |
| | if(data=="answer") { |
| | $('.answer-add-message').show().empty(); |
| | $(document).ready(function() { |
| | $(' Vous avez déjà répondu à cette question. ').appendTo('.answer-add-message'); |
| | }); | I have add a line on my robots.txt : http://www.dismoicomment.fr/robots.txt for remove all urls with /answers/. These urls with /answers/ aren't indexed in google. Do you think that it is dangerous and that can be considered as duplicate content ? 1129546035.png0 -
Duplicate Terms of Use and Privacy Policy, is it a problem?
Hi, If i use same terms of use and privacy policy content across my websites, does it amounts to duplicate content issues? Does it affect my websites in any manner? Regards
Content Development | | IM_Learner0 -
Duplicate content
Hello Seomoz team, i'm french and so my english is not very good ;-). I work for a brand site and we publish content about our products. The problem is : as a brand site, many sites that sell our products, copy our content. And we have duplicate content. And since these sites have worked SEO, they put in place rel canonical tag. as a brand, how to avoid being accused by Google duplicate content? tanks for you answer. I hope it's clear. Take care Denis
Content Development | | android_lyon0