Resolving duplicate text issues with a duplicate image?
-
We are a listing site for programs overseas. Many of our listings inherently share the same content, because in many cases the exact same information applies. We have resolved duplicate content issues to some extent by making some of the content in these listings unique. However, for the rest of the content, which is going to be the same across about 100 pages, we were wondering if it's better to have an image in place of the duplicate text (this would basically be an image of the text in question). We know this is a problem, because it is inherently duplicate content as well (only it's a duplicate image instead of duplicate text). So what's the best solution to this problem? Is a duplicate image just asking for trouble, or might this actually be a good idea?
-
Google won't index image-embedded text on a webpage (currently it does so only for .pdf documents).
If you want a little more insurance, which you won't really need, use your handy robots.txt or rel="canonical".
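For the robots.txt route, here is a minimal sketch of how you might check that a rule really blocks the shared image. The `/images/shared-text/` directory, the example.com URLs, and the `is_crawlable` helper are all hypothetical; only Python's standard `urllib.robotparser` is assumed.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: the /images/shared-text/ path is an assumption,
# standing in for wherever the image of the repeated text would live.
ROBOTS_TXT = """\
User-agent: *
Disallow: /images/shared-text/
"""


def is_crawlable(robots_txt: str, url: str, user_agent: str = "Googlebot") -> bool:
    """Return True if the given robots.txt rules let user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# The shared image is blocked; an ordinary listing page is not.
image_ok = is_crawlable(ROBOTS_TXT, "https://www.example.com/images/shared-text/terms.png")
page_ok = is_crawlable(ROBOTS_TXT, "https://www.example.com/programs/malaysia")
print(image_ok, page_ok)
```

Keep in mind that robots.txt only asks crawlers not to fetch a URL, so treat this as a check on your rules rather than a guarantee about what ends up indexed.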
As usual, keep your eyes forward:
"While search engines may not use OCR for indexing the content of web pages now, that doesn’t mean that they might not in the future, and there are some indications that the search engines are developing a much greater proficiency in the use of optical character recognition."
Here's that article, including some great references.
Good luck.
-
Could you point me to a valid reference on that OCR issue?
-
You should use the rel=canonical tag on duplicate content pages. Google can read text embedded in an image through an OCR algorithm, so a duplicate image is not a good option. Moreover, think about how these images will increase the load time of the web pages.
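As a rough illustration of the rel=canonical suggestion, this sketch builds the tag you would place in the `<head>` of each duplicate page. The `canonical_link_tag` helper and the example.com URL are hypothetical, and the approach only fits pages that really are duplicates of the URL you point to.

```python
from html import escape


def canonical_link_tag(canonical_url: str) -> str:
    """Build a rel=canonical <link> tag pointing at the preferred URL."""
    # Escape the URL so characters such as & and " are safe inside the attribute.
    return f'<link rel="canonical" href="{escape(canonical_url, quote=True)}">'


# Hypothetical preferred URL for a group of duplicate listing pages.
tag = canonical_link_tag("https://www.example.com/programs/study-abroad")
print(tag)
```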
-
To directly answer your question, there are a few ways you can present content in a manner that is not readily crawlable by search engines: Flash, iframes, and images.
As far as good ideas go, I much prefer to offer real content that is unique to the given area. Let's say you are a US-based site offering programs for attending universities overseas. Add some content specific to each country's page to make it unique.
If you present Malaysia as a country, talk about its universities by name, awards they have won, landmarks, and other items of interest such as its incredibly diverse forests. You can also provide testimonials from satisfied clients. Testimonials can help establish a lot of relevancy, as clients will often mention specifics about where they are from (for example, "John from Miami, FL") and where they visited.
In short, you will achieve better results if you work within Google's system than by trying to work around it.