Resolving duplicate text issues with a duplicate image?
-
We are a listing site for programs overseas. Many of our listings contain essentially the same content, because in many cases exactly the same information applies. We have resolved duplicate content issues to some extent by making some of the content in these listings unique. However, for the rest of the content, which is going to be the same across about 100 pages, we were wondering if it's better to put an image in place of the duplicate text (this would basically be an image of the text in question). We know this is a problem, because it is inherently duplicate content as well (only it's a duplicate image instead of duplicate text). What's the best solution to this problem? Is a duplicate image just asking for trouble, or might this actually be a good idea?
-
Google won't index text embedded in an image on a webpage (currently it does so only for .pdf documents).
If you want a little more insurance, which you won't really need, use your handy robots.txt or rel="canonical".
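For the robots.txt route, here is a minimal sketch of how you might check that a rule really blocks the shared image while leaving the listing pages crawlable. It uses Python's standard `urllib.robotparser`; the domain, the image path and the `is_blocked` helper are assumptions made up for illustration, not anything from the site in question.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt rules; the image path is an assumption for illustration.
ROBOTS_TXT = """\
User-agent: *
Disallow: /images/shared-program-text.png
"""


def is_blocked(robots_txt: str, url: str, agent: str = "Googlebot") -> bool:
    """Return True if the given robots.txt rules disallow `url` for `agent`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch(agent, url)


# The shared image should be blocked; an ordinary listing page should not be.
image_blocked = is_blocked(
    ROBOTS_TXT, "https://www.example.com/images/shared-program-text.png"
)
listing_blocked = is_blocked(
    ROBOTS_TXT, "https://www.example.com/programs/malaysia"
)
```

This only tests the rules as Python's parser reads them, so treat it as a sanity check rather than a guarantee of how any particular crawler behaves.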
As usual, keep your eyes forward:
"While search engines may not use OCR for indexing the content of web pages now, that doesn’t mean that they might not in the future, and there are some indications that the search engines are developing a much greater proficiency in the use of optical character recognition."
Here's that article, including some great references.
Good luck.
-
Could you point me to a valid reference on that OCR issue?
-
You should use the rel=canonical tag on the duplicate content pages. Google can read text embedded in an image through OCR, so a duplicate image is not a good option. Also, think about how much these images will increase the load time of the pages.
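In case it helps to see what the tag looks like, here is a rough sketch that builds a `<link rel="canonical">` element for each near-duplicate listing. The URLs and the `canonical_link` helper are hypothetical, and which page should be the preferred one is a decision only the site owner can make.

```python
from html import escape


def canonical_link(canonical_url: str) -> str:
    """Build the <link rel="canonical"> element for a page's <head>."""
    return f'<link rel="canonical" href="{escape(canonical_url, quote=True)}">'


# Hypothetical mapping: near-identical listing pages -> the preferred page.
duplicates = {
    "https://www.example.com/programs/listing-a": "https://www.example.com/programs/main-listing",
    "https://www.example.com/programs/listing-b": "https://www.example.com/programs/main-listing",
}

# One tag per duplicate page, to be placed in that page's <head>.
tags = {page: canonical_link(target) for page, target in duplicates.items()}
```

Keep in mind that a canonical tag is a hint to search engines rather than a command, and that it asks them to treat the duplicate as a copy of the preferred page.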
-
To directly answer your question, there are a few ways you can present content in a manner that is not readily crawlable by search engines: Flash, iframes and images.
As far as good ideas go, I much prefer to offer real content that is unique to the given area. Let's say you are a US-based site offering programs for attending universities overseas. Add some content specific to each country's page to make it unique.
If you present Malaysia as a country, talk about its universities by name, awards they have won, landmarks and other items of interest, such as its incredibly diverse forests. You can also provide testimonials from satisfied clients. Testimonials can help establish a lot of relevancy, as clients will often mention specifics about where they are from ("John from Miami, FL") and where they visited.
In short, you will achieve better results if you work within Google's system than by trying to work around it.