Resolving duplicate text issues with a duplicate image?
-
We are a listing site for programs overseas. Many of our listings are inherently the same content, because in many cases the same exact information applies. We have resolved duplicate content issues to some extent by making some of the content in these listings unique. However, for the rest of the content, which is going to be the same for about 100 pages, we were wondering if it's better to have an image in place instead of duplicate text content (this would basically be an image of the text in question). We know this is a problem, because this is inherently duplicate content as well (only it's a duplicate image instead of duplicate text). So what's the best solution to this problem? Is a duplicate image just asking for trouble, or might this actually be a good idea?
-
Google won't index text embedded in an image on a web page (currently it does so only for .pdf documents).
If you want a little more insurance, which you won't really need, use your handy robots.txt or rel="canonical".
As usual, keep your eyes forward:
"While search engines may not use OCR for indexing the content of web pages now, that doesn’t mean that they might not in the future, and there are some indications that the search engines are developing a much greater proficiency in the use of optical character recognition."
Here's that article, including some great references.
Good luck.
-
Could you point me to a valid reference on that OCR issue?
-
You should use the rel=canonical tag on duplicate content pages. Google can read text embedded in an image through OCR, so a duplicate image is not a good option. Also, think about how much these images will increase the load time of your pages.
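For anyone unsure what that tag looks like, here is a minimal sketch in Python that builds one. The function name `canonical_link` and the example.com URL are made up for illustration; only the `<link rel="canonical" href="...">` element itself is the standard markup, and it belongs in the `<head>` of each duplicate page.

```python
from html import escape


def canonical_link(preferred_url: str) -> str:
    """Build a <link rel="canonical"> element pointing at the preferred URL.

    The URL is HTML-escaped so that characters such as & are safe inside
    the href attribute.
    """
    return f'<link rel="canonical" href="{escape(preferred_url, quote=True)}">'


# Hypothetical listing URL, used only for illustration.
print(canonical_link("https://www.example.com/programs/malaysia"))
```

Keep in mind that the tag tells search engines which URL you prefer, so the pages that point elsewhere will generally not be the ones shown in results. That fits pages that really are interchangeable, and fits less well if each listing is supposed to rank on its own.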
-
To answer your question directly, there are a few ways to present content so that it is not readily crawlable by search engines: Flash, iframes, and images.
As for good ideas, I much prefer offering real content that is unique to the given area. Let's say you are a US-based site offering programs for attending universities overseas. Add some content specific to each country's page to make it unique.
If you present Malaysia as a country, talk about its universities by name, awards they have won, landmarks, and other items of interest such as the country's incredibly diverse forests. You can also provide testimonials from satisfied clients. Testimonials can help establish a lot of relevancy, as clients will often mention specifics about where they are from ("John from Miami, FL") and where they visited.
In short, you will achieve better results if you work within Google's system than by trying to work around it.