Resolving duplicate text issues with a duplicate image?
-
We are a listing site for programs overseas. Many of our listings share the same content, because in many cases the exact same information applies. We have resolved duplicate content issues to some extent by making some of the content in these listings unique. However, the rest of the content is going to be the same across about 100 pages, and we were wondering if it's better to put an image in its place instead of duplicate text (basically an image of the text in question). We know this is still a problem, because it is inherently duplicate content as well (only it's a duplicate image instead of duplicate text). What's the best solution here: is a duplicate image just asking for trouble, or might this actually be a good idea?
-
Google won't index text embedded in images on a web page (currently it does this only for .pdf documents).
If you want a little more insurance, which you won't really need, use your handy robots.txt or rel="canonical".
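If you do go the robots.txt route, here is a minimal sketch of how you might sanity-check the rule before deploying it. It uses Python's standard `urllib.robotparser`; the `/images/shared-text/` directory, the example.com URLs and the `is_crawlable` helper are all hypothetical names made up for illustration, not anything from your site.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: keep crawlers out of the directory that
# holds the "image of the shared text" files (path is an assumption).
ROBOTS_TXT = """\
User-agent: *
Disallow: /images/shared-text/
"""


def is_crawlable(robots_txt: str, url: str, user_agent: str = "Googlebot") -> bool:
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # The shared image should be blocked; the listing page should not be.
    print(is_crawlable(ROBOTS_TXT, "https://example.com/images/shared-text/terms.png"))
    print(is_crawlable(ROBOTS_TXT, "https://example.com/programs/malaysia"))
```

This only confirms that the rule matches the URLs you expect. It says nothing about how any particular search engine will treat the blocked image.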
As usual, keep your eyes forward:
"While search engines may not use OCR for indexing the content of web pages now, that doesn’t mean that they might not in the future, and there are some indications that the search engines are developing a much greater proficiency in the use of optical character recognition."
Here's that article, including some great references.
Good luck.
-
Could you point me to a valid reference on that OCR issue?
-
You should use the rel=canonical tag on the duplicate content pages. Google can read text embedded in an image using OCR, so a duplicate image is not a good option. Moreover, think about how much these images will increase the load time of your pages.
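As a rough illustration of the canonical approach, the sketch below pulls the `rel="canonical"` URL out of a page so you can audit that each duplicate listing points where you intend. It relies only on Python's standard `html.parser`; the `CanonicalFinder` class, the `find_canonical` helper and the sample markup are assumptions for the example.

```python
from html.parser import HTMLParser


class CanonicalFinder(HTMLParser):
    """Records the href of the first <link rel="canonical"> tag seen."""

    def __init__(self):
        super().__init__()
        self.canonical = None

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        # rel may hold several space-separated tokens; compare case-insensitively.
        rels = (attr.get("rel") or "").lower().split()
        if tag == "link" and "canonical" in rels and self.canonical is None:
            self.canonical = attr.get("href")


def find_canonical(html: str):
    """Return the canonical URL declared in html, or None if there is none."""
    finder = CanonicalFinder()
    finder.feed(html)
    return finder.canonical


if __name__ == "__main__":
    # Hypothetical listing page that names another URL as the preferred version.
    page = """
    <html><head>
      <title>Study in Malaysia</title>
      <link rel="canonical" href="https://example.com/programs/overview">
    </head><body>Listing text</body></html>
    """
    print(find_canonical(page))
```

Running something like this over all 100 listings would at least show which pages declare a canonical URL and which do not.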
-
To directly answer your question, there are a few ways you can present content in a manner that is not readily crawlable by search engines: Flash, iframes, and images.
As for good ideas, I much prefer to offer real content that is unique to the given area. Let's say you are a US-based site offering programs for attending universities overseas. Add some content specific to each country's page to make it unique.
If you present Malaysia as a country, talk about its universities by name, awards they have won, landmarks, and other items of interest such as its incredibly diverse forests. You can also provide testimonials from satisfied clients. Testimonials can help establish a lot of relevancy, as clients will often mention specifics about where they are from ("John from Miami, FL") and where they visited.
In short, you will achieve better results if you work within Google's system than by trying to work around it.
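If you want a rough way to see how much two listings still overlap after adding country-specific content, a simple word-shingle comparison can help. This is only a sketch under my own assumptions: the `shingles` and `similarity` helpers, the three-word shingle size and the sample sentences are illustrative, and this is not how Google measures duplicate content.

```python
def shingles(text: str, size: int = 3) -> set:
    """Return the set of overlapping word sequences of length `size`."""
    words = text.lower().split()
    return {tuple(words[i:i + size]) for i in range(max(len(words) - size + 1, 0))}


def similarity(a: str, b: str, size: int = 3) -> float:
    """Jaccard similarity of two texts' shingle sets (0.0 = disjoint, 1.0 = identical)."""
    sa, sb = shingles(a, size), shingles(b, size)
    union = sa | sb
    if not union:
        return 0.0  # both texts are shorter than one shingle
    return len(sa & sb) / len(union)


if __name__ == "__main__":
    # Hypothetical listing snippets that differ only in the country name.
    malaysia = "shared visa rules apply to every program in malaysia"
    spain = "shared visa rules apply to every program in spain"
    print(similarity(malaysia, spain))
```

A score close to 1.0 between two country pages suggests that the unique material is still a small share of the page.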