Resolving duplicate text issues with a duplicate image?
-
We are a listing site for programs overseas. Many of our listings are inherently the same content, because in many cases the same exact information applies. We have resolved duplicate content issues to some extent by making some of the content in these listings unique. However, for the rest of the content which is going to be the same for about 100 pages, we were wondering if its better to have an image in place instead of duplicate text content (this would basically be an image of the text in question). We know this is a problem, because this is inherently duplicate content as well (only its a duplicate image instead of duplicate text). However, what's the best solution to this problem, and is a duplicate image just asking for trouble, or might this actually be a good idea?
-
Google won't index image-embedded text on a webpage (currently only .pdf documents)
If you want a little more insurance, which you won't really need, use your handy robot.txt or rel="canonical"
As usual, keep your eyes forward:
"While search engines may not use OCR for indexing the content of web pages now, that doesn’t mean that they might not in the future, and there are some indications that the search engines are developing a much greater proficiency in the use of optical character recognition."
Here's that article, including some great references.
Good luck.
-
Could you point me to a valid reference on that OCR issue?
-
You should use rel=canonical tag on duplicate content pages. Google can read text embedded as an image through OCR algorithm. So duplicate image is not a good option. Moreover think how these images will increase the load time of the web pages.
-
To directly answer your question, there are a few ways you can present content in a manner that is not readily crawlable for search engines: flash, iframe and images.
As far as good ideas, I much prefer to offer real content which is unique to the given area. Let's say you are a US-based site offering programs for attending universities overseas. Add some content specific to each country's page to make it unique.
If you present Malaysia as a country, talk about their universities by name, awards they have won, landmarks and other items of interest such as their incredibly diverse forests. You can also provide testimonials from satisfied clients. Testimonials can help establish a lot of relevancy as clients will often mention specifics about where they are from "John from Miami, FL" and where they visited.
In short, you will achieve better results if you work within Google's system then by trying to work around it.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate content
Dear community, We have 15 product specific landing pages. They all share a block called "Why invest in VanEck ETFs?", see e.g., https://www.vaneck.com/de/en/mining-etf https://www.vaneck.com/de/en/space-etf/ https://www.vaneck.com/de/en/esports-etf/
Content Development | | marketing-europe
Can this lead to SEO penalization because of duplicate content?0 -
How can I use the images to improve my SEO?
Hi there, I am starting to work on my web and I have a question regarding the featured images of the articles. How can I work with them to improve the SEO of my posts? Thank you in advance! 🙂
Content Development | | lucywrites0 -
Are Duplicate Bio's Duplicate Content?
I'm wondering if I need to go through all the various bio's our firm has on all the various legal directory and client review sites to make sure they have unique bio's? I really, really don't want to do that, but if they are going to flagged as duplicate content, I will. I'm hoping sites like "findlaw," "avvo," etc, have some built in rel:cannonical or something that says these bio's aren't to be seen as unique content and therefore conflict with what's on our site. Anybody know? Just to clarify, any of the sties that have asked for unique content/bio/about us, I have complied with that. However, a lot don't specifically state it has to be unique, so I've just copy and pasted from our site in those cases. Thanks, Ruben
Content Development | | KempRugeLawGroup0 -
Is it Possible for an Internal Page to Rank for Various Terms Based ONLY on Blogging Anchor Text?
Hi everyone, Our company provides about 6 different services, each with a specific page on our website: 1. Accept ACH Payments (/accept_ach_payments.html) 2. Client Management & Billing Software (/customer_management.html) 3. Small Business Merchant Accounts (/small_business_merchant_account.html) etc etc Now, here's the question. One of our blogging strategies is to write content about how our online platform can help various types of businesses manage and grow their business. "5 Ways Fitness Business Can...." "How Law Firms Can Benefit...." etc In these blog posts, we don't specify our product, but we do link back into one of those main service pages, so I might link fitness management software to the Client Management & Billing Software (/customer_management.html) page as well as legal billing software to the same client management page Since there are so many different companies that could use our software, we don't want to include them on the Cl_i_ent Management & Billing Software page. That page is just about the benefits of the system and how it works as a great CRM. So....to make a long question short, are we able to rank the Client Management page for "fitness management software" and "legal billing software" if we don't use those terms on the "client management" page itself, and only use it as the anchor text when linking? Instead of making a separate page about how we can be used as a fitness management platform, we'd like our "client management" page to rank for various terms like "fitness management software" "legal billing software" "online church donation software" etc BUT, we don't want to bloat the client management page will all those other topics and content. Hope that makes sense, Patrick
Content Development | | SmallBizSmarts0 -
Duplicate Page Content & Rel-Canonicals
The SEO Moz duplicate page content tool lists the following URL's as having duplicate content: http://www.savvyboater.com/1988-newer-8-tooth-15-hp-honda-outboard-props.aspx http://www.savvyboater.com/1988-newer-8-tooth-15-hp-honda-outboard-props.aspx?sort=PriceAsc&pi=2 The second URL is the price sorter/second page of the category and contains the following rel-canonical: | http://www.savvyboater.com/1988-newer-8-tooth-15-hp-honda-outboard-props.aspx"> Are we using the rel-canonical correctly in this case? If so, why does it continue to show up as duplicate content in our SEO Moz report? There are over 1,000 URLS listed in the report with the exact same issue. |
Content Development | | ironpac0 -
Can you use creative commons non-commercial images on a company blog?
Does anyone know if it is okay to use creative commons images on your company blog if they are under the Attribution-NonCommercial-NoDerivs 2.0 Generic license. Technically you are using it on a commercial site, but you are not directly making money from the image or selling it.
Content Development | | ProjectLabs0 -
How does google react to duplicate shops on ecommerce sites
Surely shopping cart sites are going to have a lot of duplicate content? Does google recognise this? Is there anything I can do let google know?
Content Development | | borderbound0 -
How to titling images in WP blog
What is the best way to title an image in a blog post. The wording will relate to the post discussion so I am not discussing the word stuffing, rather how to enter the words. Here are my options for the title: 1. dayton engagement photos 2. dayton_engagement_photos 3. daytonengagementphotos 4. Is there another preferred method? Should I increment the image title for each image such as: daytonengagementphotos1, daytonengagementphotos2, etc. What about the alternative text area? Does the same concept apply there? Thanks hR3ua.jpg
Content Development | | maximphotostudio0