Resolving duplicate text issues with a duplicate image?
-
We are a listing site for programs overseas. Many of our listings are inherently the same content, because in many cases the same exact information applies. We have resolved duplicate content issues to some extent by making some of the content in these listings unique. However, for the rest of the content which is going to be the same for about 100 pages, we were wondering if its better to have an image in place instead of duplicate text content (this would basically be an image of the text in question). We know this is a problem, because this is inherently duplicate content as well (only its a duplicate image instead of duplicate text). However, what's the best solution to this problem, and is a duplicate image just asking for trouble, or might this actually be a good idea?
-
Google won't index image-embedded text on a webpage (currently only .pdf documents)
If you want a little more insurance, which you won't really need, use your handy robot.txt or rel="canonical"
As usual, keep your eyes forward:
"While search engines may not use OCR for indexing the content of web pages now, that doesn’t mean that they might not in the future, and there are some indications that the search engines are developing a much greater proficiency in the use of optical character recognition."
Here's that article, including some great references.
Good luck.
-
Could you point me to a valid reference on that OCR issue?
-
You should use rel=canonical tag on duplicate content pages. Google can read text embedded as an image through OCR algorithm. So duplicate image is not a good option. Moreover think how these images will increase the load time of the web pages.
-
To directly answer your question, there are a few ways you can present content in a manner that is not readily crawlable for search engines: flash, iframe and images.
As far as good ideas, I much prefer to offer real content which is unique to the given area. Let's say you are a US-based site offering programs for attending universities overseas. Add some content specific to each country's page to make it unique.
If you present Malaysia as a country, talk about their universities by name, awards they have won, landmarks and other items of interest such as their incredibly diverse forests. You can also provide testimonials from satisfied clients. Testimonials can help establish a lot of relevancy as clients will often mention specifics about where they are from "John from Miami, FL" and where they visited.
In short, you will achieve better results if you work within Google's system then by trying to work around it.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Google Index Issue
2 months ago, I registered a domain named www.nextheadphone.com I had a plan to learn SEO and create a affiliate blog site. In my website I had 3 types of content. Informative Articles Headphone Review articles Product Comparision Review articles Problem is, Google does not index my informative articles. I dont know the reasons. https://www.nextheadphone.com/benefits-of-noise-cancelling-headphones/
Content Development | | NextHeadphone
https://www.nextheadphone.com/noise-cancelling-headphones-protect-hearing/ Is there anyone who can take a look and find the issues why google is not indexing my articles? I will be waiting for your reply0 -
Should cornerstone content have 3,500 words? Does Google discern words from the main text and from the references?
Is it true that cornerstone content should have at least 3,500 words? I've done some research and found that the recommended amount is between 2K-10k. Also, the content that we create/publish has a lot of references/citations at the end of each article. Does Google discern words from the main text and from the references? Meaning should I count references as part of the word count? Thanks for the help!
Content Development | | kvillalobos0 -
Does a blog appearing in diff. categories cause duplicate content?
Having a particular blog appearing in different categories or under different tags on a site- does it cause duplicate content? If yes, why ?
Content Development | | Personnel_Concept0 -
How can I resolve a duplicate page issue?
I have (an attached) report that shows duplicate content for a blog page and I'm not sure how to resolve the issue. The blog/website is hosted on wordpress.org, maybe it's something to do with having to add categories or tags - can anyone help please? SDtXT.png
Content Development | | lindsayjhopkins0 -
How can i solve duplicate problem with different url needed?
My client is a big international firm with 10 websites with different url (.co.uk, .com, .com.au, .pl... etc). All websites are exactly the same except the price. I suggested them to only use .com and use region as a sub domain like au.xxx.com instead of xxx.com.au. However they cannot do that for some reason. I am trying to solve the duplicate issue. I dont think i can use 301 redirect or canonial link because all regions are making even traffics. Any suggestions?
Content Development | | ringochan0 -
Why is this store getting hurt in SERPs when they removed duplicate content?
I work with an e-commerce client who got hit hard by Panda. They are very cautious, and want small-scale tests to prove each hypothesis before committing to larger changes. Recently, we reworked content on 30 product detail pages. Before, these product pages featured some original content mixed with some manufacturer content. The change we made was to remove the manufacturer content completely from the product page, leaving about 300 words of high-quality, original content--all of which was written by subject matter experts. I assumed that Google viewed this manufacturer text as duplicate content. However, when these 30 modified pages were compared to the control, they performed significantly worse. Question 1: Does any have any idea why these pages would perform worse than the control?
Content Development | | merch_zzounds
Question 2: Do you have any tips for convincing this client to try another test or get the buy-in to make the larger changes that--in theory--need to happen? FWIW, this client has about 10,000 product detail pages--the vast majority of which contain just manufacturer content. I appreciate your thoughts.0 -
Duplicate Content Penalty
If our pages are to have roughly 30% of non-original textual content, can we be penalized by Google? Or are we OK as long as this non-original content is relevant to the pages?
Content Development | | Quidsi0 -
Duplicate Content on WordPress Blogs?
We are getting ready to add a WordPress blog to our established website. Our plans are to place it in a subfolder on our website to maximize rank. My question is...Do we need to utilize a Meta Robots WordPress plugin by Yoast or similar so that noindex,follow robots meta tags will prevent search engine indexing of search result pages, subpages and category archives? We want to avoid the dreaded Duplicate Content Error and penalty. Any other great SEO WordPress plugins? Thank you for your time. Brian
Content Development | | gw3seo0