Duplicate content list by SEOMOZ
-
Hi Friends,
I am seeing lot of duplicate (about 10%) from the crawl report of SEOMOZ.
The report says, "Duplicate Page Content"
But the urls it listed have different title, different url and also different content. I am not sure how to fix this issue..
My site has both Indian cinema news and photo gallery. The problme mainly coming in photo gallery posts.
for example:
this is the main url of a post.
apgossips.com/2012/12/18/telugu-actress-poonam-kaur-photos . But in this post, each image is a link to its enlarged images (default wordpress). The problem is coming with each individual image with in this post.
examples of SEOMOZ report 3 individual urls as duplicate content...from the same above post.:
Some body please advise me.. Appreciate your help.
-
You can always edit the image in the Wordpress post and then remove the link to the media file. This will prevent those pages from getting crawled from the post page.
If you are rewriting the URL you could install the Yoast SEO plugin and then choose the noindex/follow meta tag for date based archives as shown here:
-
No, I am not indexing archives at all. But the url is customized to include month/dd/yr in the permalink structure.
the issue here is, in a single post of photogallery, each individual photo is getting indexed I guess...
-
Are you using the date based archives? I would stop using those if I were you and assign articles to the appropriate category so your URL's are optimized for SEO.
The reason I'm suggesting this is because the easiest way to solve the problem is to disable indexing on date based archives which will also eliminate the duplicate content on the photo's b/c they are basically date based photo archives.
Let me know,
-
Thank you James, for the quick response.
I am kind of new to SEO, may I request you how to do the two step you said above?
1. What is the code i need to add in robots.txt?
2. How to remove default media links in wordpress?
I appreciate your response.
thanks,
KS
-
I prefer to disallow photo archive pages via robots.txt.
I also recommend removing the default link WP inserts to the media image page.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Canonical: Same content but different countries
I'm building a website that has content made for specific countries. The url format is: MyWebsite.com/<country name="">/</country> Some of the pages for <specific url="">are the same for different countries, the <specific url="">would be the same as well. The only difference would be the <country name="">.</country></specific></specific> How do I deal with canonical issues to avoid Google thinking I'm presenting the same content?
On-Page Optimization | | newbyguy0 -
How to explain to a client that duplicate content is bad...
Afternoon! An SEO client of ours has copied a load of landing/category page content from other sites. Lots of emails have been sent back and forth asking them to remove it, but they are adamant to keep it up there until we have time to amend it. We have explained to them: The Google penalty risks The copyright risks The short and long-term implications for their brand new business/website The money they are spending on our SEO package could be completely wasted if they're caught I think the above is pretty black and white, but the director of this company will not budge. Does anyone have any different approaches? The director said he's happy for us to amend the content but, in the meantime, the plagiarised content will not be removed. Cheers, Lewis
On-Page Optimization | | PeaSoupDigital0 -
How does Indeed.com make it to the top of every single search despite of having aggregated content or duplicate content
How does Indeed.com make it to the top of every single search despite of having duplicate content. I mean somewhere google says they will prefer original content & will give preference to them who have original content but this statement contradict when I see Indeed.com as they aggregate content from other sites but still rank higher than original content provider side. How does Indeed.com make it to the top of every single search despite of having aggregated content or duplicate content
On-Page Optimization | | vivekrathore0 -
Duplicate Issue
Hello Mozzers! We have a client going through a website revamp. The client is The Michelangelo Hotel, and they are part of Star Hotels. Star Hotels plans to create a section on their site for The Michelangelo, as opposed to maintaining a stand alone site. They will then take the michelangelohotel.com domain, and point it to the corresponding pages on the Star site. The guest will key in www.michelangelohotel.com, and will see the same content that can be found on www.starhotel.com/en/michelangelo-hotel-new-york. The problem we have is this: Essentially the same content will be indexed twice, once on starhotels.com and once on michelangelohotel.com. This would seem to cause a duplicate content issue. What are your thoughts? Edit: I apologize, because I was not nearly clear enough here. The Star Hotels site will have 5 pages dedicated to The Michelangelo Hotel. The content will sit solely on that server as those 5 pages. Those 5 pages will each be indexed as 2 URLs. www.michelangelohotel.com <-> www.starhotels.com/en/michelangelo/ www.michelangelohotel.com/accommodations <-> www.starhotels.com/en/michelangelo/accommodations And so on. Thanks!
On-Page Optimization | | FrankSweeney0 -
E-Commerce Site - Duplicate Content
We run an e-commerce site with about 250,000 SKUs. Certain items, such as a micro USB car charger, will be applicable to several different phones. Example: http://www.wirelessemporium.com/p-165787-samsung-galaxy-proclaim-illusion-sch-i110-heavy-duty-car-charger.asp http://www.wirelessemporium.com/p-165856-sony-xperia-ion-4g-lte-att-heavy-duty-car-charger.asp As one can imagine with so many items, unique content for each item description page can be a challenge. What would be the best way to address this on a large scale?
On-Page Optimization | | eugeneku0 -
Is content aggregation good SEO?
I didn't see this topic specifically addressed here: what's the current thinking on using content aggregation for SEO purposes? I'll use flavors.me as an example. Flavors.me lets you set up a domain that pulls in content from a variety of services (Twitter, YouTube, Flickr, RSS, etc.). There's also a limited ability to publish unique content as well. So let's say that we've got MyDomain.com set up, and most of the content is being drawn in from other services. So there's blog posts from WordPress.com, videos from YouTube, a photo gallery from Flickr, etc. How would Google look at this scenario? Is MyDomain.com simply scraped content from the other (more authoritative) sources? Is the aggregated content perceived to "belong" to MyDomain.com or not? And most importantly, if you're aggregating a lot of content related to Topic X, will this content aggregation help MyDomain.com rank for Topic X? Looking forward to the community's thoughts. Thanks!
On-Page Optimization | | GOODSIR0 -
Can duplicate content issues be solved with a noindex robot metatag?
Hi all I have a number of duplicate content issues arising from a recent crawl diagnostics report. Would using a robots meta tag (like below) on the pages I don't necessarily mind not being indexed be an effective way to solve the problem? Thanks for any / all replies
On-Page Optimization | | joeprice0 -
Does putting content in tabs devalue it at all?
Hello! Still very new to the SEO world and just trying to soak in as much information as I can. The site I work for took a substantial hit with the panda update, so we are looking into adding as much quality content as we can in the upcoming months. With our current site layout, space will quickly become an issue. Assuming the content is relevant and useful for the page, will putting the content into tabs be counter productive or devalue it at all?
On-Page Optimization | | davegtt0