Can Sitemap Be Used to Manage Canonical URLs?
-
We have a duplicate content challenge that likely has contributed to us loosing SERPs especially for generic keywords such as "audiobook," "audiobooks," "audio book," and "audio books."
Our duplicate content is on two levels.
1. The first level is at our web store, www.audiobooksonline.com.
Audiobooks are sometimes published in abridged, unabridged, on compact discs, on MP3 CD by the same publisher. In this case we use the publisher description of the story for each "flavor" = duplicate content.
Can we use our sitemap to identify only one "flavor" so that a spider doesn't index the others?
2. The second level is that most online merchants of the same publisher's audio book use the same description of the story = lots of duplicate content on the Web.
In that we have 11,000+ audio book titles offered at our Web store, I expect Google sees us as having lots of duplicated (on the Web) content and devalues our site.
Some of our competitors who rank very high for our generic keywords use the same publisher's description.
Any suggestions on how we could make our individual audio book title pages unique will be greatly appreciated.
-
Your sitemap.xml can't be used to solve this issue, Larry. The sitemap is only used to tell the search engines which pages exist on the site, not what to do if many of those pages share similar content.
In your case, likely the best approach is to use the rel=canonical tag to inform the search engines that you aware that the different formats of the audiobooks share similar descriptions, and to pick one format to be the primary page. Once you've determined the primary page, the other formats' pages would use the canonical tag in their headers to point to the primary page.
This essentially tells the search engines "these other pages are useful to the user, so I don't want to hide them, but they are really variations of the primary page, so assign all their value to the primary page, please".
This process is only a suggestion to the search engines, but it is usually heeded. The only real alternative would be to combine all the different format pages into one page with a description of the book, then listing the other formats and their prices. Kinda doubt your eCommerce system would allow this "out of the box". (You would then 301-redirect all the other format pages to the new main page.)
As for the fact that the book descriptions are the same as the publisher's and all the other sites - the only way around this is to write your own custom descriptions. There are many reasons the other sites could be ranking well even with those duplicate descriptions, ranging from better overall site authority, to having been online longer, to having better, more powerful incoming links.
It's a tough spot to be in, but you could start by rewriting the descriptions for, say, the top 25 books (according to your Analytics and your own instincts for which ones are the most valuable sales) and see if that results in an improvement to rankings and conversions.
One other way to beat the duplicate content in this case would be to get customers to leave reviews which are included on each page. These reviews would be different from other sites, making the overall content look different to the search engines. But this is also a lot of work to get to scale up as your customers must be encouraged to come back to your site at a later date to leave the review.
Hope that helps;
Paul
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Canonical: Same content but different countries
I'm building a website that has content made for specific countries. The url format is: MyWebsite.com/<country name="">/</country> Some of the pages for <specific url="">are the same for different countries, the <specific url="">would be the same as well. The only difference would be the <country name="">.</country></specific></specific> How do I deal with canonical issues to avoid Google thinking I'm presenting the same content?
On-Page Optimization | | newbyguy0 -
Paginated URLs are getting Indexed
Hi, For ex: - My site is www.abc.com and Its paginated URLs for www.abc.com/jobs-in-delhi are in the format of : www.abc.com/jobs-in-delhi-1, www.abc.com/jobs-in-delhi-2 and vice versa also i have used pagination tags rel=next and rel=prev. My concern is all the paginated URLs are getting indexed so is their any disadvantage if these URLs are getting indexed as somewhere i have read that link juice may get distributed in case of pagination. isn't it good to use Noindex, Follow so that we can make the Google to understand that paginated page are not so much important and that should not be ranked.
On-Page Optimization | | vivekrathore0 -
Wordpress sitemap url problem causing WMT errors
The following types of links are appearing in my webmaster tools crawl errors report under 'other'. I've noticed they are in my sitemaps ( I run wordpress and use a plugin called Google XML sitemaps). How do I get rid of this error? http://www.musicliveuk.com/bands/postname%/
On-Page Optimization | | SamCUK0 -
Redirect both / and non-/ URLs?
I am doing SEO on WP site. Due to some duplicate pages (rel canonical was done before) I am doing 301 redirects at the moment. And I wonder if I need to redirect both links w/ and w/o trailing slash. Default is non www, w/o trailing slash. Like there is .com/category/news but there is same page linked in .com/news (well it works when permalink structure is set to /%category%/%postname% and returns 404 error when structure is set to /%postname%).
On-Page Optimization | | OVJ
I redirected .lt/naujienos to .lt/category/naujienos. Should I also redirect .lt/naujienos/ (with trailing slash)? There's absolutely no problem redirecting this, but there are some more pages which I want to edit their URLs and I wonder If I should do both redirects from links /w and w/o slash?1 -
Sorry, but that URL is inaccessible
Our site is all on the https protocol so every time I use the on-page grader it tells me the link is unavailable. What can we do? When I use the http protocol (which is 301 redirected to the https) it still gives me the same message.
On-Page Optimization | | whiskeyfl1 -
Should i make all of my pages with canonical tag
Hi, Im using thesis Wordpress theme, and their default option is "Add canonical <acronym title="Uniform Resource Locator">URL</acronym>s to your site" im just wandering if i should keep that box checked and apply canonical <acronym title="Uniform Resource Locator">URL</acronym>s to all of my pages? Thank You
On-Page Optimization | | Vmezoz0 -
Conflicting Canonical Tag in On-Page Report
I'm going through my site with the "One-Page report card for some PPC landing pages. If I have this canonical tag on my page : I get a Fail in the "Critical Factors Section here: Appropriate Use of Rel Canonical but a Pass in the Optional Factors here: Canonical URL Tag Usage If I take the canonical link out, the opposite happens, what am I missing? Is the format wrong? Thanks Michael
On-Page Optimization | | mjrinvent0 -
Moving our current homepage to a new URL
Our homepage currently speaks to a specific product and we're re-doing our homepage to be more about the brand which links to the product. The current home page has PA of 62 with thousands of links to the page. Question is are there any best practices around this or any risks? So current page is: www.xyz.com which we will be refreshing then moving the existing content to www.xyz.com/product so all the subdirectories gets shifted over 1 Thank in advance for the help!
On-Page Optimization | | JoeLin0