Duplicate Content within Website - problem?
-
Hello everyone,
I am currently working on a big site which sells thousands of widgets. However each widget has ten sub widgets (1,2,3... say)
My strategy with this site is to target the long tail search so I'm creating static pages for each possibly variation.
So I'll have a main product page on widgets in general, and also a page on widget1, page on widget2 etc etc.
I'm anticipating that because there's so much competition for searches relating to widgets in general, I'll get most of my traffic from people being more specific and searching for widget1 or widget 7 etc.
Now here's the problem - I am getting a lot of content written for this website - a few hundred words for each widget. However I can't go to the extreme of writing unique content for each sub widget - that would mean 10's of 1,000's of articles.
So... what do I do with the content. Put it on the main widget page was the plan but what do I do about the sub pages. I could put it there and it would make perfect sense to a reader and be relevant to people specifically looking for widget1, say, but could there be a issue with it being viewed as duplicate content.
One idea was to just put a snippet (first 100 words) on each sub page with a link back to the main widget page where the full copy would be.
Not sure whether I've made myself clear at all but hopefully I have - or I can clarify.
Thanks so much in advance
David
-
What's wrong with having ten brass widgets in ten different colors and ten buy buttons all listed on a single page?
I do that I we see lots of people buying a brass widget in every color. I think that this is great for getting more sales. If I was a shopper it would be a real frustration to visit ten pages to get one of each color - or just visit all of those pages to see which color I like best.
Most important, Google might see that and say.... This page has brass widgets in EVERY FREEKING COLOR! and decide to show it to visitors who search for them.
Now, if you are compulsive about having one page per widget and having your writer create yada yada yada content for all of them, keep in mind that you are wasting a lot of money on near duplicate content, boring your writers and spreading your pagerank out over a lot of pages.
-
David, the sub-pages as far as Goggle was concerned fed all the juice to the product page.
No the subpages were not indexed as we told Google they all came from the same page in the canonical.
How do you describe a red widget1 differently to blue widget1? The item is the same but there is only one word different in the content, so we decided to skip a physically different url for the different colours and just use different anchors on the thumbnail images. The title and alt tags would contain specific information about the colour of the widget.
If someone searches for red widget1 and we have keyword strength in widget1 they will get to the widget1 page where they will see the red widget1 and any other colours for that widget1.
The canonical allows you to specify the content origin. So if you have /category/widget1/red and /category/widget1/blue describing the same content you could use /category/widget1 in the canonical ref and both pages would give juice to the main page and get no duplicate content penality.
This only works if you have a small number of variants on each widget as Ryan pointed out, such as size, colour variations etc. Otherwise it is too confusing for humans to follow.
With the amount of content you are looking at, it is probably worthwhile getting a usability study done.
-
SEO = Manipulation doesn't it?
You can call me naive but those days of SEO are either gone or disappearing fast.
I view SEO as working to understand the ever-changing metrics search engines use to rank search results, then applying that knowledge to websites.
We are manipulated into improving our sites to provide a better user experience. The changes we make have lasting value. Other forms of SEO are always one update away from making a post asking "what happened to my site's rankings?"
-
Thanks for the replies, guys.
Oznappies - did that structure mean that all your subproduct pages were pretty much devoid of link juice? Were they even indexed? The big question is if someone searhed for 'red product a' which page showed up? Excuse my ignorance re the canonical stuff.
Ryan, Yes you are right to some degree. I am reverse engineering the website so to speak. But nevertheless I plan to offer huge value to visitors - I have spared little expense with the content writing, usability etc plus we have some fairly radical ideas that should be hugely popular with the visitors.
But I take exception that this is the wrong way to go about it. SEO = Manipulation doesn't it? The old adage 'Just make great content then users will find it and link to it and you'll dominate the serps' is a great theory but we all know in practise it doesn't work like that in 99% of the cases. To get your great product out there you have to give it a push, find an angle to exploit and this targeting of long tail is my angle.
It will be a great site I assure you
-
If the widgets are truly different products, then they should have separate product pages. If you have a weather widget, a currency exchange widget, a local time widget, etc. then you can clearly build unique content for each page.
If you offer a widget in different colors, sizes, etc. but it is really the same widget, you can't effectively generate new content for each page. Your best approach is creating a single, strong page for the widget. The "blue", "yellow" and other widget pages should be canonicalized to the main widget page.
I am getting a lot of content written for this website - a few hundred words for each widget. However I can't go to the extreme of writing unique content for each sub widget - that would mean 10's of 1,000's of articles.
That sums it up pretty well. You are having content "written" which often means it is not quality content. You are not willing to write unique content for each sub widget either. You are not developing your site for the best user experience, but instead to manipulate search engine traffic. Google is focused on preventing you from doing exactly what you are trying to do. Even if you succeed, you will be back here in a couple months asking "why did my site drop so far" after Google makes an update to adjust for this type of manipulation.
You have two options. Condense all your content to one widget page, or develop each widget page as if it was the only page on your website. When you sit down and think "I have 10k pages and I need to have content on all of them" your content will be inferior to other sites, and your SERP will reflect as much.
-
We had a similar issue but not to that scale. We had product A in Red, Blue, Green etc the first approach we used a url /category/product?id=subproduct and set id as a parameter in Google Webmaster Tools site config. This passed all the link juice to /category/product and ensured that all pages had the appropriate for the link juice page.
We then decided that all those page loads just to basically show an image for each subproduct were a pain for the customer and so decided to show small images on the /category/product page an use a jquery call to overlay a larger image when the customer clicked a particular product. This produced faster load time and better customer experience.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Internal link is creating duplicate content issues and generating 404s from website crawl.
Not sure what the best way to describe it but the site is built with Elementor page builder. We are finding out that a feature that is included with a pop modal window renders an HTML code as so: Click So when crawled I think the crawling is linking itself for some reason so the crawl returns something like this: xyz.com/builder/listing/ - what we want what we don't want xyz.com/builder/listing/ xyz.com/builder/listing/%23elementor-action%3Aaction%3Dpopup%3Aopen%26settings%3DeyJpZCI6Ijc2MCIsInRvZ2dsZSI6ZmFsc2V9/ xyz.com/builder/listing/%23elementor-action%3Aaction%3Dpopup%3Aopen%26settings%3DeyJpZCI6Ijc2MCIsInRvZ2dsZSI6ZmFsc2V9//%23elementor-action%3Aaction%3Dpopup%3Aopen%26settings%3DeyJpZCI6Ijc2MCIsInRvZ2dsZSI6ZmFsc2V9/ so you'll notice how that string in the HREF is appended each time and it loops a couple times. Could I 301 this issue, what's the best way to go about handling something like this? It's causing duplicate meta descriptions/content errors for some listing pages we have. I did add a rel='nofollow' to the anchor tag with JavaScript but not sure if that'll help.
Technical SEO | | JoseG-LP0 -
Auto genrated content problem?
Hi all, I operate a Dutch website (sneeuwsporter.nl), the website is a a database of European ski resorts and accommodations (hotels, chalets etc). We launched about a month ago with a database of about 1700+ accommodations. Of every accommodation we collected general information like what village it is in, how far it is from the city centre and how many stars it has. This information is shown in a list on the right of each page (e.g. http://www.sneeuwsporter.nl/oostenrijk/zillertal-3000/mayrhofen/appartementen-meckyheim/). In addition a text of this accomodation is auto generated based on some of the properties that are also in the list (like distance, stars etc). Below the paragraph about the accommodation is a paragraph about the village the accommodation is located in, this is a general text that is the same with all the accommodations in this village. Below that is a general text about the resort area, this text is also identical on all the accommodation pages in the area. So a lot of these texts about the village and area are used many times on different pages. Things went well at first and every day we got more Google traffic, and more and more pages. But a few days ago our organic traffic took a near 100% dive, we are hardly listed anymore and if we are at very low places. We expect the Google gave us a penalty. We expect this to be the case because of 2 reasons: we have auto generated text that only vary slightly per page we re-use the content about villages and area's on many pages We quickly removed the content of the villages and resort area's because we are pretty sure that this is definitely something Google does not want. We are less sure about the auto generated content, is this something we should remove as well? These are normal readable text, they just happen to be structured more or less the same way on every page. Finally, when we made these and maybe some other fixes, what is the best and quickest ways to let Google see us again and show them we improved? Thanks in advance!
Technical SEO | | sneeuwsporter0 -
Duplicate content problem from an index.php file
Hi One of my sites is flagging a duplicate content problem which is affecting the search rankings. The duplicate problem is caused by http://www.mydomain.com/index.php which has a page rank of 26 How can I sort the duplicate content problem, as the main page should just be http://www.mydomain.com which has a page rank of 42 and is the stronger page with stronger links etc Many Thanks
Technical SEO | | ocelot0 -
Duplicate Content - Just how killer is it?
Yesterday I received my ranking report and was extremely disappointed that my high-priority pages dropped in rank for a second week in a row for my targeted keywords. This is after running them through the gradecard and getting As for each of them on the keywords I wanted. I looked at my google webmaster tools and saw new duplicate content pages listed, which were the ones I had just modified to get my keyword targeting better. In my hastiness to work on getting the keyword usage up, I neglected to prevent these descriptions from coming up when viewing the page with filter parameters, sort parameters and page parameters... so google saw these descriptions as duplicate content (since myurl.html and myurl.html?filter=blah are seen as different). So my question: is this the likely culprit for some pretty drastic hits to ranking? I've fixed this now, but are there any ways to prevent this in the future? (I know _of _canonical tags, but have never used them, and am not sure if this applies in this situation) Thanks! EDIT: One thing I forgot to ask as well: has anyone inflicted this upon themselves? And how long did it take you to recover?
Technical SEO | | Ask_MMM0 -
If two websites pull the same content from the same source in a CMS, does it count as duplicate content?
I have a client who wants to publish the same information about a hotel (summary, bullet list of amenities, roughly 200 words + images) to two different websites that they own. One is their main company website where the goal is booking, the other is a special program where that hotel is featured as an option for booking under this special promotion. Both websites are pulling the same content file from a centralized CMS, but they are different domains. My question is two fold: • To a search engine does this count as duplicate content? • If it does, is there a way to configure the publishing of this content to avoid SEO penalties (such as a feed of content to the microsite, etc.) or should the content be written uniquely from one site to the next? Any help you can offer would be greatly appreciated.
Technical SEO | | HeadwatersContent0 -
Duplicate content
I have just ran a report in seomoz on my domain and has noticed that there are duplicate content issues, the issues are: www.domainname/directory-name/ www.domainname/directory-name/index.php All my internal links and external links point to the first domain, as i prefer this style as it looks clear & concise, however doing this has created duplicate content as within the site itself i have an index.php page inside this /directory-name/ to show the page. Could anyone give me some advice on what i should do please? Kind Regards
Technical SEO | | Paul780 -
Duplicate content on my home
Hello, I have duplication with my home page. It comes in two versions of the languages: French and English. http://www.numeridanse.tv/fr/ http://www.numeridanse.tv/en/ You should know that the home page are not directories : http://www.numeridanse.tv/ Google indexes the three versions: http://bit.ly/oqKT0H To avoid duplicating what is the best solution?
Technical SEO | | android_lyon
Have a version of the default language? Thanks a lot for your answers. Take care. A.0 -
Duplicate content issues caused by our CMS
Hello fellow mozzers, Our in-house CMS - which is usually good for SEO purposes as it allows all the control over directories, filenames, browser titles etc that prevent unwieldy / meaningless URLs and generic title tags - seems to have got itself into a bit of a tiz when it comes to one of our clients. We have tried solving the problem to no avail, so I thought I'd throw it open and see if anyone has a soultion, or whether it's just a fault in our CMS. Basically, the SEs are indexing two identical pages, one ending with a / and the other ending /index.php, for one of our sites (www.signature-care-homes.co.uk). We have gone through the site and made sure the links all point to just one of these, and have done the same for off-site links, but there is still the duplicate content issue of both versions getting indexed. We also set up an htaccess file to redirect to the chosen version, but to no avail, and we're not sure canonical will work for this issue as / pages should redirect to /index.php anyway - and that's we can't work out. We have set the access file to point to index.php, and that should be what should be happening anyway, but it isn't. Is there an alternative way of telling the SE's to only look at one of these two versions? Also, we are currently rewriting the content and changing the structure - will this change the situation we find ourselves in?
Technical SEO | | themegroup0