Duplicate content issues, I am running into challenges and am looking for suggestions for solutions. Please help.
-
So I have a number of pages on my real estate site that display the same listings, even when parsed down by specific features and don't want these to come across as duplicate content pages. Here are a few examples:
http://luxuryhomehunt.com/homes-for-sale/lake-mary/hanover-woods.html?feature=waterfront
http://luxuryhomehunt.com/homes-for-sale/lake-mary/hanover-woods.html
This happens to be a waterfront community so all the homes are located along the waterfront. I can use a canonical tag, but I not every community is like this and I want the parsed down feature pages to get index.
Here is another example that is a little different:
http://luxuryhomehunt.com/homes-for-sale/winter-park/bear-gully-bay.html
http://luxuryhomehunt.com/homes-for-sale/winter-park/bear-gully-bay.html?feature=without-pool
http://luxuryhomehunt.com/homes-for-sale/winter-park/bear-gully-bay.html?feature=4-bedrooms
http://luxuryhomehunt.com/homes-for-sale/winter-park/bear-gully-bay.html?feature=waterfront
So all the listings in this community happen to have 4 bedrooms, no pool, and are waterfront. Meaning that they display for each of the parsed down categories. I can possible set something that if the listings = same then use canonical of main page url, but in the next case its not so simple.
So in this next neighborhood there are 48 total listings as seen at:
http://luxuryhomehunt.com/homes-for-sale/windermere/isleworth.html
and being that it is a higher end neighborhood, 47 of the 48 listings are considered "traditional listings" and while it is not exactly all of them it is 99%.
Any recommendations is appreciated greatly.
-
Endorsing Jared for the full thread/follow-up. Unfortunately, when it comes to indexing all of these pages, you can't really have your cake and eat it too in 2012. These pages do look thin to Google - honestly, when the results don't change (and I get that that's just because the filters don't always impact the search), then it starts to look like you're just spinning out duplicates to target new keywords in the header. At high volume, that could get you into trouble (and is the kind of thing Panda has targeted).
You're right, though, if you canonical these pages, they won't get indexed and ranked. These days, my gut reaction is that the trade-off is worth it. If you focus your ranking power, the core category/neighborhood/etc. pages will get more authority, you'll reduce the risks of thin content, and you'll land search users on core pages that they can use to navigate to the options they want.
There's no solution that doesn't involve a trade-off, but I think focusing your index would be a positive trade-off. Keep in mind, too, that Google isn't really that fond of search pages - ultimately, you want them indexing the core property listings. The key is to have clear paths to those listings and to index and ranking prominent category pages. If you try to rank for every variations of ever search/sort/etc., you'll just end up diluting your ranking ability in most cases.
-
I see, and yes it will.
I know for my real estate clients, the main listings page usually ranks naturally for info that is found in listings so for example "4 bedrooms" - we have a real estate client that ranks for "x real estate" and "x homes for sale" but also ranks for "4 bedroom homes for sale in x" simply because the listings summary have number of bedrooms in them (like yours does).
However for other variables, like "no pool", its gets trickier since no one lists a house on MLS citing "no pool".
The only two ways around this are: write unique content on every main page, and include the keywords you want like 'no pool' or
write some unique content for each variable - ie write some unique copy on the "no pool" page, write some unique copy on the 'waterfront' page, etc. Even then you are still running a risk of duplicate copy. Having the titles, breadcrumbs and h1's dynamically change just might not be enough. I would put all of my efforts (including linkbuilding) to the main landing page and just make sure to include the keywords i want (thats just an opinion).
What is the data showing now - are you being penalized? Are you ranking for any "without pool" or "waterfront" terms and if so, are they getting traffic?
-
First, thanks again for responding. The challenge I have with using the canonical tag for the variable pages is that, won't it prevent google from indexing the variable pages that include some terms/ phrases I am trying to rank for?
Like Hanover Woods foreclosure homes for sale or Hanover 4 bedroom homes for sale
-
Hi Joshua,
There are a number of ways to stop Google from counting your dynamic urls as duplicates. Its unclear from your question why you can't use canonical tags for this. If you went here:
http://luxuryhomehunt.com/homes-for-sale/lake-mary/hanover-woods.html
And add the canonical tag in the HEAD section:
It will solve your issue of duplication when people choose property variables like waterfront or bedroom #. I think you were trying to point out the reason this wont work at the end of your question but Im not exactly sure what you are eluding to there?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate Content and Subdirectories
Hi there and thank you in advance for your help! I'm seeking guidance on how to structure a resources directory (white papers, webinars, etc.) while avoiding duplicate content penalties. If you go to /resources on our site, there is filter function. If you filter for webinars, the URL becomes /resources/?type=webinar We didn't want that dynamic URL to be the primary URL for webinars, so we created a new page with the URL /resources/webinar that lists all of our webinars and includes a featured webinar up top. However, the same webinar titles now appear on the /resources page and the /resources/webinar page. Will that cause duplicate content issues? P.S. Not sure if it matters, but we also changed the URLs for the individual resource pages to include the resource type. For example, one of our webinar URLs is /resources/webinar/forecasting-your-revenue Thank you!
Technical SEO | | SAIM_Marketing0 -
Http v https Duplicate Issues
Hello, I noticed earlier an issue on my site. http://mysite.com and https://mysite.com both had canonical links pointing to themselves so in effect creating duplicate content. I have now taken steps to ensure the https version has a canonical that points to the http version but I was wondering what other steps would people recommend? Is it safe to NOINDEX the https pages? Or block them via robots.txt or both? We are not quite ready to go fully HTTPS with our site yet (I know Google now prefers this) Any thoughts would be very much appreciated.
Technical SEO | | niallfred0 -
How different should content be so that it is not considered duplicate?
I am making a 2nd website for the same company. The name of the company, our services, keywords and contact info will show up several times within the text of both websites. The overall text and paragraphs will be different but some info may be repeated on both sites. Should I continue this? What precautions should I take?
Technical SEO | | savva0 -
URL query considered duplicate content?
I have a Magento site. In order to reduce duplicate content for products of the same style but with different colours I have combined them on to 1 product page. I would like to allow the pictures to be dynamic, i.e. allow a user to search for a colour and all the products that offer that colour appear in the results, but I dont want the default product image shown but the product image for that colour applying to the query. Therefore to do this I have to append a query string to the end of the URL to produce this result: www.website.com/category/product-name.html?=red My question is, will the query variations then be picked up as duplicate content: www.website.com/category/product-name.html www.website.com/category/product-name.html?=red www.website.com/category/product-name.html?=yellow Google suggest it has contingencies in its algorithm and I will not be penalised: http://googlewebmastercentral.blogspot.co.uk/2007/09/google-duplicate-content-caused-by-url.html But other sources suggest this is not accurate. Note the article was written in 2007.
Technical SEO | | BlazeSunglass0 -
Does turning website content into PDFs for document sharing sites cause duplicate content?
Website content is 9 tutorials published to unique urls with a contents page linking to each lesson. If I make a PDF version for distribution of document sharing websites, will it create a duplicate content issue? The objective is to get a half decent link, traffic to supplementary opt-in downloads.
Technical SEO | | designquotes0 -
E-commerce solution and subdomain issues
Hello All,
Technical SEO | | CherieP
In light of Wil Reynold's closing keynote at Portland's Searchfest, I thought I might try posting here to get some advice. We run a family business on the side and we're looking at starting to use volusion.com for our e-commerce solution. The catch is we currently have a wordpress site summitmining.com running on thesis with great SEO. Ranking #1 & #2 for our highest trafficked terms. Ideally, I'd like Summitmining.com to direct to the Volusion store and then summitmining.com/blog to go to our wordpress installation BUT since the volusion site will be hosted with the company and they will not host our wordpress installation we'd have to use a subdomain instead of a subdirectory which I understand will be bad for SEO. Does anyone have any recommendation on how to set this up without totally screwing up our ranking OR any recommendations of an easy to use shopping cart (I've worked on a magento site before and it's too complex for us) that wouldn't require a separate or subdomain? Thank you so much!
-Cherie Prochaska
503-816-3557
cherie@c-squaredassociates.com
@cherieprochaska0 -
Please help....
Hi Guys! Ok a bit of a funny one here which is causing a confusion between us and a web designer and I was wondering if anyone on here might be able to help. Just a bit of back ground for you, the website has been built on Concrete 5 and when we tried to building a sitemap we found over 110,000 pages. When we spoke to the web designer they have told us that within Google webmaster tools, Google has only indexed 58. But.... (and this is where things get a little confusing, so bare with me.) I thought that cant be right so into the Google search bar I put in site:www.sitename.co.uk and had 217 results appear. So google cant have just 58 pages indexed, right? So after speaking to the designer he then posted on the Concrete 5 help forum, to try and help figure it out. I have posted his exact forum post below that the web designer has asked: I'm having some issues where a site we are working on seems to be making multiple pages going to the same page. An SEO specialist has run a report and found a number of duplicate pages created by C5. We are concerned that this is going to dilute or worse penalise the way google sees the site. http://www.sitename.co.uk/
Technical SEO | | NoisyLittleMonkey
[http://www.sitename.co.uk/index.php?cID=?akID[155]atSelectOptionID...
[http://www.sitename.co.uk/index.php?cID=?akID[155]atSelectOptionID...
[http://www.sitename.co.uk/index.php?cID=?akID[155]atSelectOptionID... Is there a way of stopping google from accessing these duplicate 'cID' pages and stop them being made? Also is there a way of getting rid of the ones that are there? We've done a number of sites in C5 and are beginning to get concerned about this... So I guess my question is: If I can access the same content via 4-5 different cID's is that classed as duplicate content? Thanks in advance guys, and any help would greatly appreciated. 🙂0