Drupal SEO help - Duplicate content but very similar URLS?
-
Hi,
This is a very strange problem and not sure how it has happened. I am adding packages to my website and a duplicate page & almost identical URL is being picked up by Google.
E.g. the page I make is http://www.ukgirlthing.co.uk/hen-party/bristol-spa-rty-lunch-pampering-h... but then also appearing is http://www.ukgirlthing.co.uk/hen-party/bristol-spa-rty-pampering-hen-party. The node's are exactly the same, and if i edit one of them, the other also updates. You will notice that the URL's are almost exactly the same, except the words are re-organised slightly?
Shall i just delete the URL alias of the duplicate entry or is there something else which is making this happen?
These URL's are being picked up as duplicate content, although it's the same node!
Hope you can help,
Thank you!
-
Thanks for your help in this Yiannis.
We are now sifting through a lot of the duplicate alias's, creating redirects, and then going into delete them. Also altered the option going forward. I have decided to go with 'Do nothing' as i don't want it to make any URL alterations - That should be OK shouldnt it?
I guess this is case closed - Really appreciate your time and help in this mate!
Sunny
-
HI mate, sorry for the late response.
It should and I agree with you but Open Source was never perfect Drupal is more time consuming and complicated than Wordpress and there is a reason it refers to more "Geeky" people if I may say than the average WP user.
However the end sites are 20.000 times cleaner than any wordpress site and definitely worth the effort and time put into it!
-
No problem at all, I appreciate your help!
Got a question about this module - Why does it ask what to do with an alias, shouldn't it just keep the alias once i make a page, and even if i make changes to it, it stays the same? I don't understand the 'create a new...' etc.. and why it would do that.
When it creates the new alias, they both still point to the same node, so confused why it would make a new alias, rather than leaving the old one intact and not creating a new one?
Thanks!
-
Sorry for the late response, got hooked up with work
Try the 3rd option please which you delete old alias once you create a new one. Select the desirable URL as the new one and save.
Let me know if that solves the problem.
ps. If the old url is indexed and ranked you might want to set a 301 redirect just in case
-
No problems at all Yiannis, feel free to ask as much information as possible!
Just looked in the auto-path settings and found this (see image attachment)
I have a feel this maybe on the wrong setting? If so, what would you advise? Should i change to 'Do nothing, leave the old alias intact' and then go through URL Alias's, find the duplicate wrong ones and manually delete them? Then going forward, no more will occur?
Look forward to hearing your thoughts!
Thanks!
-
Again check this with your developers as its been a while but I think it has to do with your setting on Pathauto (I cant log in to be 100% sure of what you've selected there). Did you install Pathauto after you manualy changed the url? or were both generated via pathauto? Also both pages have canonical urls set on those 2 different urls. Is this intended?
Sorry for blasting you with questions but as I cant see your dashbaord I am trying to get a clear picture
-
Hi Yiannis!
Thanks for this!
Modules we have are Pathauto, <label style="display: inline !important;" for="edit-status-canonical-url">Canonical URL and </label>****<label style="display: inline !important;" for="edit-status-canonical-url"><label style="display: inline !important;" for="edit-status-seochecklist">SEO Checklist. Drupal version is 6.</label></label>
<label style="display: inline !important;" for="edit-status-canonical-url"><label style="display: inline !important;" for="edit-status-seochecklist">I have recently requested our developers to install Global Redirect which I have read solves many other SEO issues with Drupal.</label></label>****<label style="display: inline !important;" for="edit-status-canonical-url"><label style="display: inline !important;" for="edit-status-seochecklist"> </label></label>
I checked within the URL aliases and the 2 different aliases are showing, and are reporting the same node - See both image uploads.
Thanks
-
It's been a VERY long time since I did a web site on Drupal but I will give this a go. To start with can you please list me the modules you use on your web site? Most importantly do you use "Pathauto"? If yes I think there might be a conflict with your URL aliases. Let me know. Also is this Drupal 6 or 7?
-
Hi Andy,
If I make an amendment on one page, the other also shows the amendment, and so for this reason, i can't delete it as both will go. I think you maybe onto something, as in this might be caused by a plugin somewhere, so I will check this out - Moz has shown that I have hundreds of these duplicate pages which is worrying, although they don't physically exist (same node for duplicate).
Tricky one this is!
-
So the word "lunch" is being added to the URL, yet the page doesn't exist?
I'll be honest, I don't work on Drupal because it does do weird things from time to time so can't advise on this specific issue.
I would check the settings though and see if there is an SEO plugin (or site setting) that is causing this to write a second page. Also check to make sure this isn't a page that has been deleted and is still able to be crawled. Has that page with the word "lunch" in the URL, ever been created?
If it is easy enough to do, delete the page and re-create it to see if that clears the problem.
-Andy
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Is this going to be seen by google as duplicate content
Hi All, Thanks in advance for any help that you can offer in regards to this. I have been conducted a bit of analysis of our server access file to see what googlebot is doing, where it is going etc. Now firstly, I am not SEO but have an interest. What I am seeing a lot of is that we have URL's that have an extension that sets the currency that is displayed on the products so that we can conduct Adwords campaigns in other countries, these show as follows: feedurl=AUD, feedurl=USD, feedurl=EUR etc. What I can see is that google bot is hitting a URL such as /some_product, then /someproduct?feedurl=USD and then /someproduct?feedurl=EUR and then /someproduct?feedurl=AUD all after each other. Now this is the same product page and just has the price shown slightly different on each. Would this count as a duplicate content issue? Should I disavow feedurl? Any assistance that you can offer would be greatly appreciated. Thanks, Tim
Technical SEO | | timsilver0 -
Mobile and hidden content - Any issue for SEO?
In reference to mobile - am I walking a fine SEO line when it comes to hidden content on mobile? On the responsive variations of sites we are working on some content is hidden (that displays on the desktop version of the site) so that pages on mobile can display correctly. Is this negative for SEO? Appreciate any feedback Cheers.
Technical SEO | | Oxfordcomma0 -
Squarespace Duplicate Content Issues
My site is built through squarespace and when I ran the campaign in SEOmoz...its come up with all these errors saying duplicate content and duplicate page title for my blog portion. I've heard that canonical tags help with this but with squarespace its hard to add code to page level...only site wide is possible. Was curious if there's someone experienced in squarespace and SEO out there that can give some suggestions on how to resolve this problem? thanks
Technical SEO | | cmjolley0 -
Duplicate pages, overly dynamic URL’s and long URL’s in Magento
Hi there, I’ve just completed the first crawl of my Magento site and SEOMOZ has picked up 1,000’s of duplicate pages, overly dynamic URL’s and long URL’s due to the sort function which appends URL’s with variables when sorting products (e.g. www.example.com?dir=asc&order=duration). I’m not particularly concerned that this will affect our rankings as Google has stated that they are familiar with the structure of popular CMS’s and Magento is pretty popular. However it completely dominates my crawl diagnostics so I can’t see if there are any real underlying issues. Does anyone know a way of preventing this? Cheers,
Technical SEO | | WendyWuTours
Al.1 -
Block Quotes and Citations for duplicate content
I've been reading about the proper use for block quotes and citations lately, and wanted to see if I was interpreting it the right way. This is what I read: http://www.pitstopmedia.com/sem/blockquote-cite-q-tags-seo So basically my question is, if I wanted to reference Amazon or another stores product reviews, could I use the block quote and citation tags around their content so it doesn't look like duplicate content? I think it would be great for my visitors, but also to the source as I am giving them credit. It would also be a good source to link to on my products pages, as I am not competing with the manufacturer for sales. I could also do this for product information right from the manufacturer. I want to do this for a contact lens site. I'd like to use Acuvue's reviews from their website, as well as some of their product descriptions. Of course I have my own user reviews and content for each product on my website, but I think some official copy could do well. Would this be the best method? Is this how Rottentomatoes.com does it? On every movie page they have 2-3 sentences from 50 or so reviews, and not much unique content of their own. Cheers, Vinnie
Technical SEO | | vforvinnie1 -
Are recipes excluded from duplicate content?
Does anyone know how recipes are treated by search engines? For example, I know press releases are expected to have lots of duplicates out there so they aren't penalized. Does anyone know if recipes are treated the same way. For example, if you Google "three cheese beef pasta shells" you get the first two results with identical content.
Technical SEO | | RiseSEO0 -
Need advanced SEO help!
Hi guys, This is my last attempt to work out what is up with this site before it goes to the big Flipper in the sky (and even then I doubt it will make much more than £1!) This site was a successful site, then one day Google decided it didnt like it, and I have not had much joy with it for nearly a year now. I must admit I tried to forget about it for a while, but it has always been a thorn in my side due to the fact it used to be a nice little earner. I have SEOmoz crawled it and I cant find any issues that would cause such a severe penalty, I removed many of the affiliate links, clocked the rest of the affiliate links and tried numurous other ideas, but now, as a last ditch attempt I am looking for some help! I tried to avoid the typical thin affiliate site by adding relevant content, but I have seen sites with much poorer design and content rank higher than this one. Any ideas welcome! Thanks in advance My site
Technical SEO | | mozUser14692366292850 -
Duplicate content issues caused by our CMS
Hello fellow mozzers, Our in-house CMS - which is usually good for SEO purposes as it allows all the control over directories, filenames, browser titles etc that prevent unwieldy / meaningless URLs and generic title tags - seems to have got itself into a bit of a tiz when it comes to one of our clients. We have tried solving the problem to no avail, so I thought I'd throw it open and see if anyone has a soultion, or whether it's just a fault in our CMS. Basically, the SEs are indexing two identical pages, one ending with a / and the other ending /index.php, for one of our sites (www.signature-care-homes.co.uk). We have gone through the site and made sure the links all point to just one of these, and have done the same for off-site links, but there is still the duplicate content issue of both versions getting indexed. We also set up an htaccess file to redirect to the chosen version, but to no avail, and we're not sure canonical will work for this issue as / pages should redirect to /index.php anyway - and that's we can't work out. We have set the access file to point to index.php, and that should be what should be happening anyway, but it isn't. Is there an alternative way of telling the SE's to only look at one of these two versions? Also, we are currently rewriting the content and changing the structure - will this change the situation we find ourselves in?
Technical SEO | | themegroup0