Dates in URLs
-
I have a problem with duplicate content errors and duplicate page titles, which is penalising my site. It has arisen because a number of URLs are suffixed with date(s) and have been spidered. In principle, I do not want any URL with a date suffix to be spidered.
E.g.:
www.carbisbayholidays.co.uk/carbis-bay/houses-in-carbis-bay/seaspray.htm/06_07_13/13_07_13
http://www.carbisbayholidays.co.uk/carbis-bay/houses-in-carbis-bay/seaspray.htm/20_07_13/27_07_13
Only this URL should be spidered:-
http://www.carbisbayholidays.co.uk/carbis-bay/houses-in-carbis-bay/seaspray.htm
I have over 10,000 of these duplicates. Firstly, I wish to remove them from Google en bloc (not one by one), and secondly, I wish to amend my robots.txt file so that these URLs are not spidered. I do not know the format for either.
Can anyone help, please?
-
Thanks Kyle.
I am particularly grateful for the Disallow format. These are the only URLs using an underscore, so it will work for me. I will be checking why these are being created.
Do I need to remove them using the Removal Tool in Google, and is there a format for doing this en bloc?
Thanks again,
Alan
-
Hi Alan,
I would probably start by adding a disallow rule to robots.txt.
**Disallow: /*_** may work and block all your dated URLs from being crawled, but it may also have adverse effects if you have any other URLs containing underscores. To test whether this solution would work, I would first implement a disallow on a single chosen dated URL, **Disallow: /*/20_07_13** for example (robots.txt rules match from the start of the path, so the leading wildcard is needed), and then check whether Google has dropped the page. GWT should tell you whether you have inadvertently blocked any other pages by doing so.
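Before editing robots.txt, a candidate rule can be checked locally against a handful of URLs. The following is only a rough sketch, not Google's actual matcher: it assumes Google-style pattern matching (rule anchored at the start of the path, `*` matching any run of characters, a trailing `$` anchoring the end), and the helper name `rule_matches` is made up for illustration.

```python
import re


def rule_matches(pattern: str, path: str) -> bool:
    """Return True if a robots.txt path pattern matches a URL path.

    Assumption: Google-style matching, i.e. the pattern is anchored at the
    start of the path, '*' matches any run of characters, and a trailing
    '$' anchors the end of the path.
    """
    anchored_end = pattern.endswith("$")
    if anchored_end:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with '.*' for each '*'.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored_end:
        regex += "$"
    return re.match(regex, path) is not None


# Paths taken from the example URLs in the question.
dated = "/carbis-bay/houses-in-carbis-bay/seaspray.htm/20_07_13/27_07_13"
clean = "/carbis-bay/houses-in-carbis-bay/seaspray.htm"

print(rule_matches("/*_", dated))          # True: the dated URL is blocked
print(rule_matches("/*_", clean))          # False: the clean URL is untouched
print(rule_matches("/20_07_13", dated))    # False: no leading wildcard
print(rule_matches("/*/20_07_13", dated))  # True
```

Running every URL from a crawl export through a check like this would show in advance whether an underscore rule catches anything other than the dated pages.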
You should also be thinking about how these URLs are being created and taking action to prevent it. Consider implementing canonical tags, if you haven't already, to clean up any potential duplication issues.
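As a rough illustration of the canonical-tag idea, the sketch below strips the date suffix from a URL and builds the tag that each dated page could carry in its head. It assumes the suffix is always one or more trailing `/dd_mm_yy` segments, as in the example URLs; the function names `canonical_url` and `canonical_tag` are hypothetical, and how the tag gets into the page template depends on the site's platform.

```python
import re
from html import escape

# Assumption: the duplicates differ only by one or more trailing /dd_mm_yy
# segments, as in the example URLs from the question.
DATE_SUFFIX = re.compile(r"(?:/\d{2}_\d{2}_\d{2})+/?$")


def canonical_url(url: str) -> str:
    """Return the URL with any trailing date segments removed."""
    return DATE_SUFFIX.sub("", url)


def canonical_tag(url: str) -> str:
    """Build a <link rel="canonical"> tag pointing at the undated URL."""
    href = escape(canonical_url(url), quote=True)
    return f'<link rel="canonical" href="{href}">'


dated = (
    "http://www.carbisbayholidays.co.uk/carbis-bay/houses-in-carbis-bay/"
    "seaspray.htm/20_07_13/27_07_13"
)
print(canonical_tag(dated))
```

Bear in mind that Google can only read a canonical tag on a page it is allowed to crawl, so a canonical tag and a robots.txt disallow on the same URL work against each other.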
Cheers,
K