Google also indexed trailing slash version - PLEASE HELP
-
Hi Guys,
We redesigned the website and somehow our canonical extension decided to add a trailing slash to all URLs. Previously our canonical URLs didn't have a trailing slash.
During the redesign we haven't changed the URLs. They remained same but we have now two versions indexed. One with trailing slash one without.
I've now fixed the issue and removed the the trailing slash from canonical URLs.
Is this the correct way of fixing it? Will our rankings be effected in a negative way?
Is there anything else I need to do.
The website went live last Tuesday.
Thanks
-
Thats great! The canonical URLs are showing URLs without slash as they are probably reflecting their original URL which is without slash. Hope Google clears them soon..
-
Seems like you got the 301-redirect resolved below - if you've got that in place and fixed the canonical tag, it should be ok. It'll just take some time (usually longer than you'd like) for Google to clear out the pages, especially the deeper ones. If you see gradual de-indexation, though, you'll probably be fine.
-
-
Actual rel="canonical" tags.
-
As soon as we relised everything was fixed. Canonical tag is showing urls without slash and also aplied to htaccess to redirect slash version to non slash version.
<cite>we're using www.shopify.com</cite>
-
-
Could you clarify a couple of things:
(1) When you say canonical URLs, do you mean your internal links, or the actual URLs in your rel="canonical" tags?
(2) If it was just the canonical tags, is everything consistent now (tags, internal links, etc.)?
Since both version will resolve, just fixing the canonical tags (if that's the issue) should be enough - it's just going to take a little time. They should be as effective as a 301-redirect in this case. Either way, though, it can take Google a while to kick out the duplicates. I'd just monitor the index closely and make sure the top-level pages are clearing up (i.e. your home-page and major category duplicates should be disappearing). If that's happening, you're ok - you just need to wait a bit. If that's not happening, then you may have some other mixed signals in play.
-
You are welcome.
Well, the first time you did submit the sitemap right, but now since Google has found new URLs on your website and indexed them, it would be good to notify the big G that they are no longer a part of your website and resubmitting would not hurt.
About the redirections, Google does take a bit of time to understand that the URLs have permanently moved and will gradually remove them from the index. So, keep checking the index for the trailing slash URLs and when they are gone, you can remove the redirections.
Cheers,
-
Thanks a lot.
Now when i click the slash version of the indexed URL from google goes to nonslash version. So it seems we're safe now.
The other thing is when I submitted the sitemap.xml after launch it was without slash. Also all internal links are targeting nonslash URLs. I think google should understand that this is a technical issue and now it has been solved.
When should i remove that redirect?
-
Yups, its done. Just need to be sure if the Home Page is fine. The indexed version of the Home Page stays as it is without any redirection.
Cheers,
-
I checked with this website: http://www.internetofficer.com/seo-tool/redirect-check/
It says:
http://www.mydomain.com/jason.html/
Type of redirect: 301 Moved Permanently
http://www.mydomain.com/jason.html
So looks as if it's done the job. Right?
-
Sounds good, do keep a check to make it 100% sure. I believe the SE's will be fine now.
Cheers,
-
RewriteRule ^([^/]+/)*([^/.]+).html/ http://www.mydomain.com/$2.html [R=301,L]
Looks like above did the trick
-
I think some of these posts can help you understand:
http://html5boilerplate.com/docs/Proper-usage-of-trailing-slash-redirects/
Do try this a test environment and take a backup of the .htaccess file before making any changes, Have it go through a programmer.
Cheers,
-
Please can you tell me how to redirect urls with slash to non slash urls using .htaccess.
-
Jvalops,
This is a common scenario in SEO when you have 2 versions indexed of the same URL. This bascially creates a duplicate issue. Now, this situation has a solution which includes 2 things to implement:
1. Fix it from the search engines's perspective.
2. Make changes at the server level.
You did remove the trailing slash so you fixed it at the server level but you left the search engines to think - Where did the URL go? Am I supposed to show a 404 for that or what?.
So, it is important that you first fix them for the SE's and then make any server level changes because you never know how quick the crawlers can re-visit the disappeared URL and take their own action. Since this is just a recent change I hope that the SE;s will not evaluate it in a negative way but you should be quick to inform them. Now, since you have already removed it, do add a code in the .htaccess file stating that any URL with a slash redirects it to the URL without slash. I hope there are no URLs that have to end with a slash (just have a re-look on this, the home page and others).
After this is done, to make things more clear to the search engines, resubmit your XML sitemap with all the correct URLs on the website and I think you will be just fine.
On the rankings, I don't think it will be affected, unless there was a re-crawl after the indexation.
Cheers,
-
I'm not 100% sure how to answer your question, but an .htaccess 301 might work.
/example.html/ example.html
Try that to see if it works.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Subdomain to sub-directory migration: New subdirectory not yet indexed
Hi all, We have recently migrated a sub-domain to sub-directory to claim it's traffic in our website. Like demo.website.com to website.com/demo. We have also set a redirect for same which is working fine; but still old subdomain is showing in google search results and new directory haven't been indexed. We have submitted the new sub-directory in search console multiple times and it got partially indexed as per the status. We have allowed crawlers. Thanks
Web Design | | vtmoz0 -
Affiliate and Earning Disclosure help please
Hello Moz Friends, I wanted to ask about affiliate terms pages. I'm assuming that if I have an affiliate banner where someone clicks on it, and I get a penny, that I need to have an affiliate disclosure or some sort of link saying that I make money. I'm know is some legal ground here, but just curious if this is how the concept works? Thank you so much!
Web Design | | asbchris0 -
Should Blog Category Archive URLs be Set to "No-Index" in Wordpress?
It appears that Google Webmaster Tools is listing about 120 blog archives URLs in Google Index>Index Status that should not be listed. Our site map contains 650 pages, but Google shows 860. Pages like: <colgroup><col width="464"></colgroup>
Web Design | | Kingalan1
| http://www.nyc-officespace-leader.com/blog/category/manhattan-office-space | With Titles Like: <colgroup><col width="454"></colgroup>
| Manhattan Office Space Archives - Metro Manhattan Office Space | Are listed when in the Rogerbot crawl report for the site. How can we remove such pages from Google Webmaster Tools, Index Status? Our site map shows about 650 pages, yet Google show these extra pages. We would prefer that they not be indexed. Note that these pages do not appear when we run a site:www.nyc-officespace-leader.com search. The site has suffered a drop in ranking since May and we feel it prudent to keep Google from indexing useless URLs. Before May 650 pages showed on the Webmaster Tools Index status, and suddenly in early June when we upgraded the site the index grew by about 175 pages. I suspect the 120 blog archives URLs may have something to do with it. How can we get them removed? Can we set them to "No-Index", or should the robot text be used to remove them? Or can some type of removal request be made to Google? My developers have been struggling with this issue since early June. The bloat on the site is about 175 URLs not on the site map. Is there any go to authority on this issue (it is apparently rather complicated) that can provide a definitive answer? Thanks!!
Alan0 -
Can only get a few pages indexed on by google
Hi I've touched upon this before on previous questions so apologies for repeating myself. In a nutshell out of the 60 webpages submitted to Google 11 have been indexed and out of the 140 images submitted none have indexed any ideas would be great! Here is a screen shot of what Google Webmaster is showing http://www.tidy-books.com/sitemapshow.png and here is the sitemap - > http://www.tidy-books.com/sitemap/us/sitemap.xml Thanks
Web Design | | tidybooks0 -
Increasing content, adding rich snippets... and losing tremendous amounts of organic traffic. Help!
I know dramatic losses in organic traffic is a common occurrence, but having looked through the archives I'm not sure that there's a recent case that replicates my situation. I've been working to increase the content on my company's website and to advise it on online marketing practices. To that end, in the past four months, I've created about 20% more pages — most of which are very high quality blog posts; adopted some rich snippets (though not all that I would like to see at this point); improved and increased internal links within the site; removed some "suspicious" pages as id'd by Moz that had a lot of links on it (although the content was actually genuine navigation); and I've also begun to guest blog. All of the blog content I've written has been connected to my G+ account, including most of the guest blogging. And... our organic traffic is preciptiously declining. Across the board. I'm befuddled. I can see no warnings (redirects &c) that would explain this. We haven't changed the site structure much — I think the most invasive thing we did was optimize our title tags! So no URL changes, nothing. Obviously, we're all questioning all the work I've done. It just seems like we've sunk SO much energy into "doing the right thing" to no effect (this site was slammed before for its shady backlink buying — though not from any direct penalty, just as a result of the Penguin update). We noticed traffic taking a particular plunge at the beginning of June. Can anyone offer insights? Very much appreciated.
Web Design | | Novos_Jay0 -
Does stock art photo attribution negatively impact SEO by leaking Google Page Rank?
Greetings: Companies such as Shutterstock often require that buyers place credit attribution on their web pages when photos you buy from them appear on these pages.. Shutterstock requests that credit attribution links such as these be added: Songquan Deng / Shutterstock.com Do these links negatively impact SEO? Or do search engines view them as a positive? Thanks,
Web Design | | Kingalan1
Alan0 -
Does Google have problem crawling ssl sites?
We have a site that was ranking well and recently dropped in traffic and ranking. The whole site is https and and not just the shopping pages. Thats the way the server is setup, they make whole site https. My manager thinks the drop in ranking is due to google not crawling https. I think contrary, but would like some feedback on this. Site is here
Web Design | | anthonytjm0 -
Google Analytics
Hi Folks I'm currently measuring multiple goals with Google analytics. At the end of the month I compare the Analytics numbers with my actual, from a diffident data base, and there is often a variance. My questions are; 1- Is there an accepted Delta % in goals eg: + or - 10% 2- Is there a resource that anybody found incredibly useful on this topic 3- does anybody have any tips on trouble shooting here? Thank you in advance
Web Design | | Intrested0