How do i actually use the canonicalization rule for Apache?
-
Hi Guys,
Moz is reporting lots of duplicate content on my site. I think this is partly from session id's and partly from category pages and on-site search generated pages. I know I have to use the canonicalization rule but don't know exactly how to determine the correct URL and where to put the code. Can anyone offer any advice on this? I'm new to this so apologies for any etiquette breaching etc.
Many thanks, Stewart.
-
Stewart,
The canonicalization tag identifies the URL where the one true version of content can be found. The tag used on the one true version AND all on the copies of the true version are exactly same tag because the copies will point to the true version and the true version will point to itself. Often the version you're going to set as the one true version is that which has the shortest URL or the one without any parameters added to the end of it.
The whole purpose of the canonicalization tag is to be able to say "Hey Google, I know I have a bunch of URLs that have the same content--I know it's not ideal but there's nothing I can do about that. So what I want you to do is only count this specific URL in your index so all the others don't count against me as duplicate content."
Here's more detail on the topic: Canonicalization and the Canonical Tag - Learn SEO - Moz
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Sitemap use for very large forum-based community site
I work on a very large site with two main types of content, static landing pages for products, and a forum & blogs (user created) under each product. Site has maybe 500k - 1 million pages. We do not have a sitemap at this time.
Technical SEO | | CommManager
Currently our SEO discoverability in general is good, Google is indexing new forum threads within 1-5 days roughly. Some of the "static" landing pages for our smaller, less visited products however do not have great SEO.
Question is, could our SEO be improved by creating a sitemap, and if so, how could it be implemented? I see a few ways to go about it: Sitemap includes "static" product category landing pages only - i.e., the product home pages, the forum landing pages, and blog list pages. This would probably end up being 100-200 URLs. Sitemap contains the above but is also dynamically updated with new threads & blog posts. Option 2 seems like it would mean the sitemap is unmanageably long (hundreds of thousands of forum URLs). Would a crawler even parse something that size? Or with Option 1, could it cause our organically ranked pages to change ranking due to Google re-prioritizing the pages within the sitemap?
Not a lot of information out there on this topic, appreciate any input. Thanks in advance.0 -
Canonicalization
I understand what canonicalization does, however I'm a bit confused on one point. Generally, of course it's used to determine the main article out of two which are identical. But what happens to the keywords if the content isn't quite identical? Example:-
Technical SEO | | seoman10
Let's say the 'first page' it is optimised for 'racing cycles'.
The 'second page' is optimised for 'second-hand racing cycles' Let's assume that the 'first page' doesn't have any reference to 'used' or 'second-hand' so it would be essentially unrelated to the 'second page'. If I then add an canonical tag to the 'second page' that points to the 'first page' in theory, the 'second page' will drop from the search rankings and pass any link authority back to the 'first page' What I want to know is will the 'first page', then rank for the keywords that the second page used to rank for? (in this case 'second-hand racing cycles')0 -
Using # in parameters?
I am trying to understand why a website would use # instead of a ? for its parameters? I have put an example of the URL below: http://www.warehousestationery.co.nz/office-supplies/adhesives-tapes-and-fastenings#prefn1=brand&prefn2=colour&prefv1=Command&prefv2=Clear Any help would be much appreciated.
Technical SEO | | CaitlinDW1 -
SEO across sites built using Google Web Toolkit
Hi guys, General question around general SEO best practices, such as url and title, and how they fit in with Google Web Toolkit built sites that use a www.site.com/#!category=12345 format. The space we're getting into is heavily competitive, with many established players doing standard SEO well. I know there are some speed benefits to using GWT, however I'd like to better understand the SEO impact, if any, before the site development progresses too far. Cheers, Jez
Technical SEO | | jez0000 -
Navigating The New Rules - Clarification on NoFollow Usage
I posted some of this elsewhere but would like feedback from some of SEOMoz community. An author. Lets say she has a book out on Relationship Advice.
Technical SEO | | CarlosFernandes
Lets say her book was even called Relationship Help, Advice and Tips. She promotes it for years on her website and implements an affiliate program to get wider reach. Affiliates link to it by the name of the book. One day she even gets a mention or two on a few Yahoo editorial type pages that reviewed said book. A few other very big name websites also link to her and even link to her (without her asking) to her domain no less and make the link say simply Relationship Advice. The links were in the body of the pages. Again, these were unsolicited reviews that she did not even ask for. In the old world - that was ok - in as much as unharmful to her site. In the new world she's toast. She has taken down the book pages she worked 7 years to build up. I don't even think that will help. People linked to her website and put "relationship Advice" in the links because that's what she gave and was an expert at. She didn't ask for those links.
2) A large well known web directory that many have heard of - choose to charge for inclusion into their directory. BUT - you can get a free link if you include some code on your website. A reciprocation that is well known. I have read many many articles and posts by many people over the years on this - and as far as I can tell that reciprocation model for free submission was OK. As long as directories didn't have search functions that served search results that were biased to paid link submissions they seemed to be ok. In terms of the free submission - I read a post way back by Matt that said as long as the directory wasn't asking for the reciprocal link in addition to charging for the submission - that was OK. So, scoot forward to 2012. Said directory has hundreds of thousands of links to it - due tot he reciprocal code that was on many of the free links. The code on the websites that got free links obviously promotes the directory by putting the main keyword in the link. ie "Web Directory". In this new world - is this OK ? That's what they do. They are after all a web directory? The company in scenario 2 with hundreds of thousands of links all saying virtually the same phrase - with the vast majority of the backlinks being from generated reciprocal links for free advertisers in its directory - they are doing FINE. Not hurt at all. The small business owner / author in scenario 1 - who had unsolicited natural links coming to her with anchor text detailing something she did and was an expert at - has gone from the SERPS. Should the company in Scenario 2 - that COULD DO something about the anchor text in the reciprocal links back to their website - now change the recip code so that it just says their brand name instead of "web directory" ? Should the author - if she ever regains from this hell - now have some kind of policy clearly stated on her website - that if any person is ever to link to her website ever again - they MUST only link to it with her name in the anchor text - and never link up words she is an authority on? How can she prevent that? So now is it up to the advertiser or the publisher to ensure we are all safe? If small business person Billy Bob inquires about a paid link on a website and the publisher dosn't tell him that the link may hurt his site and he does not not request a NOFOLLOW on it (because he is just a clueless business owner) - are they (the publishing website) liable for Billy Bob's site tanking if it does? Or is it the advertiser's job to be aware of all said issues - because I know the vast majority of Billy Bob's wouldn't be. How long has everyone got to "get in line"? There are many in the search community offering paid links on their websites in "Sponsored Links" sections - without the use of NOFOLLOWS and i don't see any devaluing of their advertisers websites. If rules are rules let everyone play them. Getting sick of the hypocrisy. I aim to get to Journeyman though just so I can get a DOFOLLOW link on this site 🙂 Incentives eh! Carlos1 -
Using symbols in the html title of a webpage
If you a symbol in the title of a webpage will this dilute the keywords in the title
Technical SEO | | mickey11
thus making it rank worse in search engines here is an example <title><br /> Black Shoe Polish<br /></title> versus <title><br /> ▶ Black Shoe Polish<br /></title> will the extra symbols count as words and thus the dilute the effectiveness of the Black Shoe Polish keyword. sort of making like 4 words instead 3. By the way, The reason to use a symbol is to make it stand on in the search engine results0 -
Using a third party server to host site elements
Hi guys - I have a client who are recently experiencing a great deal of more traffic to their site. As a result, their web development agency have given them a server upgrade to cope with the new demand. One thing they have also done is put all website scripts, CSS files, images, downloadable content (such as PDFs) - onto a 3rd party server (Amazon S3). Apparently this was done so that my clients server just handles the page requests now - and all other elements are then grabbed from the Amazon s3 server. So basically, this means any HTML content and web pages are still hosted through my clients domain - but all other content is accessible through an Amazon s3 server URL. I'm wondering what SEO implications this will have for my clients domain? While all pages and HTML content is still accessible thorugh their domain name, each page is of course now making many server calls to the Amazon s3 server through external URLs (s3.amazonaws.com). I imagine this will mean any elements sitting on the Amazon S3 server can no longer contribute value to the clients SEO profile - because that actual content is not physically part of their domain anymore. However what I am more concerned about is whether all of these external server calls are going to have a negative effect on the web pages value overall. Should I be advising my client to ensure all site elements are hosted on their own server, and therefore all elements are accessible through their domain? Hope this makes sense (I'm not the best at explaining things!)
Technical SEO | | zealmedia0 -
Google not using <title>for SERP?</title>
Today I noticed that Google is not using my title tag for one of my pages. Search for "covered call search" Look at organic result 6: Search - Covered Calls Covered call screener filters 150000 options instantly to find the best high yield covered calls that meet your custom criteria. Free newsletter.<cite>https://www.borntosell.com/search</cite> - CachedNow, if you click through to that page you see the meta title tag is:Covered Call ScreenerEven the cached version shows the title tag as Covered Call ScreenerI am not logged in, so I don't believe personalization has anything to do with it.Have others seen this before?It is possible that "search - covered calls" was the title tag 9 months ago (before I understood SEO); I honestly don't remember. I cleaned all my titles up at least 6 months ago.Can I force Google to re-index the page? Its content has changed a few times in the last few months, and Google crawls my site frequently according to webmaster tools.
Technical SEO | | scanlin0