Does Google crawl and spider for other links in rel=canonical pages?
-
When you add rel=canonical to the page, will Google still crawl your page for content and discover new links in that page?
-
or robots.txt file
also nofollow isn't a rule it's also a guide - most SE's see and listen to it but some ignore it, even Google has been known to ignore it on some sites.
-
Hi RefCandy first of all canonical tag is a recommendation to spiders not a rule, so google will probably crawl your page.
Moreover the canonical tag prevents duplication issues not crawling itself there are many sites which uses self referring canonicals so there's no issue on your crawling rate at the beginning. However when google discovers the duplication of that page with the other you've set up it'll end crawling that page with less frequency, so it will give less value to some links in there.
The only rule which prevent links crawl is the nofollow tag in the page .
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Rel Canonical for HTTP and HTTPS pages
My website has a login that has HTTPS pages. If the visitors doesn't log in they are given an HTTP page that is similar, but slightly different. Should I sure a Rel Canonical for these similar pages and how should that be set up? HTTP to HTTPS version or the other way around? Thank you, Joey
Intermediate & Advanced SEO | | JoeyGedgaud1 -
Using rel="nofollow" when link has an exact match anchor but the link does add value for the user
Hi all, I am wondering what peoples thoughts are on using rel="nofollow" for a link on a page like this http://askgramps.org/9203/a-bushel-of-wheat-great-value-than-bushel-of-goldThe anchor text is "Brigham Young" and the page it's pointing to's title is Brigham Young and it goes into more detail on who he is. So it is exact match. And as we know if this page has too much exact match anchor text it is likely to be considered "over-optimized". I guess one of my questions is how much is too much exact match or partial match anchor text? I have heard ratios tossed around like for every 10 links; 7 of them should not be targeted at all while 3 out of the 10 would be okay. I know it's all about being natural and creating value but using exact match or partial match anchors can definitely create value as they are almost always highly relevant. One reason that prompted my question is I have heard that this is something Penguin 3.0 is really going look at.On the example URL I gave I want to keep that particular link as is because I think it does add value to the user experience but then I used rel="nofollow" so it doesn't pass PageRank. Anyone see a problem with doing this and/or have a different idea? An important detail is that both sites are owned by the same organization. Thanks
Intermediate & Advanced SEO | | ThridHour0 -
Site less than 20 pages shows 1,400+ pages when crawled
Hello! I’m new to SEO, and have been soaking up as much as I can. I really love it, and feel like it could be a great fit for me – I love the challenge of figuring out the SEO puzzle, plus I have a copywriting/PR background, so I feel like that would be perfect for helping businesses get a great jump on their online competition. In fact, I was so excited about my newfound love of SEO that I offered to help a friend who owns a small business on his site. Once I started, though, I found myself hopelessly confused. The problem comes when I crawl the site. It was designed in Wordpress, and is really not very big (part of my goal in working with him was to help him get some great content added!) Even though there are only 11 pages – and 6 posts – for the entire site, when I use Screaming Frog to crawl it, it sees HUNDREDS of pages. It stops at 500, because that is the limit for their free version. In the campaign I started here at SEOmoz, and it says over 1,400 pages have been crawled…with something like 900 errors. Not good, right? So I've been trying to figure out the problem...when I look closer in Screaming Frog, I can see that some things are being repeated over and over. If I sort by the Title, the URLs look like they’re stuck in a loop somehow - one line will have /blog/category/postname…the next line will have /blog/category/category/postname…and the next line will have /blog/category/category/category/postname…and so on, with another /category/ added each time. So, with that, I have two questions Does anyone know what the problem is, and how to fix it? Do professional SEO people troubleshoot this kind of stuff all of the time? Is this the best place to get answers to questions like that? And if not, where is? Thanks so much in advance for your help! I’ve enjoyed reading all of the posts that are available here so far, it seems like a really excellent and helpful community...I'm looking forward to the day when I can actually answer the questions!! 🙂
Intermediate & Advanced SEO | | K.Walters0 -
How to stop pages being crawled from xml feed?
We have a site that has an xml feed going out to many other sites.
Intermediate & Advanced SEO | | jazavide
The xml feed is behind a password protected page so cannot use a cannonical link to point back to original url. How do we stop the pages being crawled on all of the sites using the xml feed? as with hundreds using it after launch it will cause instant duplicate content issues? Thanks0 -
Real impact of canonical links?
I am responsible for 2 e-commerce websites. SEO Moz and Google Web Master tools both inform me regularly that on both sites there are many instances of duplicate titles, headings, decriptions and page content. Obviously from an SEO point of view I am more than a little concerned about this! Out product pages struggle to perform strongly despite the fact that our website is of a decent quality and we are leaders in our field. Our competitors rank above us when they add a product page, whereas we normal flit in between 8-10 or on the 2nd SERP. I know it is hard without viewing the site, but is duplicate content likely to be a strong, leading factor in this? I think it is, but want to put together a business case to spend the cash to sort it out....just need someone confirmation that this is worth sorting as a priority. Here are 2 examples of what I mean: 1) Category pages www.exampledomain.co.uk/category1.aspx We have filters on our category page (so the customer can sort products based on their price, colour, size etc.). When filters are used a new URL is generared. www.exampledomain.co.uk/category1.aspx?prices=0||10 www.exampledomain.co.uk/category1.aspx?prices=10||20 The content, titles, description is the same although the links are different. Do I need to set up a canonical tag on the page that reads: 2) Product pages Product pages on the websites have different URLs depending on how to arrive on them. You get 1 URL if you navigated to the page via the website navigation, but you get another different URL if you used the website search functionality to find the page. Example: Search link: www.exampledomain.co.uk/category1/Product1.aspx Navigation link: www.exampledomain.co.uk/12345/category1/Product1.aspx Again, do I need to set up a canonical tag for 1 of these link types so that the link benefit is not shared over 2 pages? Any feedback would be welcome! At the moment the ability to add canonical tags is locked down by our CMS (I know, rubbish!)...so website development would be needed - hence the need for a business case!
Intermediate & Advanced SEO | | DHS_SH0 -
Removing a Page From Google index
We accidentally generated some pages on our site that ended up getting indexed by google. We have corrected the issue on the site and we 404 all of those pages. Should we manually delete the extra pages from Google's index or should we just let Google figure out that they are 404'd? What the best practice here?
Intermediate & Advanced SEO | | dbuckles0 -
Google swapped our website's long standing ranking home page for a less authoritative product page?
Our website has ranked for two variations of a keyword, one singular & the other plural in Google at #1 & #2 (for over a year). Keep in mind both links in serps were pointed to our home page. This year we targeted both variations of the keyword in PPC to a products landing page(still relevant to the keywords) within our website. After about 6 weeks, Google swapped out the long standing ranked home page links (p.a. 55) rank #1,2 with the ppc directed product page links (p.a. 01) and dropped us to #2 & #8 respectively in search results for the singular and plural version of the keyword. Would you consider this swapping of pages temporary, if the volume of traffic slowed on our product page?
Intermediate & Advanced SEO | | JingShack0 -
Link Request Email on Site`s Link Pages
Hello I have assembled a list of web-sites that have "Links" section that has a list of persons` favorite tools. Those pages have a link to my competitor. I know my tool is just as good if not better and want to request a link. I`m thinking of sending an email asking for a link and offering a small amount of money for it. Questions: A) How much should I offer? Should I offer anything at all B) Is there an email style that someone can suggest that has been tested and proven to work for this type of situtation?
Intermediate & Advanced SEO | | hellopotap0