Link Juice passing through a redirect of a disallowed URL
-
Hey guys!
Suppose I disallow search bots from indexing anything on my secure server in my robots.txt, and 301 redirect all of my secure server traffic to my non-secure site. Will the search bots see the redirect before they realize that they're disallowed from accessing that page? Or will they see that page is disallowed and not follow the redirect?
Should I change my robots.txt to allow search bots to crawl my secure site so they can find the redirects?
-
Thanks Nick!
-
A crawler that obeys robots.txt checks the disallow rules before it requests a URL, so it never requests a disallowed page and never sees the 301 on it. If you think there are any links out there that point to your secure pages and want to preserve any SEO-related value from those links, you should remove the disallow from robots.txt and use 301 redirects. However, if you redirect your secure pages to non-secure, what is the point of having the secure pages? They would never be used, since users would be 301 redirected to the non-secure address.
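To see why the disallow has to go before the redirects can count, here is a minimal sketch of the check a robots.txt-compliant crawler makes before requesting a URL. It uses Python's standard `urllib.robotparser`; the host `secure.example.com`, the robots.txt contents, and the helper name `crawler_would_request` are assumptions for illustration, not anything from the original site.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for the secure host: it blocks every path.
ROBOTS_TXT = """\
User-agent: *
Disallow: /
"""


def crawler_would_request(url: str, robots_txt: str, user_agent: str = "*") -> bool:
    """Return True if a robots.txt-compliant crawler would request the URL."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Hypothetical secure URL that answers with a 301 to the non-secure site.
url = "https://secure.example.com/checkout"

if crawler_would_request(url, ROBOTS_TXT):
    print("URL is requested, so the 301 response can be seen and followed")
else:
    print("URL is never requested, so the 301 response is never seen")
```

With the blanket `Disallow: /` the check fails and the request is skipped, which is the step where the redirect would otherwise have been discovered. Real crawlers differ in detail, so treat this as an illustration of the ordering rather than a model of any particular bot.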
If you are trying to keep the secure pages out of the index to avoid duplicate content, and want users to be able to have access to the secure pages, disallow in robots.txt and remove the 301 redirects.
In the case of pages that don't really need to be secure, but are available both as http and https, you could also use the canonical tag to show either the secure or non-secure as the original. Individually redirecting the potentially duplicate pages rather than a sitewide https to http redirect would preserve any "juice" you may be getting from links to either version.
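As a rough sketch of the canonical approach, the helper below builds the same `<link rel="canonical">` tag for both the http and https versions of a page, so either version points at one preferred URL. The function name `canonical_link_tag`, the choice of `http` as the preferred scheme, and the `www.example.com` URLs are assumptions for illustration; it also assumes the secure and non-secure versions share a hostname and differ only in scheme.

```python
from html import escape
from urllib.parse import urlsplit, urlunsplit


def canonical_link_tag(url: str, preferred_scheme: str = "http") -> str:
    """Build a canonical <link> tag that points at the preferred scheme."""
    parts = urlsplit(url)
    # Keep host, path and query; swap the scheme and drop any fragment.
    canonical = urlunsplit(
        (preferred_scheme, parts.netloc, parts.path or "/", parts.query, "")
    )
    return f'<link rel="canonical" href="{escape(canonical, quote=True)}" />'


# Both versions of the (hypothetical) page emit the same tag.
print(canonical_link_tag("https://www.example.com/pricing"))
print(canonical_link_tag("http://www.example.com/pricing"))
```

The tag belongs in the `<head>` of both versions of the page. Pass `preferred_scheme="https"` if the secure version should be treated as the original instead.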
If it is more complicated, take a look at this thread on Google's support forum: http://www.google.com/support/forum/p/Webmasters/thread?tid=6a7fac6f81489bc2&hl=en
Related Questions
-
Several hreflang links pointing to same URL
Hi, Does anyone know whether hreflang links can be used using the following markup? I can't seem to find any info on this particular usage, but it "feels" incorrect to me. (duplicate content issues) Our development team tells me this is the way the markup should be, since languages are initially set using a cookie and all different languages are using the same URL. Thanks!
<link rel="alternate" href="https://www.littlethingz.be" hreflang="nl"/>
<link rel="alternate" href="https://www.littlethingz.be" hreflang="x-default"/>
<link rel="alternate" href="https://www.littlethingz.be" hreflang="fr"/>
<link rel="alternate" href="https://www.littlethingz.be" hreflang="en"/>
<link rel="alternate" href="https://www.littlethingz.be" hreflang="de"/>
Technical SEO | | dimitrihuyghe0 -
Problems with canonical urls / redirect (magento webshop)
Hi all, We're running a Magento webshop and we discovered some strange things regarding canonical URLs and redirects after using the Amasty improved navigation extension. To clarify, please check these four URLs. They contain the same content (the same product page). (1) https://www.afwerkingshop.be/gyproc-gipskartonplaat-ak-2600x1200x9-5mm.html (2) https://www.afwerkingshop.be/wanden/gyproc-gipskartonplaat-ak-2600x1200x9-5mm.html (3) https://www.afwerkingshop.be/wanden/gipsplaten/gyproc-gipskartonplaat-ak-2600x1200x9-5mm.html (4) https://www.afwerkingshop.be/wanden/gipsplaten/standaard/gyproc-gipskartonplaat-ak-2600x1200x9-5mm.html All four pages have different canonicals (each page's own URL). Obviously, that's not good. However, in Google (site:...) URL (1) is the only one that's indexed. Also, if I visit the product page by first going to a category page (e.g. www.afwerkingshop.be/wanden.html), I'm redirected to URL (1), but the canonical URL is www.afwerkingshop.be/last_visited_category_name/product. So the canonical seems to be dynamic, depending on the last visited category. And still, only URL (1) is indexed. Additionally, all aforementioned pages contain . Is anyone familiar with this issue? And more importantly, will it cause problems in the future? Thanks in advance. Kind regards, Chendon
Technical SEO | | RBijsterveld0 -
Disavow links and domain of SPAM links
Hi, I have a big problem. For the past month, my company website has been scraped by hackers. This is how they do it: 1. Hack unmonitored sites and/or sites that are still using an old version of WordPress or another out-of-the-box CMS. 2. Create spam pages with links to my pages, plus plant a trojan horse and a script to automatically grab resources from my server. Some sites had pages from my site uploaded to them directly. 3. Create pages with titles, keywords and descriptions that consist of my company brand name. 4. Use the HTTP referrer to redirect Google search results to competitor sites. What I have done so far: 1. Blocked the identified sites' IPs in my WAF. This prevents those hacked sites from grabbing resources from my site via scripts. 2. Reached out to webmasters and hosting companies to remove the affected sites. So far this has not been very effective, as many of the sites have no webmaster. Only a few hosting companies responded promptly. Some have not replied even after a week. The problem now is: by the time I realized there was an issue, hundreds if not thousands of sites had already been used by the hacker. Literally tens of thousands of sites have been crawled by Google, and the hacked or scripted pages with my company brand title, keywords and description have already been indexed by Google. I am removing and disavowing every day, but there are so many of them indexed by Google now. Questions: 1. What is the best way forward for me to resolve this? 2. Disavowing links and domains: does disavowing a domain mean all the links from that domain are disavowed? 3. Can anyone recommend an SEO company that has dealt with this kind of issue before and successfully rectified it? Note: SEAGM is the company's branded keyword.
Technical SEO | | ahming7770 -
Affiliate Link is Trumping Homepage - URL parameter handling?
An odd and slightly scary thing happened today: we saw an affiliate string version of our homepage ranking number one for our brand, along with the normal full set of site-links. We have done the following: 1. Added this to our robots.txt:
User-agent: *
Disallow: /*?
2. Reinserted a canonical on the homepage (we had removed this when we implemented hreflang, as we had read the two interfered with each other). We haven't had a canonical for a long time now without issue. Is this anything to do with the algo update perhaps?! The third thing we're reviewing I'm slightly confused about: URL Parameter Handling in GWT. As advised with regard to affiliate strings, for the question "Does this parameter change page content seen by the user?" we have NO selected, which means they should be crawling one representative URL. But isn't it the case that we don't want them crawling or indexing ANY affiliate URLs? You can tell Googlebot not to crawl any URLs with a particular string, but only if you select: "Yes. The parameter changes the page content." Should they be able to tell an affiliate URL from the original and not index it? I read a quote from Matt Cutts which suggested this (along with putting a "nofollow" tag in affiliate links just in case). Any advice in this area would be appreciated. Thanks.
Technical SEO | | LawrenceNeal0 -
301 vs 302 & Link Juice
Has anyone come across any recent cases of a 302 link passing more link juice than before?
Technical SEO | | CeeC-Blogger0 -
Duplicate pages, overly dynamic URLs and long URLs in Magento
Hi there, I’ve just completed the first crawl of my Magento site and SEOMOZ has picked up thousands of duplicate pages, overly dynamic URLs and long URLs due to the sort function, which appends variables to URLs when sorting products (e.g. www.example.com?dir=asc&order=duration). I’m not particularly concerned that this will affect our rankings, as Google has stated that they are familiar with the structure of popular CMSs and Magento is pretty popular. However, it completely dominates my crawl diagnostics, so I can’t see if there are any real underlying issues. Does anyone know a way of preventing this? Cheers, Al.
Technical SEO | | WendyWuTours1 -
Mobile redirection
Hi, What would be the best practice for mobile detection: (1) best practice for redirections, and (2) best practice for detection and inclusion of a front-end element inviting users to a mobile version of the site? I found this on www.W3C.org but it's from 2008, and I was wondering if any of you have tried different approaches to mobile detection. Thanks! GaB
Technical SEO | | Pherogab0 -
What is link juice - and how do I utilise it?
Apologies for the very basic question - I am trying to determine exactly what link juice is. Every article I seem to find assumes that you already know what link juice is. From what I can tell, it is how your internal links push value around from your homepage and how it flows through your site. I don't understand how to optimize this and how to improve it throughout my site - or what the opportunities are. I'll attach an image of my site link numbers compared to a few rivals (names removed) to illustrate the difference - not vs the first column but certainly the other two. Can someone shed some light on link juice for me and point me in the right direction? Thanks.
Technical SEO | | Benj250