If a URL canonically points to another link, is that URL indexed?
-
Hi,
I have two URLs that both target the keyword phrase 'counting aggregated cells'.
The first URL has a canonical link pointing to the second URL, but if one searches for 'counting aggregated cells', both URLs are shown in the results.
The first URL is the PDF, and I need only the second URL (the landing page) to be shown in the search results.
The canonical link should tell Google which URL to index, so I don't understand why both URLs are present in the search results. Is 'noindex' for the first URL the only solution?
I am using Yoast SEO for my website.
Thank you for the answers.
-
Hey Lana,
Similar to what Anthony said, your setup should keep the PDF URL from being indexed. To help make sure the PDF doesn't get indexed, you can also do the following:
- Use the robots.txt file to block crawlers from PDF files (the rule below applies to all crawlers, not only Google's):
User-agent: *
Disallow: /*.pdf$
- Use rel="nofollow" on links that point to the PDF
-
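To see which URLs a wildcard rule would cover, here is a minimal Python sketch of the `*` and `$` pattern matching that Google documents for robots.txt paths. It assumes the rule is written with a leading slash (`/*.pdf$`), the function name `rule_matches` is made up for illustration, and it is a simplification rather than Google's actual parser:

```python
import re


def rule_matches(pattern, path):
    """Return True if a robots.txt path pattern matches a URL path.

    Simplified: '*' matches any run of characters and a trailing '$'
    anchors the pattern to the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with "any characters".
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += "$"
    # re.match anchors at the start, so an unanchored rule is a prefix match.
    return re.match(regex, path) is not None


print(rule_matches("/*.pdf$", "/docs/report.pdf"))
print(rule_matches("/*.pdf$", "/docs/report.pdf?download=1"))
print(rule_matches("/*.pdf", "/docs/report.pdf?download=1"))
```

The `$` matters: with it, a PDF URL that carries a query string is no longer covered by the rule, while without it any path containing `.pdf` is blocked.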
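If set up correctly, the canonical tag described above will usually keep the PDF out of the index, but Google treats a canonical as a hint rather than a directive, which is why both URLs can still appear. Noindex is the reliable way to keep the PDF out of the index. For a PDF it has to be sent as an X-Robots-Tag: noindex HTTP response header, because there is no HTML head to hold a meta tag. Google also has to be able to crawl the PDF to see either signal, so don't combine noindex or the canonical with the robots.txt block above.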
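Since a PDF has no HTML head, both signals have to arrive as HTTP response headers, and it is worth checking what the server actually sends. Below is a rough Python sketch that reads a canonical target from a `Link` header and a noindex directive from `X-Robots-Tag`. The function name `indexing_signals` and the example.com URL are placeholders, and the parsing is deliberately simplified (for example, it ignores user-agent prefixes such as `googlebot: noindex`):

```python
import re


def indexing_signals(headers):
    """Extract the canonical target and noindex flag from response headers."""
    # Header names are case-insensitive, so normalise them first.
    normalised = {name.lower(): value for name, value in headers.items()}

    canonical = None
    # A canonical header looks like: <https://example.com/page/>; rel="canonical"
    for match in re.finditer(r'<([^>]*)>\s*;([^,]*)', normalised.get("link", "")):
        target, params = match.groups()
        if re.search(r'rel\s*=\s*"?canonical"?', params, re.IGNORECASE):
            canonical = target
            break

    # X-Robots-Tag holds comma-separated directives; "none" implies noindex.
    robots = normalised.get("x-robots-tag", "").lower()
    directives = [d.strip() for d in robots.split(",")]
    noindex = "noindex" in directives or "none" in directives

    return {"canonical": canonical, "noindex": noindex}


sample = {
    "Link": '<https://example.com/landing/>; rel="canonical"',
    "X-Robots-Tag": "noindex, nofollow",
}
print(indexing_signals(sample))
```

For a live check, the headers of the PDF's response (for instance `urllib.request.urlopen(url).headers`) can be passed in the same way. If neither signal shows up for the PDF URL, the canonical was probably never being sent for that file.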