Too many on page links in sitemap.html
-
My crawl report is flagging an issue with too many links to one of my pages, this page is my sitemap.html. However, I have coded the page so that if required is specified it generates an .xml version of the page and if not then the html version is displayed. What is the best way to stop the crawl finding the html version whilst maintaining it on the site for clients navigation?
-
The thing to remember is that the HTML version should only ever be used for users and not to redirect robots if they hit a 404 on your .xml file. The reason for this is that search engines may still see the file as 404 after the redirect or a 301 redirect, if the later you then have an issue of search engines thinking it was there but is now the html page. Which of course is not a good thing.
I would advise ensuring the fall back never happens to robots / spiders - if the file is just a 404 SE's will return to it, they may not if it is 301 redirect.
-
Thanks for the response,
This was the first thought, but I wasn't 100% sure that hiding it in the robot.txt file should solely remove this issue and it is still early.
Thanks again.
-
hide it using a robots.txt file - though you could also use the noindex meta tag ... this being said search engines in general recognize sitemap pages and aren't too fussed by them, its a good jumping off point for them to find info.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Google Parsing jQuery Links as Real Links
While trying to diagnose a recent Google penalty I found out that links were being parsed by Google even though they were made using jQuery. I had the linkify plugin on my site and configured it to convert URLs to links on all of my pages. Today I found links to other sites of mine from sites that should not have been linking to them and found that the links came from pages whose links were generated via jQuery. This makes me wonder, how do I know if Google is counting javascript generated links? Is it possible that my native ad widgets are creating links that Google might count? Since I don't own any of the sites that advertise via the widgets I don't know how to tell if they are getting link juice or not. It used to be that Google didn't parse javascript, so you could add as many links to your site via javascript as you wanted without being seen by Google as linking to those sites. Does anyone know of a jQuery plugin that does turn URLs into clickable links that Google won't parse as real links?
On-Page Optimization | | STDCarriers0 -
Page Speed
Google recommends a page load speed of 1.4 seconds, is it recommended to have that page speed for every page on the site, or just the landing pages. Is there a tool that will check the load speed of every page on a site and report the slow pages? The free online tools only check one page at a time.
On-Page Optimization | | Bryan_Loconto0 -
How to improve On-Page Grades for Top Ranking Pages
please help me - i dont know or understand how to improve on-Page Grades for Top Ranking Pages
On-Page Optimization | | pwwukpw0 -
Header Links vs. In Page Links
We have lost considerable rank for some of our top search terms (department names) and the rank loss correlates to a change we made on our homepage. That change was to remove a secondary navigation to the major departments in the content of our homepage. Now all we have is the global header navigation on the homepage (and all other pages on the site). I have read that in-page links pass more value than sitewide header links and I'm wondering if this is really true. These were text links (not linked images) and our header also contains text links (and some javascript). We did not make any other changes on our site at this time and this was not around the time of any major algorithm updates. The site is www.ebags.com.
On-Page Optimization | | SharieBags1 -
How do I do a 301 Redirect in IIS 7 from http://www.freightmonster.com/index.html to http://freightmonster.com/index.html when I don't have a physical page to redirect?
I'm trying to get rid of my Rel Canonical links and use the 301 Redirect instead.
On-Page Optimization | | FreightBoy0 -
Old pages
I have a site where I have 5,000 new products each year, I never waned to deleted the old pages due to links pointing to them and keywords. But I now have 20,000 plus pages, does having that many pages spread out my link juice or does it effect me in any other ways over having a site with 5,000 pages or should I keep not deleting old pages so I dont loose any links? Along with that I currently do not link to my old pages from my site so Im guessing google does not get to them very often if at all, if you agree to still keep them should I link to them somewhere? Because the products are not that simiiar and they do bring added value I dont think canonical would work here
On-Page Optimization | | Dirty0 -
Footer link to home page?
Quick question - is it a best practice to add a footer link on each page of a website that points back to your home page, with the anchor text being your official brand name?
On-Page Optimization | | Bandicoot0 -
Tag clouds: good for internal linking and increase of keyword relevant pages?
As Matt Cutts explained, tag clouds are OK if you're not engaged in keyword stuffing (http://www.youtube.com/watch?v=bYPX_ZmhLqg) - i.e. if you're not putting in 500 tags. I'm currently creating tags for an online-bookseller; just like Amazon this e-commerce-site has potentially a couple of million books. Tag clouds will be added to each book detail page in order to enrich each of these pages with relevant keywords both for search engines and users (get a quick overview over the main topics of the book; navigate the site and find other books associated with each tag). Each of these book-specific tag clouds will hold up to 50 tags max, typically rather in the range of up to 10-20. From an SEO perspective, my question is twofold: 1. Does the site benefit from these tag clouds by improving the internal linking structure? 2. Does the site benefit from creating lots of additional tag-specific-pages (up to 200k different tags) or can these pages become a problem, as they don't contain a lot of rich content as such but rather lists of books associated with each tag? Thanks in advance!
On-Page Optimization | | semantopic0