How many links can you have on sitemap.html?
-
We have a lot of pages that we want to create crawlable paths to. How many links can be crawled on one page of a sitemap.html?
-
From Google's perspective, sitemaps are limited to 50MB (uncompressed) and 50,000 URLs.
All formats limit a single sitemap to 50MB (uncompressed) and 50,000 URLs. If you have a larger file or more URLs, you will have to break it into multiple sitemaps. You can optionally create a sitemap index file (a file that points to a list of sitemaps) and submit that single index file to Google. You can submit multiple sitemaps and/or sitemap index files to Google.
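For anyone who wants to see what that splitting looks like in practice, here is a minimal sketch in Python. It assumes you already have a flat list of URLs; the function names (`chunk_urls`, `build_sitemap`, `build_sitemap_index`) and the example.com addresses are made up for illustration and are not part of any sitemap tool. It only emits the required `<loc>` element and checks the two limits quoted above.

```python
from xml.sax.saxutils import escape

MAX_URLS_PER_SITEMAP = 50_000      # per-file URL limit quoted above
MAX_BYTES_PER_SITEMAP = 52_428_800  # 50MB uncompressed
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def chunk_urls(urls, size=MAX_URLS_PER_SITEMAP):
    """Split a list of URLs into consecutive chunks of at most `size` items."""
    return [urls[i:i + size] for i in range(0, len(urls), size)]


def build_sitemap(urls):
    """Render one <urlset> document for a single chunk of URLs."""
    entries = "\n".join(f"  <url><loc>{escape(u)}</loc></url>" for u in urls)
    xml = (
        '<?xml version="1.0" encoding="UTF-8"?>\n'
        f'<urlset xmlns="{SITEMAP_NS}">\n'
        f"{entries}\n"
        "</urlset>\n"
    )
    # Very long URLs could still push a file past the size limit.
    if len(xml.encode("utf-8")) > MAX_BYTES_PER_SITEMAP:
        raise ValueError("sitemap exceeds 50MB; use a smaller chunk size")
    return xml


def build_sitemap_index(sitemap_urls):
    """Render a sitemap index that points at each individual sitemap file."""
    entries = "\n".join(
        f"  <sitemap><loc>{escape(u)}</loc></sitemap>" for u in sitemap_urls
    )
    return (
        '<?xml version="1.0" encoding="UTF-8"?>\n'
        f'<sitemapindex xmlns="{SITEMAP_NS}">\n'
        f"{entries}\n"
        "</sitemapindex>\n"
    )


if __name__ == "__main__":
    # Hypothetical site with 120,000 pages.
    urls = [f"https://example.com/page-{i}" for i in range(120_000)]
    chunks = chunk_urls(urls)
    files = [f"https://example.com/sitemap-{n}.xml" for n in range(1, len(chunks) + 1)]
    print(len(chunks), "sitemap files")
    print(build_sitemap_index(files))
```

You would write each `build_sitemap` result to its own file and then submit only the index file.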
Just for everyone's reference, here is a great list of 20 limits that you may not know about.
-
Hi Imjonny,
As you know, Google can crawl all your pages without any sitemap, so you don't need to create an HTML sitemap. An XML sitemap is sufficient to get all pages crawled. If you have millions of pages, you need to create an HTML sitemap organized by category and keep it to at most 1,000 links per page. As you know, an HTML sitemap is created for users, not for Google, so you don't need to worry about it too much.
Thanks
Rajesh
-
We break ours down to 1,000 per page. It's a simple setting in Yoast SEO, if you decide to use their sitemap tool. It's worked well for us, though I may bump that number up a bit.
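If you are building the HTML sitemap yourself rather than through a plugin, the same idea can be sketched in a few lines of Python. This is only an illustration, not how Yoast does it: the 1,000 figure is the number suggested in this thread rather than a hard limit, and the `paginate` and `render_page` helpers and the `sitemap-N.html` file naming are assumptions made up for the example.

```python
from html import escape

LINKS_PER_PAGE = 1000  # figure suggested in this thread, not a hard limit


def paginate(links, per_page=LINKS_PER_PAGE):
    """Split (url, title) pairs into pages of at most `per_page` links."""
    return [links[i:i + per_page] for i in range(0, len(links), per_page)]


def render_page(links, page_number, total_pages):
    """Render one HTML sitemap page as a plain list of anchors."""
    items = "\n".join(
        f'    <li><a href="{escape(url, quote=True)}">{escape(title)}</a></li>'
        for url, title in links
    )
    # Link to the next page so crawlers and users can walk the whole set.
    next_link = ""
    if page_number < total_pages:
        next_link = f'\n  <a href="sitemap-{page_number + 1}.html">Next page</a>'
    return (
        "<!DOCTYPE html>\n"
        f"<html><head><title>Sitemap, page {page_number} of {total_pages}</title></head>\n"
        "<body>\n  <ul>\n"
        f"{items}\n"
        "  </ul>"
        f"{next_link}\n"
        "</body></html>\n"
    )


if __name__ == "__main__":
    # Hypothetical site with 2,500 pages.
    links = [(f"https://example.com/p{i}", f"Page {i}") for i in range(2500)]
    pages = paginate(links)
    for number, page in enumerate(pages, start=1):
        html_text = render_page(page, number, len(pages))
        print(f"sitemap-{number}.html: {len(page)} links, {len(html_text)} characters")
```

Grouping the links by category before paginating, as suggested earlier in the thread, would just mean calling `paginate` once per category.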
-
Well, rather, I mean the number of links each page of the sitemap.html is allowed to have. For example, if I have a huge site, I don't want to place all the links on one page. I would probably break them out to allow the crawlers some breathing room between different links.
-
Hello!
I take it you are referring to the maximum size and/or the maximum number of URLs a sitemap file can have. That is answered in the FAQ of sitemaps.org: (link here)
Q: How big can my Sitemap be?
Sitemaps should be no larger than 50MB (52,428,800 bytes) and can contain a maximum of 50,000 URLs.
Best of luck!
GR