I need an XML sitemap expert for 5 minutes!
-
Hi all!
I'm hoping that someone with a lot of experience with XML sitemaps can help me out here...
When submitting my sitemap in Google Webmaster Tools, these are the results:
2,414,714 Submitted
34,721 IndexedAnd there's also tonnes of warnings.
Would anyone be able to take a quick look at these sitemaps to perhaps advise me on what's going wrong there? These do not load without the www, not sure if this is an issue?
http://www.eumom.ie/sitemap.xml
http://www.eumom.ie/sitemap.xml.gzThanks everyone in advance!!
Gavin
-
Few rules about sitemaps;
-
You should only include in them pages you also want crawled and indexed
-
They should not contain URLs with 404s or blocked by robots.txt
My guess is there are too many URLs in the sitemaps, since I'd guess the website is not over 2 million actual "real" pages,
Also, I randomly clicked on a URL in one of the sitemaps and it 404'd;
http://www.eumom.ie/forums/topic/oakhill-school-leopardstown-/
This is probably causing a lot of the errors you see. It's honestly not a 5 minute fix - but if it were my site, I would be using the Yoast SEO plugin and using the sitemap feature within Yoast. It makes it very easy to include / exclude certain pages and updated automatically etc.
I think there must be a way to tell your plugin what to include / exclude from the sitemap but I don't have as much experience with it.
But generally - only include pages you want crawled and indexed. Don't include pages that 404.
-
-
Hi all,
Many thanks for your input so far, much appreciated!
The sitemaps that you are seeing actually were generated using that plugin you mentioned. Formatting-wise, do you see anything wrong with the sitemaps?
Thanks!!
Gavin -
I couldn't agree more altecdesign!
http://wordpress.org/plugins/google-sitemap-generator/ all the way!
-
That XML sitemap you linked too is formatted in an odd way. I noticed the site you are generating the xml sitemap for is based in wordpress. There is a really solid sitemap plugin you could use to generate your XML and submit to google instead of the current plugin you are using: http://wordpress.org/plugins/google-sitemap-generator/
I've used that plugnin numerous times and submitted sitemaps to google with no errors. Hopefully that helps you out.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Sitemap generator partially finding list of website URLs
Hi everyone, When creating my XML sitemap here it is only able to detect a portion of the website. I am missing at least 20 URLs (blog pages + newly created resource pages). I have checked those missing URLs and all of them are index and they're not blocked by the robots.txt. Any idea why this is happening? I need to make sure all wanted URLs to be generated in an XML sitemap. Thanks!
Technical SEO | | Taysir0 -
Image Sitemap
I currently use a program to create our sitemap (xml). It doesn't offer creating an mage sitemaps. Can someone suggest a program that would create an image sitemap? Thanks.
Technical SEO | | Kdruckenbrod0 -
Mobile sitemap needed for responsive website?
I've seen some older 2012 posts that discuss, but nothing recent given the new changes to emphasize mobile. For website that are already tested and verified as mobile responsive, is best practice to develop a mobile-specific sitemap and submit that as well? Or will any mobile crawlers spider the regular sitemap?
Technical SEO | | Addion0 -
Need 301 Advice with a Recovered URL from a Domain Typosquatter
I am new to a SMB and someone bought the plural version of our domain back in 2005 and has yet to let it expire. The domain was just renewed for another year so we finally decided to contact a lawyer and go through the domain name dispute process. This seems like a pretty cut an dry case and the lawyer is very confident that we'll have the domain within 30-40 days. Currently the plural version domain 303s to spammy web pages, shows shady ads and is just a malicious looking page in general. I am not savvy enough to know the exact complexities of what's happening on the backend but it's spammy. Knowing the history of the plural version domain, how would you treat it after we acquire it? Obviously, I wouldn't want to put our site in jeopardy by 301ing the plural version of our URL to our current healthy site but at the same time many customers might go to that domain by accident so eventually I'd like to 301 it. If it's any help, the plural version has a robots.txt that prevent G from crawling it..thank you in advance for your guidance!
Technical SEO | | ssimarketing0 -
XML Feed
If a site has an xml feed being used by 100 companies to create the content on their site. Will those 100 sites receive any link juice? Is there any way content may be classed as duplicate across these sites? And should the page on the site where the xml feed is coming from have the page indexed first?
Technical SEO | | jazavide0 -
Sitemap Creation
Hi I am looking for the best way to generate an XML sitemap for webmaster tools for my website http://www.cheapfindergames.com. I have come across http://www.xml-sitemaps.com/ but it only allows up to 500 links. Is there a PHP script that any experts could share that would create the XML map that I could upload please? Many Thanks
Technical SEO | | ocelot0 -
Domain targeting advice needed please
I would be interested in hearing the views of other seomozers on this issue please. I have a web server hosted in The Netherlands which I currently host my sites on, it is super fast (16core 24gb ram) and in 8 months has had 4 mins of downtime! On this server I wish to build a couple of ecommerce stores. However this is where my issue lays The first store I launch will be targeted at the UK market, however the domain I wish to use for it is a .com domain which has a moz ranting of about 36 (better than most of my competitors, worse than a few so it's a good headstart). The problem I would then have is a .com domain hosted on a Dutch server targeting UK people. Even if I was to set the webmaster tools location to UK it would not be ideal. Also, when it comes to launching the US site I would then be looking at using a .us domain which is far from ideal The other option I have is to use the .co.uk domain for the UK site but this is new and lack any decent moz score. Given this I am now pondering the following set up....using the .com domain on the Dutch server but putting the UK store in domain.com/UK and the future usa store in domain.com/usa. Would this be the best work around? I could then set the location of folders in the webmaster tools? Also, I plan on using geo redirecting on the domain so if a uk page happens to rank in the USA listings the user gets automatically redirected to the nearest matching product available in their country in the /us/ folder. Would this be easiest to work with on just one domain as it wouldn't technically be redirecting people to another site as per using two domains. Any thoughts would be good. Not even sure I have managed to explain it very clearly hehe
Technical SEO | | Grumpy_Carl0 -
Mobile sitemaps - how much value?
Hi, We have a main www website with a standard sitemap. We also have a m. site for mobile content (the mobile site only contains our top pages and doesn't include the entire site). If a mobile client accesses one of our www pages we redirect to the m. page. If we don't have a m. version we keep them on the www site. Since we already have a www sitemap, is there much value in creating a mobile site map? The mobile site (although missing all pages) is pretty robust and contains most content people are looking for. Will the mobile sitemap help for Mobile searches (more so than our standard sitemap)? I'm also planning on rel canonical the m. pages to the www. pages (per other suggestios on SEOMoz) Thanks
Technical SEO | | NicB10