Sitemap.xml problem in Google webmaster
-
Hi,
My sitemap.xml is not submitting correctly in Google Webmaster.
There is 697 url submitted but only 56 are in Google index.
At the top of webmaster this is what it says ->>>
http://www.example.com/sitemap.xml has been resubmitted.
But when when I clicked status button RED X occurs.
Any suggestions about this, thanks...
-
Cheers for your reply and answer
& Yes most of your assumptions were correct I am using sitemap generation. The issue is fixed there was a problem with the sitmap when created but it's all sorted now & submitted correctly in WMT.
Thanks...
-
Cheers for your reply and answer
& Yes most of your assumptions were correct I am using sitemap generation. The issue is fixed there was a problem with the sitmap when created but it's all sorted now & submitted correctly in WMT.
Thanks...
-
For the 8 invalid pages, you need to fix the URLs. Based on your questions I assume you are using some form of sitemap generation software. Apparently it is not configured correctly. You will need to take a look at these pages to determine why the URLs are invalid and/or contact the sitemap software vendor.
With respect to the indexing, submitting a sitemap is no guarantee that the pages will be indexed. You can submit a 1000 page site and have every page indexed, or you can have only a couple hundred pages indexed. There are a variety of factors involved.
Some factors which can affect indexing:
-
Is your robots.txt file blocking any of these pages?
-
Are any of these pages duplicate content?
-
Are any of the pages invalid URLs?
-
Are any of these pages canonicalized to other pages?
-
Are any of these pages 301'd to other pages?
-
How well is your site's navigation working? Sitemaps help Google find island pages and such, but your site will be crawled much better with proper navigation along with both internal and external links.
-
How popular is your site and these pages? Pages with good PA are crawled regularly and sites with high DA are crawled more frequently and deeper then other sites.
-
-
I'm just wondering how to do go about fixing these? I ses that they are not valid. Also once fixed do you think this will solve the sitmap issue? (like are these 8 not valid pages causing 600+ pages not being indexed) thanks.
-
The links in your reply are not valid. Try clicking on one of them. They are to your secure Google WMT page and they have an extra http:// prefix.
-
Errors look this ->
1916
Invalid URLThis is not a valid URL. Please correct it and resubmit.URL:http://exhibitions/info_22.htmlParent tag: urlTag: locProblem detected on: Aug 4, 2011 1919Invalid URLThis is not a valid URL. Please correct it and resubmit.URL:http://irish-myths-and-legends/info_12.htmlParent tag: urlTag: locProblem detected on: Aug 4, 2011There is about 10 errors like the above, any suggestions?
-
You need to click on the sitemap in Google WMT and it will inform you of the issue. There are many possible causes ranging from the sitemap link not being accessible to the file not being formatted correctly.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Which pages should I index or have in my XML sitemap?
Hi there, my website is ConcertHotels.com - a site which helps users find hotels close to concert venues. I have a hotel listing page for every concert venue on my site - about 12,000 of them I think (and the same for nearby restaurants). e.g. https://www.concerthotels.com/venue-hotels/madison-square-garden-hotels/304484 Each of these pages list the nearby hotels to that concert venue. Users clicking on the individual hotel are brought through to a hotel (product) page e.g. https://www.concerthotels.com/hotel/the-new-yorker-a-wyndham-hotel/136818 I made a decision years ago to noindex all of the /hotel/ pages since they don't have a huge amount of unique content and aren't the pages I'd like my users to land on . The primary pages on my site are the /venue-hotels/ listing pages. I have similar pages for nearby restaurants, so there are approximately 12,000 venue-restaurants pages, again, one listing page for each concert venue. However, while all of these pages are potentially money-earners, in reality, the vast majority of subsequent hotel bookings have come from a fraction of the 12,000 venues. I would say 2000 venues are key money earning pages, a further 6000 have generated income of a low level, and 4000 are yet to generate income. I have a few related questions: Although there is potential for any of these pages to generate revenue, should I be brutal and simply delete a venue if it hasn't generated revenue within a time period, and just accept that, while it "could" be useful, it hasn't proven to be and isn't worth the link equity. Or should I noindex these "poorly performing pages"? Should all 12,000 pages be listed in my XML sitemap? Or simply the ones that are generating revenue, or perhaps just the ones that have generated significant revenue in the past and have proved to be most important to my business? Thanks Mike
Technical SEO | | mjk260 -
Duplicate Content Problem!
Hi folks, I have a quite awkward problem. Since a few weeks a get a huge amount of "duplicate content errors" in my MOZ crawl reports. After a while of looking for the error I thought of the domains I've bought additionally. So I went to Google and typed in site:myotherdomains.com The results was as I expected that my original website got indexed with my new domains aswell. That means: For example my original website was index with www.domain.com/aboutus - Then I bought some additional domains which are pointing on my / folder. What happened is that I also get listed with: www.mynewdomains.com/com How can I fix that? I tried a normal domain redirect but it seems as this doesn't help as when I am visiting www.mynewdomains.com the domain doesnt change in my browser to www.myoriginaldomain.com but stays with it ... I was busy the whole day to find a solution but I am kinda desperate now. If somebody could give me advice it would be much appreciated. Mike
Technical SEO | | KillAccountPlease0 -
Google webmaster errors
**If you know what these google webmasters errors mean, and you can explain it to me in simple english and tell me how I can locate the problem, I would really appreciate it!. <colgroup><col width=""><col width=""><col width=""><col width=""><col width="*"><col width="124"><col width="54"></colgroup>
Technical SEO | | Joseph-Green-SEO
| | | | | Server error | | | | Soft 404 | | | | Access denied | | Not found | | | Not followed | | | |** I have many of these errors, is it harming SEO?Yoseph0 -
Google Webmaster Tool - Crawl Stats Query ?
Dear All, I have been looking at GWT Crawl Stats and wondering how should I be interrupting the crawl stats chart. AllI I see is 3 charts telling me a high , low and average for the below but I am wondering is there anything I really need to be looking for ?. Pages crawled per day Kilobytes downloaded per day Time spent downloading a page (in milliseconds) thanks Sarah
Technical SEO | | SarahCollins0 -
Exclude Child URLs from XML Sitemap Generator (Wordpress)
Hi all, I was recommended the XML Sitemap Generator for Wordpress by the very helpful Keith Bloemendaal and John Pring - however I can't seem to exclude child URLs. There is a section Exclude items and a subsection Exclude posts. I have tried inputting the URLs for the pages I don't want in the sitemap, however that didn't work. So I read that you have to include a list of "IDs" - not sure where on earth to find that info, tried the page name and the post= number from the URL, however neither worked. I hope somebody can point me in the right direction - and apologies, I am a Wordpress novice, and I got no answers from the Wordpress forums so turned right back to SEOmoz! Cheers.
Technical SEO | | markadoi840 -
Problem With Video Sitemap Becuase All Videos Are in he Same URL
Hi, I created a video sitemap and now I'm getting an error on webmaster tools because the location for some of the videos is the same. It says: Duplicate URL - This URL is a duplicate of another URL in the sitemap. Please remove it and resubmit. What can I do if all my videos are located in the same URL?? Thanks
Technical SEO | | Tug-Agency0 -
Not ranking well in Google
Hi, I am new to Seomoz,I have some little doubts regarding <title>tag.</p> <p>Can i target 3 words in the title tag. Currently i am on top for one keyword, and i cant get the rest two in top positions. Here is my website, can anyone review my site please.</p> <p>xxx(dot)ridpiles(dot)com with keyword hemorrhoids treatment</p> <p>I have good amount of backlinks, but still something i am missing. I have 100% unique content.</p> <p> </p> <p>Regards</p></title>
Technical SEO | | Dexter22387874870 -
Include pagination in sitemap.xml?
Curious on peoples thoughts around this. Since restructuring our site we have seen a massive uplift in pages indexed and organic traffic with our pagination. But we haven't yet included a sitemap.xml. It's an ancient site that never had one. Given that Google seems to be loving us right now, do we even need a sitemap.xml - aside from the analytical benefis in WM Tools? Would you include pagination URL's (don't worry, we have no duplicate content) in the sitemap.xml? Cheers.
Technical SEO | | sichristie0