Is it OK for a sitemap to appear as a "Top URL" in Google Webmaster?
-
I'm using Google Webmaster (alongside other tools) to understand how Google is indexing my site.
One of the tools is "Content Keywords", where it lists keywords that Google sees as significant for your site. The keywords shown are generally fine, but when I click on an individual word, I am often seeing our sitemap as one of the "Top URLs" that the keyword is found on (our sitemap is at system/sitemap1.xml.gz) - is this OK?
Obviously I don't want to add the sitemap URL to robots.txt, but I also want to ensure that 'real' user-focused pages (e.g. our homepage) appear higher in the "Top URLs" list for the keywords, as I'm assuming this is an indicator of how the site is performing in search.
Any help appreciated!
-
Thanks for the answer. However I'm still unclear on a few things so I thought I'd give some further info:
- We actually have two XML sitemaps - one for our main site including our forums (this sitemap is generated/submitted by a ruby on rails plugin) and one for blog posts and static pages (this sitemap is generated by a Wordpress plugin). The sitemap which is appearing as a "Top URL" is the first one
- There are actually no links to our sitemap anywhere on our site - the only way Google knows about it is because we automatically generate and submit it to Webmaster
I think the reason that it is appearing as a Top URL is because all of the page titles of forum posts are listed in the sitemap, and this is the only page where they are all listed on one page. So I think you are right about the 'simple algorithm' thing, but I think it's because of the frequency of the keyword in the sitemap, rather than because the sitemap is linked to from anywhere on the site (because it's not).
This brings me to a related question - is it bad having two separate XML sitemaps, and should I be linking to them somehow from the site?
-
I wouldn't be overly concerned.
For some terms, especially product codes and the detail pages of your site there are probably only going to be three pages where that term appears. The product page itself, the page within the navigation that links to that page (normally a list), and the sitemap.
Your sitemap is probably heavily linked to across the site so it does kind of make sense that it would appear as one of the top URLs for a term.
The reason I wouldn't be overly concerned is that I would IMAGINE (and I could be totally wrong) that the top Pages list is generated by a very simple algorithm that doesn't reflect how the organic search algorithm sees your site.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
"Noindex, follow" for thin pages?
Hey there Mozzers, I have a question regarding Thin pages. Unfortunately, we have Thin pages, almost empty to be honest. I have the idea to ask the dev team to do "noindex, follow" on these pages. What do you think? Has someone faced this situation before? Will appreciate your input!
Technical SEO | | Europarl_SEO_Team0 -
Using "Div's" to place content at top of HTML
Is it still a good practice to use "div's" to place content at the top of the HTML code, if your content is at the bottom of the web page?
Technical SEO | | tdawson090 -
Clarification on indexation of XML sitemaps within Webmaster Tools
Hi Mozzers, I have a large service based website, which seems to be losing pages within Google's index. Whilst working on the site, I noticed that there are a number of xml sitemaps for each of the services. So I submitted them to webmaster tools last Friday (14th) and when I left they were "pending". On returning to the office today, they all appear to have been successfully processed on either the 15th or 17th and I can see the following data: 13/08 - Submitted=0 Indexed=0
Technical SEO | | Silkstream
14/08 - Submitted=606,733 Indexed=122,243
15/08 - Submitted=606,733 Indexed=494,651
16/08 - Submitted=606,733 Indexed=517,527
17/08 - Submitted=606,733 Indexed=517,498 Question 1: The indexed pages on 14th of 122,243 - Is this how many pages were previously indexed? Before Google processed the sitemaps? As they were not marked processed until 15th and 17th? Question 2: The indexed pages are already slipping, I'm working on fixing the site by reducing pages and improving internal structure and content, which I'm hoping will fix the crawling issue. But how often will Google crawl these XML sitemaps? Thanks in advance for any help.0 -
Can I "Run Macros" on my own?
I talked to the SEO company I am using and trying to get an understanding of what it is they are doing for me. They told me that one of the most important things they are doing is running macros. Is this something I could learn to do myself? What does it mean? How do I do it? How long does it take?? I have recently been educating myself on SEO and coded my website with metadata titles and descriptions. Is running macros something I can do on my own too? I guess I'd also just like to know what it is.
Technical SEO | | CapitolShine0 -
What can I do if Google Webmaster Tools doesn't recognize the robots.txt file?
I'm working on a recently hacked site for a client and and in trying to identify how exactly the hack is running I need to use the fetch as Google bot feature in GWT. I'd love to use this but it thinks the robots.txt is blocking it's acces but the only thing in the robots.txt file is a link to the sitemap. Unde the Blocked URLs section of the GWT it shows that the robots.txt was last downloaded yesterday but it's incorrect information. Is there a way to force Google to look again?
Technical SEO | | DotCar0 -
How long will Google take to stop crawling an old URL once it has been 301 redirected
I need to do a clean-up old urls that have been redirected in sitemap and was wondering about this.
Technical SEO | | Ant-8080 -
302 vs. a href="nofollow"
we came across one thing the we did not asked to programm by our intention. we have a magento shop and on the produktpage we have those "compare" buttons. these link have a session id and the follow a 302 back onto the same page. so i beleive the idea is that google will just not follow 302s and thats it. so my questions is: is this right what we beleive if so why is a 302 better compared to a a href="nofollow" ???
Technical SEO | | kynop0