Should I worry about errors MozBot finds on pages that are not in my sitemap?
-
MozBot crawled my site and found a couple of errors on pages that aren't included by my sitemap plugin, such as duplicate page content on author pages.
Should I worry about things not on my sitemap?
-
Yes, I would fix everything that is or could be a problem. It is hard to rank, and you don't want anything working against you.
-
Whether a page is or is not included in your sitemap is irrelevant. Search engines will perform a normal crawl of your site based on the navigation and links for your site. If any page on your site can be reached from any navigation or link on your site, then search engines can find it.
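To illustrate the point about crawling, here is a minimal Python sketch. It assumes a made-up site represented as a dictionary of internal links (the URLs and the `reachable_pages` helper are hypothetical, not anything Moz or a search engine provides) and lists pages a crawler could reach by following links even though the sitemap omits them.

```python
from collections import deque

def reachable_pages(links, start):
    """Return every page reachable from `start` by following links."""
    seen = {start}
    queue = deque([start])
    while queue:
        page = queue.popleft()
        for target in links.get(page, []):
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return seen

# Hypothetical site: keys are pages, values are the pages they link to.
links = {
    "/": ["/blog/", "/about/"],
    "/blog/": ["/blog/post-1/", "/author/jane/"],
    "/blog/post-1/": ["/author/jane/", "/author/bob/"],
}

# Hypothetical sitemap that leaves out the author pages.
sitemap = {"/", "/blog/", "/about/", "/blog/post-1/"}

# Pages a link-following crawler can find that the sitemap never mentions.
crawlable_but_unlisted = sorted(reachable_pages(links, "/") - sitemap)
print(crawlable_but_unlisted)  # ['/author/bob/', '/author/jane/']
```

The author pages show up because blog posts link to them, which is the same way a crawler would discover them on a real site.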
If the page is not marked with a noindex tag, then search engines may attempt to index the page, which could cause a duplicate content issue.
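If you want to check whether a page already carries a noindex tag, something like the following Python sketch could help. It uses only the standard library and looks solely at the `<meta name="robots">` tag in the HTML you pass in; the `has_noindex` helper is a hypothetical name, and the sketch does not cover the `X-Robots-Tag` HTTP header or bot-specific meta tags.

```python
from html.parser import HTMLParser

class RobotsMetaParser(HTMLParser):
    """Records whether a <meta name="robots"> tag contains 'noindex'."""

    def __init__(self):
        super().__init__()
        self.noindex = False

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        name = (attr.get("name") or "").lower()
        content = (attr.get("content") or "").lower()
        # The content attribute is a comma-separated list of directives.
        directives = [d.strip() for d in content.split(",")]
        if name == "robots" and "noindex" in directives:
            self.noindex = True

def has_noindex(html):
    """Return True if the HTML contains a robots meta tag with noindex."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return parser.noindex

# Hypothetical author page markup.
page = '<html><head><meta name="robots" content="noindex, follow"></head></html>'
print(has_noindex(page))  # True
```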
A common cause of duplicate content issues on author pages is a lack of information. An author who has provided a detailed bio produces a nice author page, but other authors may share only the absolute minimum amount of information your site requires to publish. If an author page has no content other than a user name, it will look nearly identical to every other sparse author page, and that causes a duplicate content issue.
The preferable solution would be to gather more information from authors: name, email, social accounts, location, areas of expertise, interests, credentials, etc. If you are unable to do that, you can also noindex the pages for those authors.
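As a rough sketch of that fallback in Python: the author records, the `should_noindex` helper, and the 50-word threshold below are all assumptions made for illustration, not a recommended cutoff or any platform's API. The idea is simply to flag author pages whose bio is too thin to stand out from the others.

```python
def should_noindex(author, min_bio_words=50):
    """Flag an author page as thin when its bio has too few words.

    `min_bio_words` is an arbitrary illustrative threshold.
    """
    bio = author.get("bio") or ""
    return len(bio.split()) < min_bio_words

# Hypothetical author records.
authors = [
    {"username": "jane", "bio": "Jane covers technical SEO and site audits. " * 10},
    {"username": "bob", "bio": ""},
    {"username": "sam"},  # no bio field at all
]

# User names whose author pages are candidates for a noindex tag.
thin_authors = [a["username"] for a in authors if should_noindex(a)]
print(thin_authors)  # ['bob', 'sam']
```

Authors on that list are the ones to ask for more profile details first; the noindex tag is only for pages that stay empty.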