Crawler reporting incorrect URLs, resulting in false errors...
-
The SEOmoz crawler is showing 236 Duplicate Page Titles. When I look at which page titles are duplicated, I see that the URLs in question are incorrect and read "/about/about/..." instead of just "/about/". The duplicates shown are the result of the crawler ending up on the "Page not found" page.
Could it be the result of using relative links on the site? Is there anything I can do to remedy this?
Thanks for your help!
-Frank
-
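Hey Frank! This definitely sounds like an issue with relative links on the page. A relative link with no leading slash (for example, href="about/...") is resolved against the directory of the page it sits on, so from "/about/" it points to "/about/about/...". If the crawler sees that link, he'll follow it, and if the page he lands on carries the same link he'll keep doing so indefinitely, producing a super-long URL with the same sub-page repeated over and over. I actually see this a fair amount. I'd recommend looking through your page code for relative links and fixing them on that side of things.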
If you think it's a bug on our side and not something in your code, you can always send an email to us at help@seomoz.org - we'll be able to take a look and make sure (please include your PRO email address, the campaign with the issue, the URL with the issue, and any relevant screenshots/examples to help us diagnose). In my experience, though, this has always come down to a relative link in the page source. If you need help fixing that, you may want to start another thread here in the Q&A and post your page's source or URL. Hope this helps!
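To illustrate how a relative link can produce the repeating URLs described above, here is a minimal Python sketch using the standard library's `urljoin`. The domain `www.example.com`, the path `about/team/`, and the helper `resolve_chain` are made up for illustration and are not taken from Frank's site. The sketch also assumes that every page the crawler lands on (including a "Page not found" page) carries the same relative link; it does not reproduce how the SEOmoz crawler itself works.

```python
from urllib.parse import urljoin


def resolve_chain(start_url, href, hops):
    """Follow the same href repeatedly, resolving it against each new page URL.

    Hypothetical helper: it mimics a crawler that keeps finding the same
    link on every page it lands on.
    """
    urls = []
    current = start_url
    for _ in range(hops):
        # Resolve the href the way a browser or crawler would.
        current = urljoin(current, href)
        urls.append(current)
    return urls


base = "http://www.example.com/about/"  # assumed example page

# A relative href (no leading slash) is resolved against the current
# directory, so the path grows on every hop.
for url in resolve_chain(base, "about/team/", 3):
    print(url)

# A root-relative href (leading slash) resolves to the same URL
# no matter how deep the current page is.
print(urljoin("http://www.example.com/about/about/team/", "/about/team/"))
```

Under those assumptions, switching the link to a root-relative or absolute href is one way to stop the path from growing.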
-
I am getting the same thing. I hope someone can help with this!