Why moz pro detects inexistent links?
-
I have a campaign in moz pro to my personal webpage for testing purposes and also a bit of learning. But i have a question:
On link -> Link analysis i can see this:
http://maqui.darkbolt.net/project/chat/index.php 404http://maqui.darkbolt.net/project/docs/index.php 404http://maqui.darkbolt.net/project/down/index.php 404http://maqui.darkbolt.net/project/foto/index.php 404http://maqui.darkbolt.net/project/news/index.php?news=1 404http://maqui.darkbolt.net/project/project/index.php 404http://maqui.darkbolt.net/project/ro/index.php 404http://maqui.darkbolt.net/project/who/index.php 404Obviously all these address doesn't exist. There are links on the page project/index.php linking to, for example, /chat/index.php.How can i resolve this problem on the stats? There's something bad really on the page? As i can see all links on the page are working properly.
-
Hi there,
-
I reviewed several pages that we're reporting as duplicates and none of them are canonicalized. If you don't want to change the content in the source code so it's not 95% similar to the other pages, you'll need to add canonicals. The Help Hub has some good info on how to do this. You can also run a search in the community.
-
If we reported 404's in the initial crawl, it's because they existed at the time. The most recent crawl isn't showing any 404's so this shouldn't be an issue anymore.
-
Again, there are no 404's reported in this week's crawl for your campaign so there will be no 404's in the crawl diagnostic csv. That is where you'll want to go if this comes up again though.
-If the page on your site is linked to from anywhere on your site, we will crawl and report on it up to the page crawl limit set for the campaign. We're not going to report data for non-existent links as that isn't physically possible. I hope this helps clear things up.
-
-
As i've told at first post and anothers,
1º: I have a failure of duplicate content, detected on first crawl. The dup content comes from a parameter on all pages (?login). I've solvented it making this parameter a link to main page. This change resolves the duplicate content on all site.
2º: On first crawl the system itself detects 404's. The do not exist. I haven't change anything to resolve 404's. If you see the urls, from my screenshots, they are strange URLs, because they detect the 404 on urls as style of index.php/project.php. This page doesn't have mod_rewrite or anything similar, because this, these URL's are impossible.
3º: I've tried to download the crawl diagnostic. On one file i doesn't have the referrer URL, on another, they doesn't have the 404 ones.
I'm trying to know why the system detects this pages when they doesn't exist and aren't linked from anyother site. If i have something bad, then, i have something VERY bad and i need to resolve right now. If not, i think the system detects some incorrect at my page but i cannot understand why.
That's worries me a lot, because, this page and campaign are a test. It's my personal web and i have only a few pages and a few links. But if i can't understand the results from this page. How can i understand / read the results from my best page, who haves more than 10.000 pages, multiple domains, social medias, and more?
-
Gotcha.
We're not actually reporting 404's in this case. We're reporting that one page is a duplicate of another which happens if the content on the source code is 95% similar or greater. The pages we're reporting that are duplicate did exist at the time of the crawl which is why they're showing up. If you made any changes after the crawl, there is a chance that the pages no longer exists in which case the next crawl will not show them as duplicates. They will be reported as 404's though so you'll still want to resolve that problem.
Outside of that, you can download the crawl diagnostic csv to get a list of referrer URL's. This is handy if you're ever unsure of how we got to a specific page. Hope this helps clear things up!
-
Yes, that's correct.
-
Hi there,
We're a bit lost in what you're trying to ask here. Based on what I've read, it sounds like you're saying that the weekly campaign crawl (not the Link Analysis data) is reporting 404's and you're not sure how/why that is happening. Is that correct?
-
Yes. Link analysis shows pages as: comusys.php/comusys.php (Who, obviously, doesn't exist).
On crawl analysis CSV i cannot get these pages. You can also get the analysis from your own open site explorer and view these pages doesn't appear.
And, now, Link analysis doesn't show any 404.
I've attached some examples of my campaign. I cannot understand why they are detecting this.
Any help will be apreciated.
Thanks,
Screenshot%20455.png Screenshot%20456.png Screenshot%20457.png
-
So the link analysis is showing you that there are sites linking to pages that don't exist on your site?
-
In the crawl test there isn't have the 404 errors.
-
There is a column to the far right that should show the referring URL. Be sure to scroll until you find that, and then you will see where we found those URLs.
Clarification: this is in the crawl test report. I'm not sure why you're seeing 404s in the link analysis page.
-
I've requested the CSV and downloaded it. But i cannot see the page pointing to the error, i can only view the error:
That's the report:
<colgroup><col width="392"> <col width="28"></colgroup>
| http://maqui.darkbolt.net/project/chat/index.php | 404 |
| http://maqui.darkbolt.net/project/docs/index.php | 404 |
| http://maqui.darkbolt.net/project/down/index.php | 404 |
| http://maqui.darkbolt.net/project/foto/index.php | 404 |
| http://maqui.darkbolt.net/project/news/index.php?news=1 | 404 |
| http://maqui.darkbolt.net/project/project/index.php | 404 |
| http://maqui.darkbolt.net/project/ro/index.php | 404 |
| http://maqui.darkbolt.net/project/who/index.php | 404 |Obviously there's all erroneous, the section "who" isn't inside "project" one. All links are valid without the part of /project.
I cannot understand why system are reading these links, on page, the links works ok.
-
Have you downloaded the CSV of your crawl report? You can look at the column for the referring URL and see what page is pointing to the 404 error.
-
There's no reason to leave index.php in different URIs, simply i have defined all links with their name, for example, for a root page from a section, the uri are defined as /project/index.php, not /project/ only.
I can clear them, or leave it. There isn't the problem. My problem are the moz stats returning non-linked inexistent addresses, and i doesn't know why.
Also, i've detected a failure who makes moz to find duplicated pages (A erroneous link with only a parameter) and i've corrected already. But i cannot find the reason for the inexistent pages.
-
Is there a reason you leave index.php in your URL? Might make it easier to strip it off using HTACCESS so you can see more clearly what you are dealing with.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Why my website Backlinks not getting Crawl by Moz?
Hi, I've the query related to backlinks of my website. Some of the high authority sites give backlinks to my website but these links are still not showing in Moz. My website is 6 months old and also continuously getting backlinks from high authority sites but still these are not showing on Moz and also not improving DA and PA of my website. I've also attached the screenshot of Moz link explorer results please check and guide me what to do with website so Moz will consider it and give some authority. And also guide me How much Moz takes time to crawl website backlinks and shown in their link explorer. Website URL: https://www.welderexpert.com Moz Expert Suggestions needed. Thanks, overview?site=welderexpert.com⌖=domain
Link Explorer | | KOidue0 -
Moz crawling http rather than https site
Our site is secure but when I ask moz to crawl it by giving the root domain including https moz insists on crawling the non secure version. How do i force it to crawl the secure version?
Link Explorer | | media12340 -
Open Site Explorer external + follow links percentage
Hi how are you my root domain according to open site explorer has 90% of total links, external + follow. whereas my competitors have 5 - 6% how do I get this down to look more natural. Also what does this metric mean and how do I work out the percentage? Also I only have around 100 pages on my website which is a Shopify store and I have a small amount of internal followed links is this important for an ecommerce website as it is small number in comparison to my competitors. Thank you regards Adam
Link Explorer | | hourspy1 -
Domain Authority (DA) in Moz Pro changed only within the last 1-2 months?
Has anyone noticed that the Domain Authority (DA) as reported in Moz Pro has changed only within the last 1-2 months? We have screen shots showing plots of DA vs competitors w/ line graph 2 months ago starting in NOV 2017 which today starting JAN 2018 and comparing shows DA up to 50% different!
Link Explorer | | Amplitude_Digital
The change is seen both in the Links Overview and under the Spam Score sections still marked "NEW". Can Moz confirm that it's only recently within the last 2 months that in Moz Pro the NEW DA numbers have retroactively been updated even though the new Link Explorer has been publicly out since APR 30 from https://moz.com/community/q/moz-s-new-link-explorer-including-our-revamped-index-and-da-pa-scores-is-now-open-to-everyone? Look at the top green line starting ~12 months ago on both graphs, w/ old below 40 and new above 50. We've seen even greater differences for other tracked domains. Thanks! view0 -
Angular SPA & MOZ Crawl Issues
Website: https://www.exambazaar.com/ Issue: Domain Authority & Page Authority 1/100 I am using Prerender to cache/render static pages to crawl agents but MOZ is not able to crawl through my website (https://www.exambazaar.com/). Hence I think it has a domain authority of 1/100. I have been in touch with Prerender support to find a fix for the same and have also added dotbot to the list of crawler agents in addition to Prerender default list which includes rogerbot. Do you have any suggestions to fix this? List: https://github.com/prerender/prerender-node/commit/5e9044e3f5c7a3bad536d86d26666c0d868bdfff Adding dotbot to Express Server:
Link Explorer | | gparashar
prerender.crawlerUserAgents.push('dotbot');0 -
Internal links - OpenSiteExplorer vs Webmaster tools
Hi guys, I run Opensiteexplorer on the website www.fazland.com (in italian...). (https://moz.com/researchtools/ose/comparisons?site=www.fazland.com) It only shows ca 240 internal links where I know there are plenty more also shown in Google Webmaster Tools. I cannot figure out why it can crawl so few links - Google shows 79.000 links found and internal links are pretty well established. Any suggestions from your experience on this difference? Cheers, Rob
Link Explorer | | r.bonsanti0 -
Does Feedburner URL of the Home Page Carry Link Equity?
Hi There, During an SEO Audit, I found that OSE categorizes Feedburner URL of root domains under link-equity passing and followed. For example, the following link has been categorized under link-equity passing and followed: http://feeds.feedburner.com/SpoonflowerBlog I have heard that a lot of SEOs saying feedburner links don't carry any link juice. If that's true, then why does OSE categorize feedburner URL of root domains under link-equity passing and followed? I would appreciate if someone from the Moz staff could take some to answer this. Thanks.
Link Explorer | | TopLeagueTechnologies1 -
Moz & Other Sites Not Showing in Link Profile?
I'm curious to know why Moz, YouTube, G+ and some other links are not showing the OSE link profile as NOFOLLOW or DOFOLLOW for our website whiteboardcreations.com? When doing analysis, we see a lot of companies YouTube pages with their NF link and G+ page links and even see a lot of the Moz community members with their Profile pages showing the NF link or the DF links. I'm confused as to why OSE is not picking up any of our links from those respective sites. Any thoughts? Thank you in advance. - Patrick
Link Explorer | | WhiteboardCreations0