Why does Moz Pro detect nonexistent links?
-
I have a campaign in Moz Pro for my personal web page, for testing purposes and also to learn a bit. But I have a question.
On Link -> Link analysis I can see this:
http://maqui.darkbolt.net/project/chat/index.php 404
http://maqui.darkbolt.net/project/docs/index.php 404
http://maqui.darkbolt.net/project/down/index.php 404
http://maqui.darkbolt.net/project/foto/index.php 404
http://maqui.darkbolt.net/project/news/index.php?news=1 404
http://maqui.darkbolt.net/project/project/index.php 404
http://maqui.darkbolt.net/project/ro/index.php 404
http://maqui.darkbolt.net/project/who/index.php 404
Obviously none of these addresses exist. There are links on the page project/index.php pointing to, for example, /chat/index.php.
How can I resolve this problem in the stats? Is there really something wrong with the page? As far as I can see, all the links on the page work properly.
-
Hi there,
-
I reviewed several pages that we're reporting as duplicates and none of them are canonicalized. If you don't want to change the content in the source code so it's not 95% similar to the other pages, you'll need to add canonicals. The Help Hub has some good info on how to do this. You can also run a search in the community.
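For anyone who wants to check whether a page already declares a canonical, here is a minimal sketch using only the Python standard library. The sample HTML and the `index.php` canonical target are hypothetical and are not taken from the asker's site; the sketch only illustrates what "add canonicals" refers to and says nothing about how Moz evaluates them.

```python
from html.parser import HTMLParser
from typing import Optional


class CanonicalFinder(HTMLParser):
    """Collects the href of the first <link rel="canonical"> tag seen."""

    def __init__(self) -> None:
        super().__init__()
        self.canonical: Optional[str] = None

    def handle_starttag(self, tag, attrs):
        # Only <link> tags matter, and only the first canonical is kept.
        if tag != "link" or self.canonical is not None:
            return
        attr = dict(attrs)
        rels = (attr.get("rel") or "").lower().split()
        if "canonical" in rels:
            self.canonical = attr.get("href")


def find_canonical(html: str) -> Optional[str]:
    """Return the canonical URL declared in the HTML, or None if absent."""
    finder = CanonicalFinder()
    finder.feed(html)
    return finder.canonical


# Hypothetical page, e.g. served at index.php?login, pointing at index.php.
SAMPLE_WITH = (
    '<html><head><link rel="canonical" '
    'href="http://maqui.darkbolt.net/index.php"></head><body></body></html>'
)
SAMPLE_WITHOUT = "<html><head><title>No canonical</title></head></html>"

print(find_canonical(SAMPLE_WITH))
print(find_canonical(SAMPLE_WITHOUT))
```

A page that prints `None` here has no canonical tag, which matches the situation described in this reply.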
-
If we reported 404's in the initial crawl, it's because they existed at the time. The most recent crawl isn't showing any 404's so this shouldn't be an issue anymore.
-
Again, there are no 404's reported in this week's crawl for your campaign so there will be no 404's in the crawl diagnostic csv. That is where you'll want to go if this comes up again though.
If the page on your site is linked to from anywhere on your site, we will crawl and report on it up to the page crawl limit set for the campaign. We're not going to report data for non-existent links as that isn't physically possible. I hope this helps clear things up.
-
As I said in my first post and the others:
1. I had a duplicate content failure, detected on the first crawl. The duplicate content came from a parameter on all pages (?login). I solved it by making this parameter a link to the main page. This change resolves the duplicate content across the whole site.
2. On the first crawl the system itself detected 404s. They do not exist. I haven't changed anything to resolve the 404s. If you look at the URLs in my screenshots, they are strange URLs, because the 404s are detected on URLs in the style of index.php/project.php. This page doesn't use mod_rewrite or anything similar, so these URLs are impossible.
3. I've tried to download the crawl diagnostic. One file doesn't have the referrer URL; the other doesn't have the 404 ones.
I'm trying to find out why the system detects these pages when they don't exist and aren't linked from any other site. If something is wrong on my side, then it is something VERY wrong and I need to fix it right now. If not, I think the system detects something incorrect on my page, but I cannot understand why.
That worries me a lot, because this page and campaign are a test. It's my personal website and it has only a few pages and a few links. But if I can't understand the results for this page, how can I understand / read the results for my main site, which has more than 10,000 pages, multiple domains, social media, and more?
-
Gotcha.
We're not actually reporting 404's in this case. We're reporting that one page is a duplicate of another, which happens if the content in the source code is 95% similar or greater. The pages we're reporting as duplicates did exist at the time of the crawl, which is why they're showing up. If you made any changes after the crawl, there is a chance that the pages no longer exist, in which case the next crawl will not show them as duplicates. They will be reported as 404's though, so you'll still want to resolve that problem.
Outside of that, you can download the crawl diagnostic csv to get a list of referrer URL's. This is handy if you're ever unsure of how we got to a specific page. Hope this helps clear things up!
-
Yes, that's correct.
-
Hi there,
We're a bit lost in what you're trying to ask here. Based on what I've read, it sounds like you're saying that the weekly campaign crawl (not the Link Analysis data) is reporting 404's and you're not sure how/why that is happening. Is that correct?
-
Yes. Link analysis shows pages such as comusys.php/comusys.php (which, obviously, doesn't exist).
In the crawl analysis CSV I cannot find these pages. You can also run the analysis from your own Open Site Explorer and see that these pages don't appear.
And now Link analysis doesn't show any 404.
I've attached some examples from my campaign. I cannot understand why they are being detected.
Any help will be appreciated.
Thanks,
Screenshot%20455.png Screenshot%20456.png Screenshot%20457.png
-
So the link analysis is showing you that there are sites linking to pages that don't exist on your site?
-
The crawl test doesn't contain the 404 errors.
-
There is a column to the far right that should show the referring URL. Be sure to scroll until you find that, and then you will see where we found those URLs.
Clarification: this is in the crawl test report. I'm not sure why you're seeing 404s in the link analysis page.
-
I've requested the CSV and downloaded it. But I cannot see the page pointing to the error; I can only see the error itself.
This is the report:
| http://maqui.darkbolt.net/project/chat/index.php | 404 |
| http://maqui.darkbolt.net/project/docs/index.php | 404 |
| http://maqui.darkbolt.net/project/down/index.php | 404 |
| http://maqui.darkbolt.net/project/foto/index.php | 404 |
| http://maqui.darkbolt.net/project/news/index.php?news=1 | 404 |
| http://maqui.darkbolt.net/project/project/index.php | 404 |
| http://maqui.darkbolt.net/project/ro/index.php | 404 |
| http://maqui.darkbolt.net/project/who/index.php | 404 |
Obviously these are all erroneous: the "who" section isn't inside the "project" one. All the links are valid without the /project part.
I cannot understand why the system is reading these links; on the page, the links work fine.
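One possible explanation, offered only as a hypothesis because the page's HTML is not shown in this thread: if a link on project/index.php is written without a leading slash (`chat/index.php` instead of `/chat/index.php`), any crawler will resolve it relative to the /project/ directory. The sketch below uses Python's standard `urljoin` to show standard URL resolution; the href values are assumptions, and the last case further assumes the server answers a request for `comusys.php/` with the same page.

```python
from urllib.parse import urljoin

# Page that contains the links (URL taken from the thread).
BASE = "http://maqui.darkbolt.net/project/index.php"


def resolve(base: str, href: str) -> str:
    """Resolve an href against the page URL using standard relative-URL rules."""
    return urljoin(base, href)


# Root-relative href: resolved from the site root.
root_relative = resolve(BASE, "/chat/index.php")

# Document-relative href (assumed, no leading slash): resolved from /project/.
doc_relative = resolve(BASE, "chat/index.php")

# Assumed case: the page is reachable with a trailing slash after the script
# name, so a document-relative href repeats the script name in the path.
path_info = resolve("http://maqui.darkbolt.net/comusys.php/", "comusys.php")

print(root_relative)  # http://maqui.darkbolt.net/chat/index.php
print(doc_relative)   # http://maqui.darkbolt.net/project/chat/index.php
print(path_info)      # http://maqui.darkbolt.net/comusys.php/comusys.php
```

The second and third results have the same shape as the URLs in the report, so it may be worth viewing the page source and checking each href for a missing leading slash.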
-
Have you downloaded the CSV of your crawl report? You can look at the column for the referring URL and see what page is pointing to the 404 error.
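If the export is large, the referrer lookup can be scripted. This is a rough sketch only: the column names (`URL`, `HTTP Status Code`, `Referrer`) and the sample rows are assumptions made for illustration, not the actual layout of the Moz crawl CSV, so adjust them to whatever headers your file really has.

```python
import csv
import io

# Made-up sample standing in for the downloaded CSV. The column names and
# the referrer values are assumptions, not real Moz output.
SAMPLE_CSV = """URL,HTTP Status Code,Referrer
http://maqui.darkbolt.net/project/index.php,200,http://maqui.darkbolt.net/
http://maqui.darkbolt.net/project/chat/index.php,404,http://maqui.darkbolt.net/project/index.php
"""


def find_404_referrers(csv_text, url_col="URL",
                       status_col="HTTP Status Code", ref_col="Referrer"):
    """Return (broken_url, referring_url) pairs for every row with status 404."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [
        (row[url_col], row[ref_col])
        for row in reader
        if row[status_col].strip() == "404"
    ]


broken = find_404_referrers(SAMPLE_CSV)
for url, referrer in broken:
    print(f"{url} <- linked from {referrer}")
```

For a real file, read it with `open(path, newline="", encoding="utf-8").read()` and pass the text to `find_404_referrers`, changing the column arguments as needed.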
-
There's no particular reason for leaving index.php in the URIs; I have simply defined all links with their full name. For example, for the root page of a section, the URI is defined as /project/index.php, not just /project/.
I can remove them or leave them; that isn't the problem. My problem is that the Moz stats return nonexistent addresses that aren't linked from anywhere, and I don't know why.
Also, I detected a fault that made Moz find duplicate pages (an erroneous link with only a parameter) and I've already corrected it. But I cannot find the reason for the nonexistent pages.
-
Is there a reason you leave index.php in your URL? It might be easier to strip it off using .htaccess so you can see more clearly what you are dealing with.