Why does Moz Pro detect nonexistent links?
-
I have a campaign in Moz Pro for my personal web page, for testing purposes and also to learn a bit. But I have a question.
Under Link -> Link Analysis I can see this:
http://maqui.darkbolt.net/project/chat/index.php 404
http://maqui.darkbolt.net/project/docs/index.php 404
http://maqui.darkbolt.net/project/down/index.php 404
http://maqui.darkbolt.net/project/foto/index.php 404
http://maqui.darkbolt.net/project/news/index.php?news=1 404
http://maqui.darkbolt.net/project/project/index.php 404
http://maqui.darkbolt.net/project/ro/index.php 404
http://maqui.darkbolt.net/project/who/index.php 404
Obviously none of these addresses exist. There are links on the page project/index.php pointing to, for example, /chat/index.php.
How can I resolve this problem in the stats? Is there really something wrong with the page? As far as I can see, all links on the page are working properly.
-
Hi there,
-
I reviewed several pages that we're reporting as duplicates and none of them are canonicalized. If you don't want to change the content in the source code so it's not 95% similar to the other pages, you'll need to add canonicals. The Help Hub has some good info on how to do this. You can also run a search in the community.
-
If we reported 404's in the initial crawl, it's because they existed at the time. The most recent crawl isn't showing any 404's so this shouldn't be an issue anymore.
-
Again, there are no 404's reported in this week's crawl for your campaign so there will be no 404's in the crawl diagnostic csv. That is where you'll want to go if this comes up again though.
-
If the page on your site is linked to from anywhere on your site, we will crawl and report on it up to the page crawl limit set for the campaign. We're not going to report data for non-existent links as that isn't physically possible. I hope this helps clear things up.
-
-
As I said in my first post and in the others:
1º: I had a duplicate content problem, detected on the first crawl. The duplicate content came from a parameter on all pages (?login). I solved it by making this parameter a link to the main page. This change resolves the duplicate content across the whole site.
2º: On the first crawl the system itself detected 404s. They do not exist. I haven't changed anything to resolve the 404s. If you look at the URLs in my screenshots, they are strange: the 404s are detected on URLs of the form index.php/project.php. This site doesn't use mod_rewrite or anything similar, so these URLs are impossible.
3º: I've tried to download the crawl diagnostics. One file doesn't have the referrer URL, and the other doesn't have the 404 entries.
I'm trying to find out why the system detects these pages when they don't exist and aren't linked from any other site. If something is wrong on my side, then it is something VERY wrong and I need to fix it right now. If not, I think the system is detecting something incorrect on my page, but I cannot understand why.
That worries me a lot, because this page and campaign are a test. It's my personal website and it has only a few pages and a few links. But if I can't understand the results for this page, how can I understand the results for my main site, which has more than 10,000 pages, multiple domains, social media, and more?
-
Gotcha.
We're not actually reporting 404's in this case. We're reporting that one page is a duplicate of another, which happens if the content in the source code is 95% similar or greater. The pages we're reporting as duplicates did exist at the time of the crawl, which is why they're showing up. If you made any changes after the crawl, there is a chance that the pages no longer exist, in which case the next crawl will not show them as duplicates. They will be reported as 404's though, so you'll still want to resolve that problem.
Outside of that, you can download the crawl diagnostic csv to get a list of referrer URL's. This is handy if you're ever unsure of how we got to a specific page. Hope this helps clear things up!
-
Yes, that's correct.
-
Hi there,
We're a bit lost in what you're trying to ask here. Based on what I've read, it sounds like you're saying that the weekly campaign crawl (not the Link Analysis data) is reporting 404's and you're not sure how/why that is happening. Is that correct?
-
Yes. Link Analysis shows pages such as comusys.php/comusys.php (which, obviously, doesn't exist).
In the crawl analysis CSV I cannot find these pages. You can also run the analysis in your own Open Site Explorer and see that these pages don't appear.
And now Link Analysis doesn't show any 404s.
I've attached some examples from my campaign. I cannot understand why these pages are being detected.
Any help will be appreciated.
Thanks,
Screenshot 455.png, Screenshot 456.png, Screenshot 457.png
-
So the link analysis is showing you that there are sites linking to pages that don't exist on your site?
-
The crawl test doesn't contain the 404 errors.
-
There is a column to the far right that should show the referring URL. Be sure to scroll until you find that, and then you will see where we found those URLs.
Clarification: this is in the crawl test report. I'm not sure why you're seeing 404s in the link analysis page.
-
I've requested the CSV and downloaded it, but I cannot see the page pointing to the error; I can only see the error itself.
This is the report:
| URL | Status |
| --- | --- |
| http://maqui.darkbolt.net/project/chat/index.php | 404 |
| http://maqui.darkbolt.net/project/docs/index.php | 404 |
| http://maqui.darkbolt.net/project/down/index.php | 404 |
| http://maqui.darkbolt.net/project/foto/index.php | 404 |
| http://maqui.darkbolt.net/project/news/index.php?news=1 | 404 |
| http://maqui.darkbolt.net/project/project/index.php | 404 |
| http://maqui.darkbolt.net/project/ro/index.php | 404 |
| http://maqui.darkbolt.net/project/who/index.php | 404 |

Obviously these are all erroneous: the "who" section isn't inside the "project" one. All the links are valid without the /project part.
I cannot understand why the system is reading these links; on the page, the links work fine.
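One possible explanation, offered as a sketch rather than a diagnosis: if the links on project/index.php are written as relative hrefs without a leading slash, a crawler resolves them against the /project/ directory, which would produce exactly the /project/chat/index.php pattern in the report. The href values below are assumptions, since the page's markup isn't shown in this thread; only the base URL comes from the posts above.

```python
from urllib.parse import urljoin

# Page on which the links appear (taken from the thread).
BASE = "http://maqui.darkbolt.net/project/index.php"

# Hypothetical href values; the real markup is not shown in the thread.
HREFS = ["chat/index.php", "/chat/index.php", "../chat/index.php"]


def resolve_all(base, hrefs):
    """Resolve each href against base using standard relative-URL rules."""
    return {href: urljoin(base, href) for href in hrefs}


resolved = resolve_all(BASE, HREFS)
for href, url in resolved.items():
    print(f"{href!r:24} -> {url}")
```

Only the first form lands under /project/; the root-relative and parent-relative forms resolve to /chat/index.php. Browsers apply the same resolution rule, so if the links work when clicked, the stray URLs may have another source, for example a different referring page.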
-
Have you downloaded the CSV of your crawl report? You can look at the column for the referring URL and see what page is pointing to the 404 error.
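If the export is large, the lookup can be scripted. The following is a minimal sketch that groups referring pages by the 404 URL they point to. The column names (`URL`, `HTTP Status Code`, `Referrer`) are assumptions, not confirmed names from the Moz export, so check them against the header row of the actual file. The sample rows are made up for illustration.

```python
import csv
import io

# Hypothetical column names; check the header row of the real export.
URL_COL = "URL"
STATUS_COL = "HTTP Status Code"
REFERRER_COL = "Referrer"


def referrers_for_status(csv_file, status="404"):
    """Map each URL with the given status to the set of pages that link to it."""
    found = {}
    for row in csv.DictReader(csv_file):
        if (row.get(STATUS_COL) or "").strip() == status:
            found.setdefault(row[URL_COL], set()).add(row.get(REFERRER_COL) or "")
    return found


# Made-up sample rows in the assumed layout, for illustration only.
SAMPLE = """URL,HTTP Status Code,Referrer
http://example.com/project/chat/index.php,404,http://example.com/project/index.php
http://example.com/project/index.php,200,http://example.com/
"""

result = referrers_for_status(io.StringIO(SAMPLE))
for url, refs in result.items():
    print(url, "<-", ", ".join(sorted(refs)))
```

For a real export, pass a file opened with `open(path, newline="", encoding="utf-8")` in place of the `io.StringIO` sample, where `path` is wherever the CSV was saved.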
-
There's no particular reason to leave index.php in the URIs; I have simply defined all links with their full name. For example, the root page of a section is defined as /project/index.php, not just /project/.
I can remove them or leave them; that isn't the problem. My problem is that the Moz stats are returning nonexistent addresses that aren't linked from anywhere, and I don't know why.
Also, I found a fault that made Moz detect duplicate pages (an erroneous link with only a parameter) and I've already corrected it. But I cannot find the reason for the nonexistent pages.
-
Is there a reason you leave index.php in your URL? It might be easier to strip it off using .htaccess so you can see more clearly what you are dealing with.