Why does Moz Pro detect nonexistent links?
-
I have a campaign in Moz Pro for my personal webpage, for testing purposes and also a bit of learning. But I have a question:
Under Link -> Link analysis I can see this:
| URL | Status |
| --- | --- |
| http://maqui.darkbolt.net/project/chat/index.php | 404 |
| http://maqui.darkbolt.net/project/docs/index.php | 404 |
| http://maqui.darkbolt.net/project/down/index.php | 404 |
| http://maqui.darkbolt.net/project/foto/index.php | 404 |
| http://maqui.darkbolt.net/project/news/index.php?news=1 | 404 |
| http://maqui.darkbolt.net/project/project/index.php | 404 |
| http://maqui.darkbolt.net/project/ro/index.php | 404 |
| http://maqui.darkbolt.net/project/who/index.php | 404 |

Obviously, none of these addresses exist. The page project/index.php has links to, for example, /chat/index.php. How can I resolve this problem in the stats? Is something actually wrong on the page? As far as I can see, all links on the page work properly.
-
Hi there,
-
I reviewed several pages that we're reporting as duplicates and none of them are canonicalized. If you don't want to change the content in the source code so it's not 95% similar to the other pages, you'll need to add canonicals. The Help Hub has some good info on how to do this. You can also run a search in the community.
-
If we reported 404's in the initial crawl, it's because they existed at the time. The most recent crawl isn't showing any 404's so this shouldn't be an issue anymore.
-
Again, there are no 404's reported in this week's crawl for your campaign so there will be no 404's in the crawl diagnostic csv. That is where you'll want to go if this comes up again though.
-
If the page on your site is linked to from anywhere on your site, we will crawl and report on it up to the page crawl limit set for the campaign. We're not going to report data for non-existent links as that isn't physically possible. I hope this helps clear things up.
-
-
As I said in my first post and in others:
1. I had a duplicate content error, detected on the first crawl. The duplicate content came from a parameter present on all pages (?login). I fixed it by making this parameter a link to the main page. This change resolves the duplicate content across the whole site.
2. On the first crawl the system itself detected 404s. These pages do not exist. I haven't changed anything to resolve the 404s. If you look at the URLs in my screenshots, they are strange: the 404s are detected on URLs of the form index.php/project.php. This site doesn't use mod_rewrite or anything similar, so these URLs are impossible.
3. I've tried downloading the crawl diagnostics. One file doesn't have the referrer URL; the other doesn't have the 404 entries.
I'm trying to find out why the system detects these pages when they don't exist and aren't linked from any other site. If something is wrong on my side, then it is something VERY wrong and I need to fix it right now. If not, I think the system is detecting something incorrect on my page, but I can't understand why.
That worries me a lot, because this page and campaign are a test. It's my personal site, with only a few pages and a few links. But if I can't understand the results for this site, how can I understand and read the results for my main site, which has more than 10,000 pages, multiple domains, social media, and more?
-
Gotcha.
We're not actually reporting 404s in this case. We're reporting that one page is a duplicate of another, which happens when the content in the source code is 95% similar or greater. The pages we're reporting as duplicates did exist at the time of the crawl, which is why they're showing up. If you made any changes after the crawl, there is a chance that the pages no longer exist, in which case the next crawl will not show them as duplicates. They will be reported as 404s though, so you'll still want to resolve that problem.
Outside of that, you can download the crawl diagnostic csv to get a list of referrer URL's. This is handy if you're ever unsure of how we got to a specific page. Hope this helps clear things up!
-
Yes, that's correct.
-
Hi there,
We're a bit lost in what you're trying to ask here. Based on what I've read, it sounds like you're saying that the weekly campaign crawl (not the Link Analysis data) is reporting 404's and you're not sure how/why that is happening. Is that correct?
-
Yes. Link analysis shows pages such as comusys.php/comusys.php (which obviously don't exist).
In the crawl analysis CSV I can't find these pages. You can also run the analysis yourself in Open Site Explorer and see that these pages don't appear.
And now Link analysis doesn't show any 404s.
I've attached some examples from my campaign. I can't understand why these are being detected.
Any help will be appreciated.
Thanks,
Attachments: Screenshot 455.png, Screenshot 456.png, Screenshot 457.png
-
So the link analysis is showing you that there are sites linking to pages that don't exist on your site?
-
The crawl test doesn't contain the 404 errors.
-
There is a column to the far right that should show the referring URL. Be sure to scroll until you find that, and then you will see where we found those URLs.
Clarification: this is in the crawl test report. I'm not sure why you're seeing 404s in the link analysis page.
-
I requested the CSV and downloaded it, but I can't see which page points to the error; I can only see the error itself.
This is the report:
| URL | Status |
| --- | --- |
| http://maqui.darkbolt.net/project/chat/index.php | 404 |
| http://maqui.darkbolt.net/project/docs/index.php | 404 |
| http://maqui.darkbolt.net/project/down/index.php | 404 |
| http://maqui.darkbolt.net/project/foto/index.php | 404 |
| http://maqui.darkbolt.net/project/news/index.php?news=1 | 404 |
| http://maqui.darkbolt.net/project/project/index.php | 404 |
| http://maqui.darkbolt.net/project/ro/index.php | 404 |
| http://maqui.darkbolt.net/project/who/index.php | 404 |

Obviously these are all erroneous: the "who" section isn't inside the "project" one. All the links are valid without the /project part.
I can't understand why the system is reading these links; on the page, the links work fine.
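One possible explanation, not confirmed anywhere in this thread: if the href values in the markup are written relative to the current page (chat/index.php rather than /chat/index.php), a crawler resolving them against /project/index.php would produce exactly the /project/... addresses listed above. The sketch below uses Python's standard `urljoin` to show the difference. The href values are hypothetical, since the actual markup isn't shown here, and `resolve` is just an illustrative helper name.

```python
from urllib.parse import urljoin

# Page named in the thread as the one containing the links.
BASE = "http://maqui.darkbolt.net/project/index.php"


def resolve(href, base=BASE):
    """Resolve an href against the URL of the page it appears on."""
    return urljoin(base, href)


# Hypothetical href values; the real markup is not shown in the thread.
for href in ("chat/index.php", "/chat/index.php", "../chat/index.php"):
    print(f"{href!r:22} -> {resolve(href)}")
```

Only the first form lands under /project/; the root-relative and `../` forms both resolve to /chat/index.php. Viewing the page source and comparing the raw href values against these three forms would confirm or rule out this explanation.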
-
Have you downloaded the CSV of your crawl report? You can look at the column for the referring URL and see what page is pointing to the 404 error.
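If the export is large, a small script can pull out just the 404 rows together with their referrers. This is only a sketch: the column names (`URL`, `HTTP Status Code`, `Referrer`) and the sample rows are assumptions made for illustration, so check the header row of your own CSV and adjust the arguments to match.

```python
import csv
import io

# Hypothetical sample data; the real export's columns and rows may differ.
SAMPLE_CSV = """URL,HTTP Status Code,Referrer
http://maqui.darkbolt.net/project/index.php,200,http://maqui.darkbolt.net/
http://maqui.darkbolt.net/project/chat/index.php,404,http://maqui.darkbolt.net/project/index.php
"""


def find_404_referrers(csv_file, url_col="URL",
                       status_col="HTTP Status Code", ref_col="Referrer"):
    """Return (url, referrer) pairs for every row whose status is 404."""
    reader = csv.DictReader(csv_file)
    return [
        (row[url_col], row[ref_col])
        for row in reader
        if row[status_col].strip() == "404"
    ]


for url, referrer in find_404_referrers(io.StringIO(SAMPLE_CSV)):
    print(f"{url} <- linked from {referrer}")
```

For a real file, pass `open("crawl.csv", newline="")` (file name hypothetical) instead of the `io.StringIO` sample.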
-
There's no particular reason to leave index.php in the URIs; I have simply defined all links with their full name. For example, the root page of a section is defined as /project/index.php, not just /project/.
I can remove them or leave them; that isn't the problem. My problem is that the Moz stats return nonexistent addresses that aren't linked anywhere, and I don't know why.
Also, I found an error that made Moz detect duplicate pages (an erroneous link with only a parameter), and I've already corrected it. But I can't find the reason for the nonexistent pages.
-
Is there a reason you leave index.php in your URLs? It might be easier to strip it off using .htaccess so you can see more clearly what you are dealing with.