OSE link report showing links to 404 pages on my site
-
I did a link analysis on this site mormonwiki.com. And many of the pages shown to be linked to were pages like these http://www.mormonwiki.com/wiki/index.php?title=Planning_a_trip_to_Rome_By_using_Movie_theatre_-_Your_five_Fun_Shows2052752
There happens to be thousands of them and these pages actually no longer exist but the links to them obviously still do. I am planning to proceed by disavowing these links to the pages that don't exist. Does anyone see any reason to not do this, or that doing this would be unnecessary?
Another issue is that Google is not really crawling this site, in WMT they are reporting to have not crawled a single URL on the site. Does anyone think the above issue would have something to do with this? And/or would you have any insight on how to remedy it?
-
The site does have and has had ranking issues since the first Penguin and has really had problems the last few months. And other than some minor things low quality links are really the only problem with the site.
-
Hi,
Adam is correct that the disavow tool should only be used if you think the links are causing you significant ranking problems. It's become quite common for people to disavow links without either a confirmed penalty or ranking issues, but those two factors were originally how Google recommended the tool be used.
What it sounds like has happened to your site with these bad pages is that spammers have created spam pages on the wiki then pointed links to those pages from elsewhere. It's a very common and old spam tactic, used on sites that allow UGC.
Those pages are now returning 404s, so technically the inbound links pointing to them should not hurt your website or cause a penalty. It's generally assumed the links to 404 pages (good or bad links) don't hurt or help. I disagree that they'll cause a "bad user experience" as it sounds like they have been built for spam purposes only - no one is going to try and visit these links.
If you believe these links are causing a ranking issue, the disavowal tool is certainly an option - I take it there's no chance you can negotiate these links' removal with the folks who built them? Removing links is always preferable to using disavowal also.
-
If you are seeing zero pages indexed and zero traffic from search then I would assume you have perhaps verified and subsequently are looking at data for the non-www version of the domain.
Double check that the site listed in WMT is www.mormonwiki.com and not mormonwiki.com. If you are looking at indexation and traffic data for the www version then there may be something else going on and unfortunately I wouldn't be able to diagnose the issue without looking at the WMT account.
Have your rankings been significantly affected? You would need to perform a fair amount of analysis before you can conclude that the site has been affected algorithmically. You would also need to be sure that any negative impact to rankings is a result of poor quality links and not something else, such as on-page factors.
Using the disavow should really be a last resort and only if it has been impossible to get troublesome links removed. As the warning from Google states, the disavow feature 'can potentially harm your site's performance' so I would not recommend using it until you have performed more in-depth analysis.
-
Right so if the pages no longer exist they need to be gotten rid of right? Most of these won't be removed by the webmasters and so they'll need to be disavowed right?
These pages were UGC and are essentially spam, and entirely irrelevant to anything on the site itself. So 301 redirects would not be wise or useful I don't think.
-
It hasn't received a manual action no. But that doesn't mean algorthimically the site isn't being affected.
So you're saying to not worry at all about these links?
They offer nothing in terms of value. If going to live pages they would be considered very spammy and completely irrelevant. But since these pages don't even exist you're saying it's unnecessary to bother with them at all?
I'm seeing the crawlability issue in WMT itself. The strange thing is that I know some pages have been indexed, we get most of our traffic organically from Google. But WMT shows zero pages indexed, zero traffic from search etc. The site has been verified as well.
-
I agree with Adam, if the links are natural then there is no need to disavow them.
However, if the links go to pages that no longer exist then it provides a poor user experience that can harm your rankings. Think of it like having dead links on your website. Have you set up 301 redirects for the pages that have become inactive? If not, set them up and make sure to redirect the pages to relevant areas of the website (no all to the homepage). Do this and the links should pass more juice and your website's performance should improve.
-
Are you performing a link analysis because the site received a manual action notification in WMT? If the site hasn't received a penalty then there is no need to use the disavow feature. As Google states:
'This is an advanced feature and should only be used with caution. If used incorrectly, this feature can potentially harm your site’s performance in Google’s search results. We recommend that you disavow backlinks only if you believe you have a considerable number of spammy, artificial, or low-quality links pointing to your site, and if you are confident that the links are causing issues for you. In most cases, Google can assess which links to trust without additional guidance, so most normal or typical sites will not need to use this tool.'
In terms of the crawlability of the site, where are you seeing WMT reporting to have not crawled a single page? A simple site: search of the mormonwiki.com domain returns about 65,600 results and I can't see any major issues that would prevent search engines from crawling the site. However, I would probably fix the issue with the robots.txt file. Currently, www.mormonwiki.com/robots.txt 301 redirects to www.mormonwiki.com/Robots.txt, which returns a 404 error.
Hope that helps.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
More internal links pointing to internal page vs homepage
I was looking at our GSC internal links section and I saw that we have 901 internal links going to our compare rates form and 890 going to our homepage. At the end of most of our content I add a call to action to our compare rates form. Is this SEO friendly or should I have more pointing to the homepage and less pointing to our compare rates page?
Intermediate & Advanced SEO | | LindsayE0 -
Best way to link to 1000 city landing pages from index page in a way that google follows/crawls these links (without building country pages)?
Currently we have direct links to the top 100 country and city landing pages on our index page of the root domain.
Intermediate & Advanced SEO | | lcourse
I would like to add in the index page for each country a link "more cities" which then loads dynamically (without reloading the page and without redirecting to another page) a list with links to all cities in this country.
I do not want to dillute "link juice" to my top 100 country and city landing pages on the index page.
I would still like google to be able to crawl and follow these links to cities that I load dynamically later. In this particular case typical site hiearchy of country pages with links to all cities is not an option. Any recommendations on how best to implement?0 -
meta robots no follow on page for paid links
Hi I have a page containing paid links. i would like to add no follow attribute to these links
Intermediate & Advanced SEO | | Kung_fu_Panda
but from technical reasons, i can only place meta robots no follow on page level (
is that enough for telling Google that the links in this page are paid and and to prevent Google penlizling the sites that the page link to? Thanks!0 -
SERPS showing wrong page
I have optimised a homepage for two keywords. I optimised this a few weeks ago and the page has been crawled by Google, also before this it was already reasonably well optimised for these terms. However, the homepage is not appearing in Google for these terms. Instead two other random pages on the site are appearing for these terms that have not been optimised for these keywords and have few mentions of the keywords on the pages!?? These pages have a lower DA and lower inbound links than the homepage. The homepage is showing for other lower competition keywords. Could anyone offer me some insight into this? The homepage content has been posted on other websites by a former SEO consultant - to a business directory for one? Could duplicate content be causing this problem?
Intermediate & Advanced SEO | | absolutely170 -
Outbound link to PDF vs outbound link to page
If you're trying to create a site which is an information hub, obviously linking out to authoritative sites is a good idea. However, does linking to a PDF have the same effect? e.g Linking to Google's SEO starter guide PDF, as opposed to linking to a google article on SEO. Thanks!
Intermediate & Advanced SEO | | underscorelive0 -
Disallowed Pages Still Showing Up in Google Index. What do we do?
We recently disallowed a wide variety of pages for www.udemy.com which we do not want google indexing (e.g., /tags or /lectures). Basically we don't want to spread our link juice around to all these pages that are never going to rank. We want to keep it focused on our core pages which are for our courses. We've added them as disallows in robots.txt, but after 2-3 weeks google is still showing them in it's index. When we lookup "site: udemy.com", for example, Google currently shows ~650,000 pages indexed... when really it should only be showing ~5,000 pages indexed. As another example, if you search for "site:udemy.com/tag", google shows 129,000 results. We've definitely added "/tag" into our robots.txt properly, so this should not be happening... Google showed be showing 0 results. Any ideas re: how we get Google to pay attention and re-index our site properly?
Intermediate & Advanced SEO | | udemy0 -
New web site - 404 and 301
Hello, I have spent a lot of times on the forum trying to make sure how to deal with my client situation. I will tell you my understanding of the strategy to apply and I would appreciate if you could tell me if the strategy will be okay. CONTEXT I am working on a project where our client wants to replace its current web site with a new one. The current web site has at least 100 000 pages. The new web site will replace all the existing pages of the current site. What I have heard for the strategy the client wants to adopt is to 404 each pages and to 301 redirect each page. Every page would be redirect to a page that make sense in the new web site. But after reading other answers and reading the following comment, I am starting to be concerned: '(4) Be careful with a massive number of 301s. I would not 301 100s of pages at once. There's some evidence Google may view this as aggressive PR sculpting and devalue those 301s. In that case, I'd 301 selectively (based on page authority and back-links) and 404 the rest.' I have also read about performance issue ... QUESTION So, if we suppose that we can manage to map each of the old site pages to a page in the new web site, is a problem to do it? Do you see a performance issue or devaluation potential issue? If it is a problem, please comment the strategy I might considere to suggest: Identify the pages for which I gain links From that group, identify the pages, that gives me most of my juice 301 redirect them and for the other, create a real great 404 ... Thanks ! Nancy
Intermediate & Advanced SEO | | EnigmaSolution0 -
Can I reduce number of on page links by just adding "no follow" tags to duplicate links
Our site works on templates and we essentially have a link pointing to the same place 3 times on most pages. The links are images not text. We are over 100 links on our on page attributes, and ranking fairly well for key SERPS our core pages are optimized for. I am thinking I should engage in some on-page link juice sculpting and add some "no follow" tags to 2 of the 3 repeated links. Although that being said the Moz's on page optimizer is not saying I have link cannibalization. Any thoughts guys? Hope this scenario makes sense.
Intermediate & Advanced SEO | | robertrRSwalters0