Repeated mysterious 404's from ancient site structure killing my rankings
-
Several years ago I changed my site structure to go from a flash based site to a blog based wordpress site. After doing so I went from page 1 to page 30 for my relevant search terms. I have employed people to help me track down the problem and I believe that they have narroed it to the existance of 404's being created from some unknown internal source. I have been for years getting links like this...
<colgroup><col width="792"></colgroup>
||
......regularly showing in webmaster tools, (this is from a top pages report from MOZ where there are hundreds also shown).
When I do a moz crawl of the site, none of these links show up. Therefore I have no way of finding the source of these links (they also do not show me the source in WMT as they should).
We have completely cleared the site and rebuilt it and although it is still only a couple of weeks in it still does not appear to have stopped them.
Does anyone have any way of helping me find the source of these mysterious 404's?
-
Why bother trying to clean anything up? If somewhere out there there are links to your domain, and they're 404'ing, just 301 them to new pages on your site! Capture that link juice, don't let it run out
-
Thanks for your reply EEE3
The ancient link says it is linked from another non existent ancient page that no longer exists and it is always first crawled and last detected on the day that it arrives.
eg. last crawled 4/23/14, first detected 4/23/14
http://www.dfphotographer.com.au/brisbaneweddingphotographer/2011/03/st-kilda-wedding......
linked from
http://dfphotographer.com.au/brisbaneweddingphotographer/index.php/2011/03/st-kilda-wedding.....
and
http://dfphotographer.com.au/brisbaneweddingphotographer/2011/03/st-kilda-wedding....
-
Thanks for your response Keri,
Being staff can you please tell me where does the top pages data come from? Is it from crawling my site (like a google spider) or is it sourced from google or somewhere else. How often is that data refreshed?
In answer to your response, I have tried both screaming frog and xenu and my nice clean site structure is all it picks up. None of the ancient messy site structure appears.
Have been through the list of domains looking for an old sitemap or something similar that may have been scraped off my site but after a long and arduous task could not locate any reference to any of these links that show up in top pages and webmaster tools (which says they are linked from other ancient pages - which I will expand on below)
We have looked at all the usual suspects - old sitemaps, plugins and rebuilt the site just in case we missed anything that was lingering around. I have had really good people looking at it who continue to do so it just never seems to go away.
-
In Webmaster Tools, when you click on the 404 and the popup window appears, what is showing in the Linked from tab?
-
I edited the post so the URLs didn't run together. Still not perfect, but a little easier to read.
I'm not exactly sure where those links are coming from. You might run a tool like Xenu Link Sleuth or Screaming Frog on your site to see if there is an internal linking widget gone awry. The other thought I have is to look at Open Site Explorer to see what sites are linking to you and if they're linking to any of those pages.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Unsolved Struggling to recouple backlinks that lead to 404 after a redesign launch
We had a site redesign launch a few months ago. Due to other priorities / a lot of things going on, there were a decent amount of URLs that 404ed. We implemented a drupal module search404 to redirect all of these to the homepage or a search results page that pulls search terms from the 404 page. Our goal is to recoup SEO value since we saw a notable dip in domain authority and amount of links after launch. We know a catch-all automated solution like this isn't ideal because it's better to 301 each page to its most relevant live page. For example, if CNN.com links to our Texas location, it's better to 301 to our new texas page than a homepage or search page. The problem is that since we implemented this catch all search404 module, it seems like tools like Moz Pro and Screaming Frog are not picking up the backlinks from external sites that 404 since they were "fixed" by the search404 module. Any ideas on solutions? We're trying to avoid tedious or risky solutions like undoing the module, which may bring back the 404s but cause some UX and SEO issues so we can run a SF crawl. Or going through every link in our link profile manually to check.
Link Explorer | | wchou0 -
My Site Facing indexing issues
Hello, I am facing indexing issues on one of my which is about budget bushcraft knife , Four months have been passed I built my site and published almost 7 article so far. But i am worried no any single keyword ranked on google till yet as i checked through MOZ site explorer. Can anyone guide me what should I need to do now. Thanks
Link Explorer | | james112230 -
REACT site and sitemap.xml
I have a REACT site www.nettheory.com. MOZ seems to only index the homepage but not all the internally linked pages, even the sitemap.xml is there. The reason I say that is the crawl result only shows the http and https version of the homepage, no other pages mentioned. I also noticed MOZ crawl results point out my content is very thin (<50 words). As a matter of fact, it has a lot more words if the JS runs correctly. Do we know if MOZ crawls REACT or JS based sites correctly?
Link Explorer | | NetTheory-Analysts0 -
Open Site Explorer not attributing page authority
Hi there, I have been having an issue with OSE attributing page authority to one of the sites I have been looking at. The homepage page authority is fine but any page after this does not have any according to OSE. I have been in touch with Moz and they have stated it's because there are no external or internal links pointing to any of these pages. However, I know that their are internal links pointing to these pages from the homepage in the nav. The domain is: https://healys.com/ If anyone has any ideas what may be going on that would be great. Thanks!
Link Explorer | | Alice-20160 -
Can't make sense of OSE and MOZ.
I checked this site on my OSE and it shows only 7 inbound links a DA of 14 and no social activity whatsoever yet when I check it on Majestic it shows ExternalBacklinks 71 ReferringDomains 20 Referring IPs 19 Referring Subnets 19 . And.. the page is #1 on google search for hypnotherapy michigan. How can it rate so poorly on MOZ, show so much more on majestic and rank so high on google? I thought MOZ data was supposed to be among the best and that top rated pages on google should also rate high on MOZ. here is the site http://hypnotherapy-detroit.com Additionally, when i look at the site, i notice that most of the backlinks are exchanged links and this site's link exchange page isn't even linked from the home page. Now I thought that kind of link exchange game was now discounted by Google. I don't get it. No social pages at all... low page rank... no new content.. so by MOZ standards there is no justification for this page to be anywhere near page one let alone at position #1. Can someone help me make sense of all this?
Link Explorer | | HypnoPro0 -
Learn how to use Open Site Explorer's Top Pages report to help inform your content marketing efforts. Get your Daily SEO Fix!
With the Top Pages report, you can see the pages on your site (and your competitors’) that are top performers. The pages are sorted by Page Authority - a prediction of how well a specific page will rank in search engines - and also metrics for linking root domains, inbound links, HTTP status and social shares. Be sure to watch today's Daily SEO Fix video tutorial to learn how to use Open Site Explorer's Top Pages report to analyze the competitions' content marketing efforts and to inform your own. This video is part of The Moz Daily SEO Fix tutorial series--Moz tool tips and tricks in under 2 minutes. To watch all of our videos so far, and to subscribe to future ones, make sure to visit the Daily SEO Fix channel on YouTube.
Link Explorer | | kellyjcoop3 -
Is there some way to tell the Moz crawler not to crawl URL's with particular dynamic tags such as "?redirect-to:http//" ?
We are encountering an issue where the crawler is finding a ton of pages from our wordpress login url that has this dynamic tag in it to kinds of different blog entries. It's madness. I can't figure out what is causing these URLs to generate to be crawled in the first place! Does this sound familiar to anyone out there, any constructive suggestions? Robots text or maybe meta robots tags that would resolve this crawl issue?
Link Explorer | | RegistrarCorp0 -
Open Site Explorer
I have entered my site (patioenjoyment.com) a few different times into the OSE and it never seems to find it, MOZ has found it because it has done the crawl test for me a few times, found some issues and I have been working on those. But I assume that if it is able crawl my site it should find it to run the OSE. It has come back with "**It looks like we haven't discovered established link data for this URL yet." everytime. ** Thanks, Julie
Link Explorer | | patioenjoyment0