Repeated mysterious 404's from ancient site structure killing my rankings
-
Several years ago I changed my site structure to go from a flash based site to a blog based wordpress site. After doing so I went from page 1 to page 30 for my relevant search terms. I have employed people to help me track down the problem and I believe that they have narroed it to the existance of 404's being created from some unknown internal source. I have been for years getting links like this...
<colgroup><col width="792"></colgroup>
||
......regularly showing in webmaster tools, (this is from a top pages report from MOZ where there are hundreds also shown).
When I do a moz crawl of the site, none of these links show up. Therefore I have no way of finding the source of these links (they also do not show me the source in WMT as they should).
We have completely cleared the site and rebuilt it and although it is still only a couple of weeks in it still does not appear to have stopped them.
Does anyone have any way of helping me find the source of these mysterious 404's?
-
Why bother trying to clean anything up? If somewhere out there there are links to your domain, and they're 404'ing, just 301 them to new pages on your site! Capture that link juice, don't let it run out
-
Thanks for your reply EEE3
The ancient link says it is linked from another non existent ancient page that no longer exists and it is always first crawled and last detected on the day that it arrives.
eg. last crawled 4/23/14, first detected 4/23/14
http://www.dfphotographer.com.au/brisbaneweddingphotographer/2011/03/st-kilda-wedding......
linked from
http://dfphotographer.com.au/brisbaneweddingphotographer/index.php/2011/03/st-kilda-wedding.....
and
http://dfphotographer.com.au/brisbaneweddingphotographer/2011/03/st-kilda-wedding....
-
Thanks for your response Keri,
Being staff can you please tell me where does the top pages data come from? Is it from crawling my site (like a google spider) or is it sourced from google or somewhere else. How often is that data refreshed?
In answer to your response, I have tried both screaming frog and xenu and my nice clean site structure is all it picks up. None of the ancient messy site structure appears.
Have been through the list of domains looking for an old sitemap or something similar that may have been scraped off my site but after a long and arduous task could not locate any reference to any of these links that show up in top pages and webmaster tools (which says they are linked from other ancient pages - which I will expand on below)
We have looked at all the usual suspects - old sitemaps, plugins and rebuilt the site just in case we missed anything that was lingering around. I have had really good people looking at it who continue to do so it just never seems to go away.
-
In Webmaster Tools, when you click on the 404 and the popup window appears, what is showing in the Linked from tab?
-
I edited the post so the URLs didn't run together. Still not perfect, but a little easier to read.
I'm not exactly sure where those links are coming from. You might run a tool like Xenu Link Sleuth or Screaming Frog on your site to see if there is an internal linking widget gone awry. The other thought I have is to look at Open Site Explorer to see what sites are linking to you and if they're linking to any of those pages.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Moz was unable to crawl your site on Jun 22, 2020\. We were unable to access your site due to a page timeout on your robots.txt, which prevented us from crawling the rest of your site.
Site: www.kpmg.us Getting robots.txt timeout fail since 02/29/20. We've checked our server logs and see no errors. Went through all the steps of the "Troubleshooter". Updated robots.txt to allow rogerbot full access: User-agent: rogerbot
Link Explorer | | KPMG-Search-Social
Disallow: Any ideas how to get roger to crawl my site????1 -
MOZ can't find internal links on my website while Google search console finds thousands of them
When using open site explorer MOZ finds 0 internal links on https://www.glassesgallery.com while Google search console finds thousands of internal links. What could the issue be? Cr8Wk14 oGOf1gb
Link Explorer | | NathanPetralia0 -
Why can i see domain links for competitors form social media sites such as pinterest and youtube but not for myself??
When doing competitor analyse i can see that they are getting domain links form youtube and pinterest yet when i analyse my own links i do not have these. My Google analytics shows me that iam getting traffic from these sources so why is it not showing up for me? and is this affecting my moz rank?
Link Explorer | | RobAdair10 -
Why Aren't All My Backlinks Appearing in Open Site Explorer?
Cross-referenced on Google Webmaster Tools and it looks like MOZ isn't pulling all my links. Any reason why? Thanks in advance!
Link Explorer | | Dental-Care-Allilance0 -
Open Site Explorer metrics report says we have 9 linking root domains, but way more than 7 are listen. What gives?
When I run a report on my website in OSE (SelectAccount" metrics tells me we have 7 "established" root domains and 9 total links. But the detailed report below has many dozens of links. What is metrics telling me?
Link Explorer | | SelectAccount0 -
Does OSE crawl our site often? Or do we have an other problem? (internal links)
I am wondering if i can find out when OpenSiteExplorer crawled our entire website for the last time.
Link Explorer | | wilcoXXL
A few months ago we added some internal textlinks to all of our productpages. Those are links to the brandpage of that particular product. Some of our brands do have up to 1500 products, so there should be at least 1500 internal links pointing to that brandpage. But OSE still gives me a wrong count. It says there is only 1 internal link pointing to that brandpage.
Is it because OSE never crawled our site again? Or am i missing something here?(maybe too many internal links to one specific brandpage is not okay?) I'll hope you guys can help me out with this one.0 -
Open Site Explorer Not Showing Full Pro Version
Um, could someone please let me know why OpenSiteExplorer is treating me like I don't pay $200 for a Moz Pro Membership? See screenshot attached VdCrxzY
Link Explorer | | RickyShockley0 -
Removing the clutter of site-wide links
I have a multi-part question with regard to the moz link index and some presentation suggestions. Firstly I would be interested to know how the link index treats site-wide links with regard to metrics such as DA, and PA. We all know that it is highly likely that SE's are unlikely to pass full link value across from sitewide links, and therefore it would make sense for Moz values to account for this as well - if they do not already. One annoying thing that also relates to sitewides is that they tend to clutter the much of the information presentation in a few of the tools (you can't see wood for trees as it were). This is most prominent in the "Just Discovered" page - if you have a sitewides on a large site, you can often find that this screen is just totally filled with these links as they are found. It would be very useful to be able to filter these out, as they are of little interest - currently I can't see a way of filtering them out. A further value where they create to much noise is the 'Total Links' value. Where sitewides are included in this value, the value actually becomes pretty meaningless as you can find that the majority of that value is sitewides. It would therefore be useful if there was another value for 'Total Links - Excluding Sitewides' where maybe value of 1 was just added to the count for a site wide
Link Explorer | | James770