Repeated mysterious 404's from ancient site structure killing my rankings
-
Several years ago I changed my site structure to go from a flash based site to a blog based wordpress site. After doing so I went from page 1 to page 30 for my relevant search terms. I have employed people to help me track down the problem and I believe that they have narroed it to the existance of 404's being created from some unknown internal source. I have been for years getting links like this...
<colgroup><col width="792"></colgroup>
||
......regularly showing in webmaster tools, (this is from a top pages report from MOZ where there are hundreds also shown).
When I do a moz crawl of the site, none of these links show up. Therefore I have no way of finding the source of these links (they also do not show me the source in WMT as they should).
We have completely cleared the site and rebuilt it and although it is still only a couple of weeks in it still does not appear to have stopped them.
Does anyone have any way of helping me find the source of these mysterious 404's?
-
Why bother trying to clean anything up? If somewhere out there there are links to your domain, and they're 404'ing, just 301 them to new pages on your site! Capture that link juice, don't let it run out
-
Thanks for your reply EEE3
The ancient link says it is linked from another non existent ancient page that no longer exists and it is always first crawled and last detected on the day that it arrives.
eg. last crawled 4/23/14, first detected 4/23/14
http://www.dfphotographer.com.au/brisbaneweddingphotographer/2011/03/st-kilda-wedding......
linked from
http://dfphotographer.com.au/brisbaneweddingphotographer/index.php/2011/03/st-kilda-wedding.....
and
http://dfphotographer.com.au/brisbaneweddingphotographer/2011/03/st-kilda-wedding....
-
Thanks for your response Keri,
Being staff can you please tell me where does the top pages data come from? Is it from crawling my site (like a google spider) or is it sourced from google or somewhere else. How often is that data refreshed?
In answer to your response, I have tried both screaming frog and xenu and my nice clean site structure is all it picks up. None of the ancient messy site structure appears.
Have been through the list of domains looking for an old sitemap or something similar that may have been scraped off my site but after a long and arduous task could not locate any reference to any of these links that show up in top pages and webmaster tools (which says they are linked from other ancient pages - which I will expand on below)
We have looked at all the usual suspects - old sitemaps, plugins and rebuilt the site just in case we missed anything that was lingering around. I have had really good people looking at it who continue to do so it just never seems to go away.
-
In Webmaster Tools, when you click on the 404 and the popup window appears, what is showing in the Linked from tab?
-
I edited the post so the URLs didn't run together. Still not perfect, but a little easier to read.
I'm not exactly sure where those links are coming from. You might run a tool like Xenu Link Sleuth or Screaming Frog on your site to see if there is an internal linking widget gone awry. The other thought I have is to look at Open Site Explorer to see what sites are linking to you and if they're linking to any of those pages.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Is Moz's backlink checker.... just... not good?
Hey everyone! Can somebody explain to me why this keeps happening: Whenever I'm trying to backlink my competitors, I typically use RavenTools. Every time, without fail, if I put that same URL into Moz's Open Site Explorer - It gives me about 1/20th of what RavenTools shows me. Sometimes it literally comes up with 2 or 3 links total. Unfortunately, RavenTools has a cap on how many backlink checks you can perform in a month - so once I've used those up, I have to start using OSE... But, it just doesn't work. Does anyone else have this issue? Thanks!
Link Explorer | | TaylorRHawkins1 -
Open Site Explore page titles show as "No Title"
This has been asked many times but I cannot find an answer. On Open Site Explorer, most URLs I enter show as "No title" which means you have to hover over the URL to see which page is being referred to. I know you'd want an example, so here's your own site Moz.com 🙂 CWrkxZQ
Link Explorer | | clifra0 -
Inbound links that does not appear in open site explorer
There are several links to my website that are old, and are not indexed to locate in Open Site Explorer. Which may be the problem from appearing? This is shown in webmaster tools
Link Explorer | | hectordlux0 -
Open Site Explorer Messge - Mysite.com redirects to www.mysite.com
Hello All, When I plug in my site's URL http://mysite.com ( Not my actual URL of-course) into the OpenSiteExplorer tool I get the following message: "You entered the URL http://Mysite.com which redirects to http://www.Mysite.com/.
Link Explorer | | jjimen03
Because it's likely to have more accurate metrics, we're showing data for the redirected URL instead.
Click here to analyze http://Mysite.com instead?" Not sure why I am getting this message. On my hosting server, I have defined my preferred domain to be the non-www. version. I have also used a Canonical Tag on Mysite.com/index.html file ( ). In addition, on Google Search Console ( Webmaster Tools ) , I have added both the www. and non-www. properties, where on both property settings, I have defined the non-www. version as my preferred one. Also, on my server, I have not implemented any 301 redirects at all. So my questions are: Why am I receiving this message when plugging in my non-www. URL version of my website into OpenSiteExplorer? Is my DA or PA possibly being split up between the two versions as a result? Should I be concerned? How can I fix this? Thanks in advance.0 -
Open Site Explorer is finding old html Files that havn't been on my site in two years... even after a 301 Redirect. HELP!
Hello!
Link Explorer | | morganlindsaycole
My problem started when I became aware that when I checked my backlinks for the past two years, it states that no backlinks have been found. When I ran a site analysis on SEMrush - No backlinks are found on the URL, or Domain. There are 7 Backlinks on the Root Domain and those were configured in 2012. I made a second domain www.columbusweddingphotographersreviews.comwhere I linked to my domain at www.morganlindsayphotography.com so I could test that google had crawled both websites and after, still no backlink was found. I have also been published on a dozen or so wedding websites that has linked to my website where they are follow links and still nothing. (http://www.brendasweddingblog.com/blogs/2015/2/23/an-elegant-fall-wedding-in-ohio-with-morgan-lindsay-photography) **Website Background- **
In 2012 I had two separate websites - One for Seniors that was an HTML website I build in Dreamweaver at www.morganlindsayphotography/seniors and another for Wedding Clients found at www.morganlindsayphotography.com/Wedding - (wordpress) I had a Splash page wish was found atwww.morganlindsayphotography.com. Two years ago when I became aware splash pages were frowned upon in Google, I combined the two websites and stayed with the Wordpress which was www.morganlindsayphotography.com/Wedding
Because I did not want users to have to go to www.morganlindsayphotography.com/Wedding to view my url, Godaddy moved my wordpress site from thewww.morganlindsyphotography.com/Wedding directory towww.morganlindsayphotography.com When I ran the Open Site Explorer with Moz I found after runningwww.morganlindsayphotography.com the TOP pages on this domain according to Page Authority are old HTML files from my senior website, as well as old Posts from when my wordpress site was found atwww.moragnlindsayphotography.com/Weddings
No current pots or pages are showing up besideswww.morganlindsyphotography.com I do run a cache management system to speed up my system and recently cleaned out my .htcacess folder and still had no luck. This is difficulty something **Last night I made a 301 Redirect in my htaccess for all the old links pointing to the new links as best as I could. My htacess folder looks like this.. BEGIN WordPress <ifmodule mod_rewrite.c="">RewriteEngine On
RewriteBase /
RewriteRule ^index.php$ - [L]
RewriteCond %{REQUEST_FILENAME} !-f
RewriteCond %{REQUEST_FILENAME} !-d
RewriteRule . /index.php [L]</ifmodule> END WordPress Permanent URL redirect - generated by www.rapidtables.com Redirect 301 /Wedding http://www.morganlindsayphotography.com/ Permanent URL redirect - generated by www.rapidtables.com Redirect 301 /Wedding/ http://www.morganlindsayphotography.com/ Permanent URL redirect - generated by www.rapidtables.com Redirect 301 /about.html http://www.morganlindsayphotography.com/about-morgan-lindsay/ Permanent URL redirect - generated by www.rapidtables.com Redirect 301 /app.html http://www.morganlindsayphotography.com/blog/ Permanent URL redirect - generated by www.rapidtables.com Redirect 301 /experience.html http://www.morganlindsayphotography.com/senior-sessions/ Permanent URL redirect - generated by www.rapidtables.com Redirect 301 /index.html http://www.morganlindsayphotography.com/ Permanent URL redirect - generated by www.rapidtables.com Redirect 301 /senior.html http://www.morganlindsayphotography.com/ohio-senior-photographer/ Permanent URL redirect - generated by www.rapidtables.com Redirect 301 /seniorsconstruction.html http://www.morganlindsayphotography.com/ohio-senior-photographer/ Permanent URL redirect - generated by www.rapidtables.com Redirect 301 /Wedding/2012/06/22/brittany-reis-jason-mcclaflin-tiffin-ohio-wedding/ http://www.morganlindsayphotography.com/holy-family-church-columbus-wedding/ After I ran the open site moz explorer and the www.morganlindsayphotography/Wedding was still there..0 -
Page Authority dropped to 1 for subdirectory in my site
Hey, one of my sites https://www.automationanywhere.com/testing has had all it's pages drop to a page authority of 1 in Moz and the campaign doesnt appear to be indexing any pages from this directory. We launched a new site in this directory on the 12th and havent been getting any moz love since. The pages are indexable and followable, and are being indexed by google and others. The site on the root domain has been unaffected. Please help me get my moz campaign back on track. Thanks!
Link Explorer | | aatethys0 -
Facebook "likes" not showing up in Open Site Explorer
Hi All: I'm dealing with a company that has 299 Facebook likes, but Open Site Explorer only shows three. I checked the NAP on Facebook and it's correct. Anyone know why this might be happening? I'm worried if Open Site Explorer is not seeing the likes, Google might not as well, for whatever reason. Thanks! Wes
Link Explorer | | wrconard0 -
Open Site Explorer not detecting linking domain on my site
So I was using Open Site Explorer to analyze my domain authority and page authority as can be seen here. My site has been covered by some press and in the article it actually links my site. Some of the articles can be seen on, here, here, and here. Any idea why none of these articles are detected as a linking domain to my site? Is there something that I am doing wrong?
Link Explorer | | herlamba1