Repeated mysterious 404's from ancient site structure killing my rankings
-
Several years ago I changed my site from a Flash-based site to a blog-based WordPress site. After doing so I went from page 1 to page 30 for my relevant search terms. I have employed people to help me track down the problem, and I believe they have narrowed it to 404s being generated from some unknown internal source. For years I have been getting links like this...
......regularly showing in Webmaster Tools (this is from a top pages report from Moz, where hundreds more are shown).
When I do a Moz crawl of the site, none of these links show up. Therefore I have no way of finding the source of these links (WMT also does not show me the source, as it should).
We have completely cleared the site and rebuilt it. It is only a couple of weeks in, but the rebuild does not appear to have stopped them.
Does anyone have any way of helping me find the source of these mysterious 404's?
-
Why bother trying to clean anything up? If there are links out there to your domain and they're 404'ing, just 301 them to new pages on your site! Capture that link juice; don't let it run out.
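As a rough sketch of that idea, assuming an Apache server with mod_alias and a hand-built mapping of old URLs to new ones, something like this could generate the redirect lines. The function name and the example.com URLs are placeholders, not real pages on the site.

```python
from urllib.parse import urlsplit


def redirect_rules(mapping):
    """Return one Apache-style 'Redirect 301' line per old URL -> new URL pair.

    `mapping` is a hypothetical dict built by hand: old URL -> new URL.
    """
    rules = []
    for old_url, new_url in sorted(mapping.items()):
        # mod_alias matches on the URL path, so drop the scheme and host.
        old_path = urlsplit(old_url).path or "/"
        rules.append(f"Redirect 301 {old_path} {new_url}")
    return rules


if __name__ == "__main__":
    # Placeholder URLs for illustration only.
    example = {
        "http://example.com/old/post/": "http://example.com/new-post/",
    }
    for line in redirect_rules(example):
        print(line)
```

The generated lines would go in the server config or an .htaccess file; on a different server the rule syntax would need to change.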
-
Thanks for your reply EEE3
The ancient link says it is linked from another ancient page that no longer exists, and its last-crawled and first-detected dates are always the day it arrives.
e.g. last crawled 4/23/14, first detected 4/23/14
http://www.dfphotographer.com.au/brisbaneweddingphotographer/2011/03/st-kilda-wedding......
linked from
http://dfphotographer.com.au/brisbaneweddingphotographer/index.php/2011/03/st-kilda-wedding.....
and
http://dfphotographer.com.au/brisbaneweddingphotographer/2011/03/st-kilda-wedding....
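The three URLs above appear to differ only in the `www.` prefix and an `index.php` segment. Here is a minimal sketch, assuming that pattern holds for the other reported URLs, for collapsing such variants into one key so the list of 404s can be grouped. The function name and the example.com URLs are placeholders.

```python
from urllib.parse import urlsplit


def canonical_key(url):
    """Collapse www/non-www and /index.php/ variants of a URL into one key."""
    parts = urlsplit(url)
    host = parts.netloc.lower()
    if host.startswith("www."):
        host = host[4:]
    # Assumption: the legacy permalink form only adds one '/index.php/' segment.
    path = parts.path.replace("/index.php/", "/", 1)
    return host + path


if __name__ == "__main__":
    # Placeholder URLs for illustration only.
    variants = [
        "http://www.example.com/blog/2011/03/some-post/",
        "http://example.com/blog/index.php/2011/03/some-post/",
        "http://example.com/blog/2011/03/some-post/",
    ]
    print({canonical_key(u) for u in variants})
```

If many reported 404s collapse to the same key, that would suggest a single old permalink structure as the source rather than many unrelated links.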
-
Thanks for your response Keri,
As you are staff, can you please tell me where the top pages data comes from? Is it from crawling my site (like a Google spider), or is it sourced from Google or somewhere else? How often is that data refreshed?
In answer to your response, I have tried both Screaming Frog and Xenu, and my nice clean site structure is all they pick up. None of the ancient messy site structure appears.
I have been through the list of domains looking for an old sitemap or something similar that may have been scraped off my site, but after a long and arduous search I could not locate any reference to the links that show up in top pages and Webmaster Tools (which says they are linked from other ancient pages, which I will expand on below).
We have looked at all the usual suspects (old sitemaps, plugins) and rebuilt the site in case we missed anything that was lingering around. I have had really good people looking at it, and they continue to do so, but it just never seems to go away.
-
In Webmaster Tools, when you click on the 404 and the popup window appears, what is showing in the Linked from tab?
-
I edited the post so the URLs didn't run together. Still not perfect, but a little easier to read.
I'm not exactly sure where those links are coming from. You might run a tool like Xenu Link Sleuth or Screaming Frog on your site to see if there is an internal linking widget gone awry. The other thought I have is to look at Open Site Explorer to see what sites are linking to you and if they're linking to any of those pages.
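If you want to spot-check a single page by hand, here is a minimal sketch of what those crawlers do for one page: pull every `href` out of the HTML and flag the ones containing an old-structure marker. The marker string, function name, and sample HTML are assumptions for illustration.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag fed to the parser."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def find_legacy_links(html, marker):
    """Return the links in `html` whose URL contains `marker`."""
    collector = LinkCollector()
    collector.feed(html)
    return [link for link in collector.links if marker in link]


if __name__ == "__main__":
    # Placeholder HTML; in practice this would be a page's saved source.
    sample = (
        '<a href="/blog/index.php/2011/03/post/">old</a>'
        '<a href="/about/">about</a>'
    )
    print(find_legacy_links(sample, "/index.php/"))
```

This only looks at anchor tags in the HTML it is given, so links injected by scripts or listed in feeds and sitemaps would not be caught.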