404 Errors generating in WP
-
Our crawl reports are generating back several 404 errors for pages with urls that look like:
/category/consulting/page/5/
The tag changes, the page number changes, but the result is always the same: A big glaring 404. Our sites are built on WordPress Multi-site, and I am fairly certain this issue is on the WP end, but I can't figure out why it is generating pages out to infinity, essentially, from the tags and categories.
It is worse on some sites than others, but is happening across the board (my initial concern was that it might be a theme issue, but that does not seem to be the case).
If anyone has run into this issue and knows a fix, you're insight would be greatly appreciated.
Thanks!
-
This is the exact answer that you want if you have any issues running into problems once you have isolated them with screaming frog you can easily pick them out and redirect them either through your host or through a plug-in depending upon where you're hosted and how your site is being managed.
-
Here's what to do;
- Crawl the site with Screaming Frog (you can do up to 500 URLs with the free version).
- Look for the suspect pages in your crawl list.
- Click once on the page in the list
- Below, click "To Links" - and this box will show you the page(s) linking to the bad 404 page.
- View the "From" page (which links to the bad page) in your browser.
- Look at the source code for that page - do a "find" and paste in the bad page's URL. This will show you where it's linking from.
- Fix the spot in your theme or template where it is linking to pages that do not exist.
-
I want to first say I apologize for the wording on my first answer I was using voice recognition and it made a few mistakes. I also want to say I agree with what Dan has said he really knows WordPress and I would take
his advice the fact that he's endorsed Lynn's answer makes me believe that it is the correct one and screaming frog spider is one of the best tools in the world for any type of website and can help you create 301's easily to get rid of your 404's on the paid version
here is a great Screaming frog spider guide By Seer
http://www.seerinteractive.com/blog/screaming-frog-guide
I hope this helps,
Thomas
-
Hi Melissa, you've received some great responses. Did any of them help you resolve your issue?
-
I second Lynn's answer. You need to find where the link is coming from to begin with. Could also use Screaming Frog SEO Spider or Webmaster Tools - they will all get you the same thing. Find out where the bad URLs are linked from, and then you can narrow down the source of bad code or whatever it may be.
-Dan
-
Yoast's SEO plug-in for WordPress
Will eliminate the /1/ /2/ /3/ page effect.
http://yoast.com/wordpress/seo/
I hope this is of help. The running multisite through a non-subdomain set up on expecting correct?
-
Hi Melissa,
It is actually pretty common for wp based sites to spin out repeated 404 errors (or at least have the potential to do this). It could be a theme issue, a plugin issue or a setup issue or a combination of all 3 depending on the site.
First thing to do is figure out where the link is coming from. Download your moz error report in csv and filter by 404's. On the left you will have the 404 page and on the far right you will have the referring page. Go to the referring page and view the source and try to find that link. Keep an eye out for relative links from deep structure pages also (so you have page /category/page/5 and the link is a relative href="/" or href="/6" which can have the effect of spawning repeated 404s like /category/page/5/6 etc etc).
Once you have found the link, you should be able to identify the template part which is producing them and then act accordingly to either edit the template or adjust the settings/plugin so that they stop. If you can give us a real example of a 404 and its linking page we should be able to give you more specific info.
-
I seriously doubt the issue is with WP. perhaps a plugin might be conflicting with how the page is rendered. Make sure you have a database and file backup. You could FTP in and rename the plugins folder to _plugins ( this will disable all plugins ) and check if the issue persists. Rename the folder back to plugins once you finish testing it.
NOTE : with some plugins you will have to manually go in a enable and configure them after you do this.
-
Hi 404 are extremely common. I would build 404 page and I would also use this plug-in here but make sure that you understand that this is by far no way perfect. You must check the logs consistently and make sure their note for forest mistaken for instance I had used it once and it had four or my feed back my homepage
http://wordpress.org/plugins/redirection/
However this is a very good plug-in that has been progressively getting better. It is recommended by some of the best people in WP I strongly recommend using it and I hope I've helped you if you need anymore help please let me know.
sincerely,
Thomas
Yes this will also work if you're using Nginx a much faster web server type that Apache or even lightspeed. However it is very hard to create 301 anyone use a Nginx web server environment. However this tool is able to do so and then most Managed WordPress hosts are the ones that are actually using Nginx with this Internet these days. Or managed WordPress hosts are able to then rewrite the correct 301 with this redirection plug-in is able to create 301 100% then most of the time people running Nginx host make the corrections after the redirection have to spend time.
The specialty of this redirection plug-in the is catching 404's and redirecting them to a page that it should be redirected to logically for instance if I was going to have /about/ 404 and then /about- us/ the plug-in easily figure that out that should be the correct redirect and then mixed the correction automatically you can also turn off this mode once you believe 404's are no longer an issue.
You can also use an Apache only mode which depending on your Web server might me your needs better depending on what type of Webster rerunning
I think this is an outstanding plug-in and definitely believe that it is a huge help if you have a any amount of 404s.
I hope I've been of help,
Thomas
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Why would someone go to same 404 page over and over?
Good morning, I've been using the redirection plugin on my wordpress site and noticed i have multiple IP addresses going to the same folder on my site - like "mydomain.com/folder-name/". The "folder-name" is obviously not anything remotely like any folder or file name I have on my domain - so it's obviously spammy in nature. And, there are multiple IP addresses going to this same URL address every 3 hours on the dot, so it's appears automated. Is this something to be concerned about? Should I "do" anything? Thanks in advance for reading and replying!
Moz Pro | | mlm120 -
Errors in my coding how significant is this regarding rankings ?
I posted a question on here yesterday about the homepage asking for advice regarding the content and then was told by two people were very helpful bbut moved over comment not on content but to say taht the major problem was that the coding on my website basically has too many errors which would result in me receiving lower rankings in the search engines. I realise this website is old-fashioned Dreamweaver template which was constructed several years ago which I've updated and I'm certainly not a professional, but I watch my Google analytics and there doesn't seem to be any significant change in the stats from this time last year. This is the site http://www.endeavourcottage.co.uk/ I realise the site is old format and has been around for several years it's just from customer feedback they seem to think it looks okay for the products old cottages but I guess technically it's not the best now. I have run a test using Silktide Nibbler - a free online service that gives you a good complete overview of your website with an overall score. And it did give my website an overall good score but did point out errors in the coding but when I checked some of my competitors near the top of Google for the short tail keywords some of them also have errors in their coding, very similar to my own error score.. I then went to Google Webmaster tools and there were no warning messages. So the big question is how important are these errors scores when it appears that most of the top competition also are in the same situation? I think it's quite possible I could do with a redesign using responsive design Best Alan
Moz Pro | | WhitbyHolidayCottages0 -
503 Error or 200 OK??
So, in a Moz crawl and a Screaming From crawl, I'm getting some 503 Service Unavailable responses on the some pages. So I go to the pages in question, and the Moz bar is showing a 200 OK. The SEOBook http status checker (http://tools.seobook.com/server-header-checker/) also shows a 200 OK. What gives? The only reason I'm looking at this is because rankings plummeted a couple of weeks ago. Thanks! UPDATE So, I decided to use the mozbar to set the user agent as Googlebot and when I tried to access the pages in question I receive this message. I don't think this is an issue... anyone else have much experience here? Your access to this site has been limited Your access to this service has been temporarily limited. Please try again in a few minutes. (HTTP response code 503) Reason: Fake Google crawler automatically blocked Important note for site admins: If you are the administrator of this website note that your access has been limited because you broke one of the Wordfence firewall rules. The reason you access was limited is: "Fake Google crawler automatically blocked". If this is a false positive, meaning that your access to your own site has been limited incorrectly, then you will need to regain access to your site, go to the Wordfence "options" page, go to the section for Firewall Rules and disable the rule that caused you to be blocked. For example, if you were blocked because it was detected that you are a fake Google crawler, then disable the rule that blocks fake google crawlers. Or if you were blocked because you were accessing your site too quickly, then increase the number of accesses allowed per minute. If you're still having trouble, then simply disable the Wordfence firewall and you will still benefit from the other security features that Wordfence provides. If you are a site administrator and have been accidentally locked out, please enter your email in the box below and click "Send". If the email address you enter belongs to a known site administrator or someone set to receive Wordfence alerts, we will send you an email to help you regain access. Please read our FAQ if this does not work.
Moz Pro | | wiredseo0 -
Crawl Errors from URL Parameter
Hello, I am having this issue within SEOmoz's Crawl Diagnosis report. There are a lot of crawl errors happening with pages associated with /login. I will see site.com/login?r=http://.... and have several duplicate content issues associated with those urls. Seeing this, I checked WMT to see if the Google crawler was showing this error as well. It wasn't. So what I ended doing was going to the robots.txt and disallowing rogerbot. It looks like this: User-agent: rogerbot Disallow:/login However, SEOmoz has crawled again and it still picking up on those URLs. Any ideas on how to fix? Thanks!
Moz Pro | | WrightIMC0 -
How to run down the actual source of a 404 error that is reported.
In my 404 errors, the second entry is as follows: URL: http://www.virginiahomesandforeclosures.com/listing/0428387-lot-k-commerce-park-franklin-va-23851/REWIDX_URL_CDNimg/no-image.gif Is there a simple way to find the root or page in which this error was generated? IF I visit this page " http://www.virginiahomesandforeclosures.com/listing/0428387-lot-k-commerce-park-franklin-va-23851" without the attached gobble de gook, I see a good page. So bottom line its possible it could be in one of my sitemaps, but I have 50 of those so its time consuming to search thru all 50 for each error like this since I have so many. I am pretty sure its not in my sitemaps, since google has not picked up any of these errors and they have crawled over 12,000 urls so far. When google gives me a 404 error I can click on the link and find what pages they found the link and go there and correct it at the root. Any suggestions would be greatly appreciated. I have more than 1,000 of these errors with the bad url with the junk attached to the end and have not been able to isolate the cause yet. Thanks in advance.
Moz Pro | | tommytx0 -
Crawl Errors Confusing Me
The SEOMoz crawl tool is telling me that I have a slew of crawl errors on the blog of one domain. All are related to the MSNbot. And related to trackbacks (which we do want to block, right?) and attachments (makes sense to block those, too) ... any idea why these are crawl issues with MSNbot and not Google? My robots.txt is here: http://www.wevegotthekeys.com/robots.txt. Thanks, MJ
Moz Pro | | mjtaylor0 -
Crawl Diagnostic Errors
Hi there, Seeing a large number of errors in the SEOMOZ Pro crawl results. The 404 errors are for pages that look like this: http://www.example.com/2010/07/blogpost/http:%2F%2Fwww.example.com%2F2010%2F07%2Fblogpost%2F I know that t%2F represents the two slashes, but I'm not sure why these addresses are being crawled. The site is a wordpress site. Anyone seen anything like this?
Moz Pro | | rosstaylor0 -
404 errors
I have a few 404 errors found using seomoz tool. It also shows the url, but I am not sure where in the site is it linking from. Is there a way to find out the origin of the 404 error. Thanks
Moz Pro | | Accounts0