"/blogroll" causing 404 error
-
I'm running a campaign, and the crawl report for my site returned a lot of 4xx errors. When I look at the URLs, they all have "/blogroll" at the end, like:
mysite.com/post-number-1/blogroll
mysite.com/post-number-2/blogroll
And so on, for pretty much all the pages. The thing is, I removed the blogroll widget completely, so I really wouldn't know what can possibly point to links like that.
Is there anything to fix on the site?
Thanks
-
Hi Andrea
Are you all set with this? The transfer may have had something to do with it, but the main thing now is to follow Adam's good advice: find the source of the 404 links and change them on your site. If the 404 URLs are indexed or linked to from elsewhere on the web, you need to 301 them to an existing page.
Let us know if you still need help!
-Dan
-
OK, so, I crawled my site with Screaming Frog and found the same errors. Actually I found out that the "privacy policy" page is causing the same 404 with the same type of URL "mysite.com/post-number-1/privacy-policy" (SEOmoz crawler had detected those as well, I just hadn't noticed).
The privacy policy page itself is published, and I cannot remove it, as I would no longer be compliant with Google AdSense policy.
A couple more things, though:
- I checked a couple of those 404 pages in Google with the "site:" command, and they're not indexed. I think those pages simply don't exist.
- The blogroll was in the sidebar, and the privacy policy page is linked in the footer, which means both of them are site-wide.
- I had a site before, then I deleted it and started my current one from scratch, importing all the content from WordPress to WordPress. Maybe this transfer has something to do with the issue?
-
Sorry Ben but I have to disagree with you here. That is very bad practice and also very poor advice. You shouldn't just ignore 404 pages from a site crawl.
Really the only time you should let pages just 404 is when Google has indexed them, there is no relevant page on your site to redirect them to, there are no high value links pointing to them and they are not being linked to from within your site.
However, in this case the 404 pages are being linked to from within the site. This means that value is being passed to these pages from within the site that could otherwise be passed to other pages.
Best practice in this situation is to fix the links that point to the 404 pages and 301 redirect the 404 pages to relevant pages on the site.
P.S. Running a quick site crawl and fixing the 404s should take minutes, not hours!
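As a rough illustration of the redirect step, here is a minimal Python sketch that works out where a broken URL could be 301'd to. It is only one possible choice, not something confirmed in this thread: it assumes the stray last segment ("blogroll" or "privacy-policy") should simply be dropped so the visitor lands on the parent post. The name `redirect_target` and the `STRAY_SEGMENTS` set are made up for the example, and it expects full URLs including the scheme.

```python
from urllib.parse import urlsplit, urlunsplit

# Assumption: these are the trailing segments that get wrongly appended to
# post URLs, taken from the examples reported in the thread.
STRAY_SEGMENTS = {"blogroll", "privacy-policy"}


def redirect_target(url):
    """Return the URL a broken address could be 301'd to, or None.

    A URL is treated as broken only when a stray segment follows at least
    one other path segment, e.g. /post-number-1/blogroll.
    """
    parts = urlsplit(url)
    segments = [s for s in parts.path.split("/") if s]
    if len(segments) >= 2 and segments[-1] in STRAY_SEGMENTS:
        # Drop the stray segment and point at the parent post.
        parent_path = "/" + "/".join(segments[:-1]) + "/"
        return urlunsplit((parts.scheme, parts.netloc, parent_path, parts.query, ""))
    return None


print(redirect_target("http://mysite.com/post-number-1/blogroll"))
print(redirect_target("http://mysite.com/privacy-policy"))
```

The output of a function like this would still need to be turned into real redirect rules in whatever the server or a redirect plugin supports, and fixing the links that point at these URLs comes first.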
-
Check GA (Google Analytics)
- Are the 404'd pages receiving search traffic?
- Are the 404'd pages ruining your user experience? (Are they accessible via your site links?)
If the answer to both is no, is this really worth a couple of hours of your time?
-
Hi Andrea,
If the crawl is returning 404 errors, it means that, although you have removed the widget, those URLs are still being linked to from somewhere on your site.
My advice would be to crawl the site with Screaming Frog, or another crawler if you have access to one. Once the crawl is done, you should be able to see which pages are still linking to the 404 pages. Once you have found these, you will have a better idea of how to fix the issue.
Remember, a crawler follows the links it finds on your site, so if it reports 404s, something on the site is linking to them.
Hope that helps,
Adam.
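To make the "find which links point to the 404 pages" step concrete, here is a minimal Python sketch. It rests on an assumption that nobody in this thread has confirmed: that the sidebar or footer link is written as a relative href (for example `privacy-policy` instead of `/privacy-policy`), which a browser or crawler resolves against the current post URL. The sample markup and the names `LinkCollector` and `find_suspect_links` are hypothetical.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collects the raw href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def find_suspect_links(page_url, html, suffixes=("/blogroll", "/privacy-policy")):
    """Return (raw_href, resolved_url) pairs that resolve to a nested suspect URL."""
    collector = LinkCollector()
    collector.feed(html)
    suspects = []
    for href in collector.hrefs:
        # Resolve the href the way a crawler would, relative to the page URL.
        resolved = urljoin(page_url, href)
        path = urlparse(resolved).path.rstrip("/")
        # Flag only nested paths such as /post-number-1/privacy-policy,
        # not the legitimate top-level /privacy-policy page.
        if path.endswith(suffixes) and path.count("/") > 1:
            suspects.append((href, resolved))
    return suspects


# Hypothetical footer markup with one relative and one root-relative link.
sample_html = """
<div id="footer">
  <a href="privacy-policy">Privacy Policy</a>
  <a href="/about">About</a>
</div>
"""

results = find_suspect_links("http://mysite.com/post-number-1/", sample_html)
for raw, resolved in results:
    print(raw, "->", resolved)
```

If a check like this flags a link, adding the leading slash (or using the full URL) in the theme or widget that outputs it would be the thing to try. If nothing is flagged, the cause lies elsewhere and the crawler's inlinks report is the better guide.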
-
Hi Don,
thanks for the quick help.
Yes, I'm running WordPress, with the Catalyst framework.
I was using the blogroll widget in the sidebar, but when I started to see the crawling errors I removed it just in case. The crawl is now complete, but even more errors of the same type have come out.
-
Hi Andrea
I'm not sure about the issue, but it may help others if you mention what type of software you're running.
I would assume WordPress, since you said "widget", but it could also be Joomla or another CMS.
Good Luck,
Don