Soft 404s for unpublished & 301'd content
-
Hi,
One site I work with unpublished a lot of thin content. Great idea, right?
These unpublished pages were then 301'd up to the main category page that they previously existed in.
Now Google Webmaster Tools calls them out as soft 404 errors. This seems unexpected since the pages
were 301'd. Here is my question; Is this a serious problem that may affect the site's overall organic results
and if so what should I do about it?
Thanks... Darcy
-
Short answer: create a custom 404 page, not just for these pages, but one that can show for everypage on your site.
A few resources:
https://support.google.com/webmasters/answer/93641?hl=en
Example: http://moz.com/sadfklfadsadfjs
-
Cyrus, thanks for hanging in there with my questions. If I just give back a 404, what am I showing them on the page?
I would think seeing the main questions page would be better than just sitting at the original url and looking at 404 page notice - seems like a bad user experience if Google wants to get all user-experiency about it.
Thanks... Darcy
-
Yes, it's possible, but that could be considered cloaking. I'd say best to return a 404.
-
Hi Cyrus,
Have not experienced a dip, but things have been a little static.
Can you do both... forward the page and give back a 404?
What would you do?
Thanks... Darcy
-
Yes, I would think that at the point Google crawls it and finds it forwarded it would drop it from the index and not waste resources crawling it again unless linked somewhere. I will keep an eye out for links, but don't believe that there are any.
Thanks, Dirk... Darcy
-
In that case, sounds like you should either:
- 404 them if you have evidence these have hurt your rankings/traffic (have you experienced a dip?)
- Ignore them and go about your day
-
Hi Cyrus,
Thanks for the info. These are forum pages where no one ever answered the question, so
there is no helpful info and very little content.
They were forwarded up to the main questions page (one / up the url structure).
The page they were forwarded to is like a questions category page, not specific to the subject of the
forwarded page. These forwarded pages don't get much/any traffic because they never ranked
and we didn't promote them.
If it doesn't hurt overall search on other pages, I'd rather not go to the substantial effort of finding subject-relevant pages to forward to, since no one will ever go to the original url and need to see something super relevant.
Your thoughts? Thanks! Best... Darcy
-
If Fetch like Google is also giving a 301 - I would mark them as solved in WMT & check if they re-appear.
If you click on the i next to the redirect message in Fetch like Google - it shows the type of redirect & the page it's redirecting to. I assume you checked that this is also a 301.I have a similar issue on one of my sites - if a user gets to a non-existing url - the server first tries to find out if the page exists - if it doesn't it's redirected to a 404 page. Although technically it is a 301 - WMT sees them as a soft 404 as the destination page is a "Page not found" type of page (called 404.php) - which (quite ironically) renders a 200 status.
On the destination page - do you mention somewhere a message like "page not found" or is it just a plain category page?
The SEO impact is difficult to assess - Google says these pages are mainly wasting the bot's time as it's indexing pages that do no longer exist, not sure if it is also affecting rankings. As you did the crawl with Screaming Frog, I guess you are also removing all internal links to these redirected pages? If these links disappear, and as the content was thin, I suspect you don't have many external links pointing to them, so the problem should disappear after a while.
rgds,
Dirk
-
If Google thinks the 301 leads to a page that isn't relevant enough, they may flag it as a "soft 404" even though it returns a 301. That's Google's way of saying they think you should 404 these pages instead.
How much will it hurt you? Probably not much, but it's hard to say.
Let's ask these questions:
- How much traffic goes to these pages? If not much, is it okay to 404 them?
- Are there more relevant pages you could redirect these to? (ideally, something with a similar title as the original page?)
- Have you seen much traffic loss overall? If not, it's likely this isn't hurting you.
Hope this helps! Best of luck with your SEO.
-
Okay, that is extra weird. It could be that GWT hasn't update your information since you made the changes. Since everywhere else is telling it's correct -- especially the fetch tool -- then you should wait a few more days and see if it updates.
-
Hi Erica,
I'm saying that the only place it shows a soft 404 is in GWT errors. Screaming Frog, web-sniffer and now Fetch As Google In GWT, all show them as 301 re-directs. I can't re-direct them more than they are. So, is GWT just goofy?
Thanks... Darcy
-
Hi Darcy,
Yeah, if it's still showing as a soft 404, there's still something wrong. I'd try using fetch and render as Google bot and see what happens.
Best of luck!
-
Hi Dirk,
Thanks for the suggestion. As noted above, I put the whole list thru screaming frog and a few thru your suggestion of web-sniffer.net.
95% of the whole list is 301s and 100% of the few put one at a time thru web-sniffer come back as 301s.
My question remains "Is this a serious problem that may affect the site's overall organic results
and if so what should I do about it?"
Thanks... Darcy
-
Hi Erica,
I put the list through screaming frog and 95% of the urls are shown as 301s.
Do you think screaming frog has it right or is there something they wouldn't catch?
Thanks... Darcy
-
Maybe an obvious question but did you check that the url's are indeed properly redirected - checking them with 'Fetch like Google' in WMT or by using a tool like web-sniffer.net?
rgds,
Dirk
-
I'd check to make sure your 301s were done correctly. If they are showing up as soft 404s, they are probably implemented wrong.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
After hack and remediation, thousands of URL's still appearing as 'Valid' in google search console. How to remedy?
I'm working on a site that was hacked in March 2019 and in the process, nearly 900,000 spam links were generated and indexed. After remediation of the hack in April 2019, the spammy URLs began dropping out of the index until last week, when Search Console showed around 8,000 as "Indexed, not submitted in sitemap" but listed as "Valid" in the coverage report and many of them are still hack-related URLs that are listed as being indexed in March 2019, despite the fact that clicking on them leads to a 404. As of this Saturday, the number jumped up to 18,000, but I have no way of finding out using the search console reports why the jump happened or what are the new URLs that were added, the only sort mechanism is last crawled and they don't show up there. How long can I expect it to take for these remaining urls to also be removed from the index? Is there any way to expedite the process? I've submitted a 'new' sitemap several times, which (so far) has not helped. Is there any way to see inside the new GSC view why/how the number of valid URLs in the indexed doubled over one weekend?
Intermediate & Advanced SEO | | rickyporco0 -
Should I better noindex 'scripted' files in our portfolio?
Hello Moz community, As a means of a portfolio, we upload these PowerPoint exports – which are converted into HTML5 to maintain interactivity and animations. Works pretty nicely! We link to these exported files from our products pages. (We are a presentation design company, so they're pretty relevant). For example: https://www.bentopresentaties.nl/wp-content/portfolio/ecar/index.html However, they keep coming up in the Crawl warnings, as the exported HTML-file doesn't contain text (just code), so we get errors in: thin content no H1 missing meta description missing canonical tag I could manually add the last two, but the first warnings are just unsolvable. Therefore I figured we probably better noindex all these files… They appear to don't contain any searchable content and even then; the content of our clients work is not relevant for our search terms etc. They're mere examples, just in the form of HTML files. Am I missing something or should I better noindex these/such files? (And if so: is there a way to include a whole directory to noindex automatically, so I don't have to manually 'fix' all the HTML exports with a noindex tag in the future? I read that using disallow in robots.txt wouldn't work, as we will still link to these files as portfolio examples).
Intermediate & Advanced SEO | | BentoPres0 -
HTTPS & Redirects
Hi We're moving to https imminently & I wondered if anyone has advice on redirects. Obviously we'll be redirecting all http versions to https - but should I be checking how many redirects are in each chain and amending accordingly? If there's 4-5 in a chain, remove the middle unnecessary URLS ? Advice please 🙂
Intermediate & Advanced SEO | | BeckyKey0 -
My site has a loft of leftover content that's irrelevant to the main business -- what should I do with it?
Hi Moz! I'm working on a site that has thousands of pages of content that are not relevant to the business anymore since it took a different direction. Some of these pages still get a lot of traffic. What should I do with them? 404? Keep them? Redirect? Are these pages hurting rankings for the target terms? Thanks for reading!
Intermediate & Advanced SEO | | DA20130 -
Affiliate & canonicals
Hi, any help with this one would be great.... www.example.com sells widgets online. They are also promoted on a 3rd party website www.partner.com. Currently www.partner.com links to a page on www.example.com that is completely branded with the 'partners' design, style and unique copy (you would think you were still on 'partner' website). I saw this interesting article from 2011: http://www.seomoz.org/blog/getting-seo-value-from-your-affiliate-links (in particular idea 1) Do you think adding a rel=canonical on www.example.com's partner page is still safe? All the best & thank you, Richard
Intermediate & Advanced SEO | | Richard5550 -
Blog content - what to do, and what to avoid in terms of links, when you're paying for blog content
Hi, I've just been looking at a restaurant site which is paying food writers to put food news and blogs on their website. I checked the backlink profile of the site and the various bloggers in question usually link from their blogs / company websites to the said restaurant to help promote any new blogs that appear on the restaurant site. That got me wondering about whether this might cause problems with Google. I guess they've been putting about one blog live per month for 2 years, from 12/13 bloggers who have been linking to their website. What would you advise?
Intermediate & Advanced SEO | | McTaggart0 -
Which duplicate content should I remove?
I have duplicate content and am trying to figure out which URL to remove. What should I take into consideration? Authority? How close to the root the page is? How clear the path is? Would appreciate your help! Thanks!
Intermediate & Advanced SEO | | Ocularis0 -
301 a page and then remove the 301
I have a real estate website that has a city hub page. All the homes for sale within a city are linked to from this hub page. Certain small cities may have one home on the market for a month and then not have any homes on the market for months or years. I call them "Ghost Cities". This problem happens across many cities at any point in time. The resulting city hub pages are left with little to no content. We are throwing around the idea of 301 redirecting these "Ghost City" pages to a page higher up in the hierarchy (Think state or county) until we get new homes for sale in the city. At that point we would remove the 301. Any thoughts on this strategy? Is it bad to turn 301s on and off like that? Thanks!
Intermediate & Advanced SEO | | ChrisKolmar0