100K Webmaster Central Not Found Links?
-
http://screencast.com/t/KLPVGTzM I just logged into our Webmaster Central account and found that it shows 100K links as not found. After looking through them, they all appear to come from our search bar, for searches with no results. Are we doing something wrong here?
-
Yeah, I read through that article yesterday and see that they recommend the same setting that the Yoast plugin should already be applying. I never got a response from Yoast, though, to find out whether something is missing.
For now, I plan to add this to the robots.txt file and see what results I get.
Do you know how long it takes for updates to show up in GWT? Will this update within a few weeks, or would it take longer than that?
Thanks for all the help!
BJ
-
Hello BJ.
The robots.txt file must be on your server, in the document root.
Here is information about how to configure robots.txt
Note that it does have a warning at the end about how you could lose some link juice, but that is probably a much smaller problem than the one you are trying to fix.
Nothing is perfect, and with the rate at which Google changes its mind, who knows what the right thing to do is this month.
Once you have edited robots.txt, you don't need to do anything else.
Except I just had a thought about how to get Google to remove those items from your Webmaster Tools. I think you should be able to tell them to purge those entries from GWT. Set it to show 500 per page, then just cycle through and mark them fixed.
-
Sorry to open this back up after a month. In adding this to the robots.txt file, is there something that needs to be done within the code of the site? Or can I simply update the robots.txt file within Google Webmaster Tools?
I was hoping to get a response from Yoast on his blog post. There were a number of questions similar to mine, but he never addressed them.
Thanks,
BJ
-
We all know nothing lasts forever.
A code change can do all kinds of things.
Things that were important are sometimes less important, or not important at all.
Sometimes yesterday's advice is no longer true.
If you make a change, or even if you make no change, but the crawler or the indexer changes, then we can be surprised at the results.
While working on this other thread:
http://www.seomoz.org/q/is-no-follow-ing-a-folder-influences-also-its-subfolders#post-74287
I did a test and checked my logs. A nofollow meta tag and a nofollow link do not stop the crawlers from following. What they do (we think) is not pass PageRank. That is all they do.
That is why the robots.txt file is the only way to tell the crawlers to stop following down a tree (until there is another way).
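For anyone who wants to repeat that kind of log check, here is a minimal Python sketch. It assumes the server writes Combined Log Format and that search URLs start with `/?s=`, as in the example search URL elsewhere in this thread. The sample lines, the IP addresses, the `rogerbot/1.0` user-agent string and the `crawler_hits` helper are all made up for illustration, so adjust them to whatever your own logs contain.

```python
import re
from collections import Counter

# Combined Log Format (assumed):
# ip - - [time] "METHOD path PROTO" status size "referer" "user-agent"
LOG_LINE = re.compile(
    r'"(?:GET|HEAD) (?P<path>\S+) [^"]*" (?P<status>\d{3}) \S+ "[^"]*" "(?P<agent>[^"]*)"'
)

# Name fragments to look for in the User-Agent header (illustrative list).
CRAWLERS = ("googlebot", "rogerbot", "bingbot")


def crawler_hits(lines, path_prefix):
    """Count requests per crawler for paths that start with path_prefix."""
    hits = Counter()
    for line in lines:
        match = LOG_LINE.search(line)
        if not match or not match.group("path").startswith(path_prefix):
            continue
        agent = match.group("agent").lower()
        for name in CRAWLERS:
            if name in agent:
                hits[name] += 1
    return hits


# Hypothetical log lines, not taken from any real site.
sample = [
    '192.0.2.1 - - [10/Oct/2012:13:55:36 +0000] "GET /?s=candy HTTP/1.1" 200 5120 "-" '
    '"Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"',
    '192.0.2.2 - - [10/Oct/2012:13:56:01 +0000] "GET /?s=zzzz HTTP/1.1" 404 512 "-" "rogerbot/1.0"',
    '192.0.2.3 - - [10/Oct/2012:13:57:12 +0000] "GET /about/ HTTP/1.1" 200 2048 "-" '
    '"Mozilla/5.0 (Windows NT 6.1)"',
]

hits = crawler_hits(sample, "/?s=")
print(dict(hits))
```

If crawlers keep showing up in this count after a nofollow tag is in place, that matches the observation above. If they stop showing up after a robots.txt change, the rule is being honored.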
-
Ok, I've posted a question on the Yoast.com blog to see what other options we might have. Thanks for the help!
-
It is because Roger ignores those META tags.
Also, Google often ignores them too.
The robots.txt file is a much better option for those crawlers.
There are some crawlers that ignore the robots file too, but you have no control over them unless you can put their IPs in the firewall or add code to ignore all of their requests.
-
Ok, I just did a little more research into how Yoast handles this within the plugin and came across this article: http://yoast.com/example-robots-txt-wordpress/
In the article he states that this is already included within the plugin on search pages:
I just confirmed this by doing this search on my site and looking at the code: http://www.discountqueens.com/?s=candy
So this has always been in place. Why would the 100K not found links still be showing up?
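The "looking at the code" step can be scripted. The sketch below is only an illustration: the sample page source and the `robots_directives` helper are hypothetical, and the tag the plugin actually emits may differ. It pulls the directives out of any `<meta name="robots">` tag. Keep in mind that a crawler has to request the page before it can read such a tag, so the tag alone would not be expected to stop those requests from being made.

```python
from html.parser import HTMLParser


class RobotsMetaFinder(HTMLParser):
    """Collect the directives of every <meta name="robots"> tag."""

    def __init__(self):
        super().__init__()
        self.directives = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        if (attr.get("name") or "").lower() == "robots":
            content = attr.get("content") or ""
            # Split "noindex, follow" into ["noindex", "follow"].
            self.directives.extend(
                part.strip().lower() for part in content.split(",") if part.strip()
            )


def robots_directives(html):
    """Return the list of robots meta directives found in the HTML text."""
    finder = RobotsMetaFinder()
    finder.feed(html)
    return finder.directives


# Hypothetical page source; the real tag emitted by the plugin may differ.
sample_html = """<html><head>
<title>Search results</title>
<meta name="robots" content="noindex, follow" />
</head><body></body></html>"""

directives = robots_directives(sample_html)
print(directives)
```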
-
We didn't have these errors showing up previously, which is why I was really suspicious. Also, we have Joost de Valk's SEO plugin installed on our site, and I thought there was an option to keep the searches from being indexed?
-
Just to support Alan Gray's response: it's very important to block crawlers from your site search. It throws errors (bots try to guess what to put in a search box), and any search results that get into the index will cause content conflicts, dilute ranking values and, in the worst case, create the false impression that you have a lot of very thin or near-duplicate pages.
-
The search bar results are good for searchers but not for search engines. You can stop search engines and Roger (the SEOmoz crawler) from going into those pages by adding an entry to your robots.txt file. Roger only responds to his own section of the robots file, so anything you make global will not work for him.
User-agent: rogerbot
Disallow: /search/*
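Before relying on a rule like this, it can help to check which URLs it actually covers. The following is a rough sketch using Python's standard-library `urllib.robotparser`. That parser matches by plain path prefix and does not expand `*`, so the rule is written here as `/search/`. The `example.com` URLs, the `otherbot` agent name and the `allowed` helper are placeholders, and real crawlers may interpret rules differently from this parser.

```python
from urllib.robotparser import RobotFileParser

# Assumed robots.txt content: the rogerbot section from the post above,
# written as a plain path prefix because this parser does not expand "*".
ROBOTS_TXT = """\
User-agent: rogerbot
Disallow: /search/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(url, agent="rogerbot"):
    """Return True if this parser would let the agent fetch the URL."""
    return parser.can_fetch(agent, url)


# A search page that lives under /search/ is covered by the rule.
blocked_search = allowed("http://www.example.com/search/candy")

# A query-string search such as /?s=candy does not start with /search/,
# so this rule does not cover it.
query_search = allowed("http://www.example.com/?s=candy")

# An agent with no matching section and no global section is not restricted.
other_agent = allowed("http://www.example.com/search/candy", agent="otherbot")

print(blocked_search, query_search, other_agent)
```

The point of the second check is that the path in the rule has to match the shape of the site's real search URLs. A site whose searches look like `/?s=candy` would need a rule written for that pattern instead of `/search/`.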