GSC is reporting a lot of chopped URLs
-
Recently, in the last two weeks, I started seeing a lot of odd 404 errors in GSC for my site. Upon investigation, the URLs are for fairly new articles, and the URLs are chopped in various places. From missing a character at the end to missing about 10 characters at the end of the URL. (an old similar issue is that GSC reports duplicate contents on weird subdomains that we've never used like 'smtp' 'ww1' or even random ones like 'bobo'.)
GSC doesn't report any 'linked from' for those odd URLs and I know for sure these links aren't on the site itself. They're definitely not errors in the CMS.
The site is a long established site (started 1997-1998) and we've been subject to a lot of negative SEO. I recently had to disavow about 1000 .ru domain linking to us, with some domains containing over a million link each.
Could these chopped links be a new tactic of negative SEO? How do I find these seemingly intentionally broken links to us?
-
Thanks for the question. It isn't uncommon for there to be strange 404 errors in Search Console with little information/bad information. They are working hard to improve this, but I wouldn't take everything you see there as set-in-stone.
This doesn't sound like a negative SEO tactic. I would just mark them all as fixed, and see if they appear again in about a week. If they do, I'd make sure they are actually served as 4xx status and not worry too much about it. If you want to do more digging...
Some ideas of where you could look further
- Logs logs logs. This will be the ultimate truth - you will be able to see whether or not GoogleBot is actually hitting those URLs.
- It could be something weird happening with a plugin of yours that generates those URLs (particularly on Wordpress).
- Perhaps you have a filtering system setup that generates these URLs?
- If you have a search function on the site, sometimes weird URLs can be generated through that.
- Do the URLs come-up when you crawl the site at all?
Just a few ideas!
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
URL Too Long vs. 301 Redirect
We have a small number of content pages where the urls paths were setup before we started looking really hard at SEO. The paths are longer than recommended (but not super crazy IMHO) and some of the pages get a decent amount of traffic. Moz suggests updating the URLs to make them shorter but I wonder if anyone has experience with the tradeoffs here. Is it better to mark those issues to be ignored and just use good URLs going forward or would you suggest updating the URLs to something shorter and implementing a 301 redirect?
Intermediate & Advanced SEO | | russell_ms0 -
Sanity Check: NoIndexing a Boatload of URLs
Hi, I'm working with a Shopify site that has about 10x more URLs in Google's index than it really ought to. This equals thousands of urls bloating the index. Shopify makes it super easy to make endless new collections of products, where none of the new collections has any new content... just a new mix of products. Over time, this makes for a ton of duplicate content. My response, aside from making other new/unique content, is to select some choice collections with KW/topic opportunities in organic and add unique content to those pages. At the same time, noindexing the other 90% of excess collections pages. The thing is there's evidently no method that I could find of just uploading a list of urls to Shopify to tag noindex. And, it's too time consuming to do this one url at a time, so I wrote a little script to add a noindex tag (not nofollow) to pages that share various identical title tags, since many of them do. This saves some time, but I have to be careful to not inadvertently noindex a page I want to keep. Here are my questions: Is this what you would do? To me it seems a little crazy that I have to do this by title tag, although faster than one at a time. Would you follow it up with a deindex request (one url at a time) with Google or just let Google figure it out over time? Are there any potential negative side effects from noindexing 90% of what Google is already aware of? Any additional ideas? Thanks! Best... Mike
Intermediate & Advanced SEO | | 945010 -
My client wants to rebrand their company including URL...
Hi all! An easy one for probably most of you. I have a client who wants to re-brand their business name and also match their URL to the new name. Their current domain name URL has been out there for over 5 years, and the site is performing quite well in search. Switching to a new URL will obviously be a very bad thing, but what are the options? newname.com redirect to the aged oldname.com, but when they are on the site, or when they find them in search, the oldname.com has nothing to do with the new brand. Or should we 301 every page of the oldname.com site to the newname.com be good enough? What is recommended? Thanks!
Intermediate & Advanced SEO | | BBuck0 -
WIX? How to change URLS and is it any good for SEO
I have Wix website and want to change its url structure, but not able to do so, any one know how to do so?. Does below URLs work for SEO? And Is Wix is good for SEO or not? abc.com/#!how-it-works/c46c abc.com/#!party-event/c1lzb
Intermediate & Advanced SEO | | Dan_Brown10 -
HTML for URL markup
Hi, We are changing our URLs to be more SEO friendly. Is there any negative impact or pitfall of using <base> HTML-tag? Our developers are considering it as a possible solution for relative URLs inside HTML-markup in the Friendly URL context.
Intermediate & Advanced SEO | | theLotter0 -
Spaces in URL line
Hi Gurus, I recently made the mistake of putting a space into a URL line between two words that make up my primary key word. Think www.example.com/Jelly Donuts/mmmNice.php instead of www.example.com/JellyDonuts/mmmNice.php This mistake now needed fixing to www.example.com/Jelly Donuts/mmmNice.php to pass W3, but has been in place for a while but most articles/documents under 'Jelly Donuts' are not ranking well (which is probably the obvious outcome of the mistake). I am wondering whether the best solution from an SEO ranking viewpoint is to: 1. Change the article directory immediately to www.example.com/JellyDonuts/mmmNice.php and rel=canonical each article to the new correct URL. Take out the 'trash' using robots.txt or to 301 www.example.com/Jelly Donut to the www.example.com/JellyDonut directory? or perhaps something else? Thanks in advance for your help with this sticky (but tasty) conundrum, Brad
Intermediate & Advanced SEO | | BM70 -
How important is it to clarify URL parameters?
We have a long list of URL parameters in our Google Webmasters account. Currently, the majority are set to 'let googlebot decide.' How important is it to specify exactly what googlebot should do? Would you leave these to 'let googlebot decide' or would you specify how googlebot should treat each parameter?
Intermediate & Advanced SEO | | nicole.healthline0 -
Impact of Non-English target keywords in URL
Hi all, our site language is Farsi (Persian) so at first we tried to create URLs that contain our target keywords in Farsi too. The problem with this approach is that our URLs are not shown in a friendly style anymore: a bunch of unicode numeric codes instead of Farsi characters. Do you know which is the best approach? 1. Creating ugly looking URLs containing Farsi keywords 2. Forget about putting our keywords in URLs and have nice English URLs Thanks in advance for your time and help 🙂
Intermediate & Advanced SEO | | diki0