Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
Why are Seemingly Randomly Generated URLs Appearing as Errors in Google Webmaster Tools?
-
I've been confused by some URLs that are showing up as errors in our GWT account. They seem to just be randomly generated alphanumeric strings that Google is reporting as 404 errors. The pages do 404 because nothing ever existed there or was linked to. Here are some examples that are just off of our root domain:
/JEzjLs2wBR0D6wILPy0RCkM/WFRnUK9JrDyRoVCnR8=
/MevaBpcKoXnbHJpoTI5P42QPmQpjEPBlYffwY8Mc5I=
/YAKM15iU846X/ymikGEPsdq 26PUoIYSwfb8 FBh34=
I haven't been able to track down these character strings in any internet index or anywhere in our source code so I have no idea why Google is reporting them.
We've been pretty vigilant lately about duplicate content and thin content issues and my concern is that there are an unspecified number of urls like this that Google thinks exist but don't really.
Has anyone else seen GWT reporting errors like this for their site? Does anyone have any clue why Google would report them as errors?
-
Glad to hear you located the root cause of the issue.
-
Update: I think I found the source of the errors. We have a form submission box on every page of our site and there are random alphanumeric strings that are unique to every page that look like these urls that Google is reporting as errors. Maybe Google is trying to use our form submission tool.
It's still really odd that Google would think they are urls and report them as errors since they appear in an tag not an <a> tag</a>
<a>```
<input id="authenticity_token" name="authenticity_token" type="hidden" value="ElCkBCszxMddaqVyVIx+flkzRlM9xb7mHb3cFISlJvY=" /> -
The next step would be to check Google WMT > Diagnostics > Crawl errors > Not Found tab
Click the "Linked from Detected" text. That will show the source of the link.
-
That's not the case. We have an in house content system and no shopping cart. These urls return a 404 error and always have because they've never existed.
No crawlers have returned urls that look like this either. I can't get these urls to show up in Google's index either.
-
Hi Jimmy,
My bet is you are using a CMS or Shopping Cart on your site. The software is generating these seemingly random URLs, and the Google crawler is seeing them.
Crawlers don't move through your site the way people do. If you take your home page and right-click, then choose View Page Source, you will find many URLs. The crawler will then follow each and every URL throughout your site. The URLs you are seeing could be generated on any given page.
To track the issue, use the SEOmoz crawler on your site, then analyze the results. When you find these types of URLs, take note of the page they are located on.
Alternatively, you can contact your CMS or shopping cart's software developer and inquire about the URLs. If these URLs show a folder path before the random characters, that information can help track down which pages or software is generating these pages.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Strange landing page in Google Analytics
Hello MOZ Community, The website in question is https://x-y.com/ When i looked at the landing pages report in GA , x-y.com is appended at the end of every URL like this. https://x-y.com/x-y.com When i open the above URL in GA interface, it shows page not found. This is obvious as there is no such URL.
Reporting & Analytics | | Johnroger
The metrics like sessions, Users, Bounce rate all look good. In the property settings, The default URL is written like this http:// cell-gate.com (Please note that s is missing in property settings). But how is traffic tracked correctly How do i solve this problem. What settings should we change to make the landing pages report look ok Thanks0 -
Why is Indeed.com traffic appearing as organic in Google Analytics?
A large number of sessions in my client's Google Analytics account appear to come from medium: organic and source:Indeed. Since I'm focused on SEO for this project, I'd prefer that Indeed be treated as referral traffic. Any ideas for fixing this issue? Also, and I'm sure the answer is no, is there a way to fix the past data in Google Analytics that has already reported Indeed as an organic medium?
Reporting & Analytics | | Kevin_P0 -
Google Analytics - Average Position
Hi Just trying to get some clarity on Google Analytics Average Positions in "Aquistions/Search Engine Optimisation". For a very competitive keyword Google Analytics is saying i am on average position of 6. Is this Page 6? I am assuming position six would be 1.6?
Reporting & Analytics | | Cocoonfxmedia0 -
How to get multiple pages to appear under main url in search - photo attached
How do you get a site to have an organized site map under the main url when it is searched as in the example photo? SIte-map.png
Reporting & Analytics | | marketingmediamanagement0 -
Parenthesis in URL?
For some reason, we have some URLs on our site with parentheses in them such as somesite.com/used-this(that)What will parenthesis do to the ranking of those pages?
Reporting & Analytics | | CFSSEO0 -
Why google stubbornly keeps indexing my http urls instead of the https ones?
I moved everything to https in November, but there are plenty of pages which are still indexed by google as http instead of https, and I am wondering why. Example: http://www.gomme-auto.it/pneumatici/barum correctly redirect permanently to https://www.gomme-auto.it/pneumatici/barum Nevertheless if you search for pneumatici barum: https://www.google.it/search?q=pneumatici+barum&oq=pneumatici+barum The third organic result listed is still http. Since we moved to https google crawler visited that page tens of time, last one two days ago. But doesn't seems to care to update the protocol in google index. Anyone knows why? My concern is when I use API like semrush and ahrefs I have to do it twice to try both http and https, for a total of around 65k urls I waste a lot of my quota.
Reporting & Analytics | | max.favilli0 -
How to remove unwanted dynamic parameters from a URL in Google Analytics
Hi, Would really appreciate some help with this. I have been experimenting with RegEx to achieve this but as I’ve never used it before am currently failing miserably. We have conversion pages i need to set goals for that are formatted as below: https://www.domain.co.uk//Application_Form/(S(ewhbqp5cki0mppuzukunkqno))/enterCardDetails.aspx I need to remove the (s(xxx)) section from the URL as rather than one pages i currently have thousands of unique URL's. What’s catching me out is that as it’s not a URL parameter I can’t discount and as half way through can’t just do head matches etc to /entercarddetails Help would be much appreciated. Thanks.
Reporting & Analytics | | Sarbs0