Can anyone explain why and how these odd URLs could be working?
-
In our GWT and Google Analytics traffic reports, I often see some very oddly formed URLs. Here's an example
http://www.ccisolutions.com/storefront/www.ccisolutions.com
and here's another
http://www.ccisolutions.com/StoreFront/category//www.ccisolutions.com/StoreFront/CEW.catWhat strikes me about this particular URL is two things:
- It renders this page http://www.ccisolutions.com/StoreFront/category/on-disc-printing, but not with that URL, the URL stays http://www.ccisolutions.com/StoreFront/category//www.ccisolutions.com/StoreFront/CEW.cat
- When I break this URL into pieces
http://www.ccisolutions.com/StoreFront/category/CEW.cat
and www.ccisolutions.com/StoreFront/CEW.cat,
both redirect to: http://www.ccisolutions.com/StoreFront/category/on-disc-printingThis makes me wonder, is there something (a rule?) in the
backend (maybe the .htaccess file?)that was set up that sayshttp://www.ccisolutions.com/StoreFront/category/CEW.cat
= www.ccisolutions.com/StoreFront/CEW.cat
(or maybe vice versa?), and as a result an odd URL for the page is being
written automatically?This scenario worked on every category page I checked. All had the same results. For example, I tried:
http://www.ccisolutions.com/StoreFront/category//www.ccisolutions.com/StoreFront/AAA.cat
and it rendered the Live Sound category page, but without redirecting to the
user friendly URL. This URL stayed unchanged in the address barWhen I broke it into pieces, like
http://www.ccisolutions.com/StoreFront/category/AAA.cat
and www.ccisolutions.com/StoreFront/AAA.cat, both of these redirected to http://www.ccisolutions.com/StoreFront/category/sound-video-lighting-equipment-expertsHave any of you ever encountered a problem like this? Any sugeestions as to what might be causing it and how to remedy the problem? It is definitely causing us a duplicate content headache. Thanks!
Dana
-
Thanks George! Fantastic detail and I think between your suggestions and Ben's too we are going to get further to solving this than we've ever gotten before. Perhaps we'll even solve this. That would be so great. As I mentioned, the company identified this problem 4 years before they hired me, and it's never been solved. I feel like part of why I am there as there SEO strategist is to pound away at these problems until they're fixed.
Thanks so much to you both. I can't wait to go in on Monday morning and use these suggestions to solve a five year old problem! Awesome.
I'll let you know what happens. If we fix it, I owe you and Ben dinner! (at the very least)
-
Thanks Ben. No apology necessary, it's all good. Your suggestion in combination with George's could lead us to an answer. This is definitely going to get us closer to finding the problem than we've ever gotten before. The company has been aware of this problem for almost 5 years but hasn't ever identified how to fix it. I've only been there a year now and I'm on the warpath to fix these technical issues. There are so many of them causing duplicate content problems that any SEO I do is undermined by problems like these.
I really really appreciate your reply and suggestions!
Dana
-
I'm not sure what CMS you are using, but I've seen this before with Joomla when setting the SEO Settings in the Global Configuration section of the Administration panel. Specifically, when working with the Apache mod_rewrite setting; which is related to the .htaccess question you had.
There are a number of things wrong with the way some CMS's have set up their redirects and how they present content. You may end up playing with each combination to fix your issue (depending on how you want to fix it).
If I were looking into this, I would do the following:
- I would determine if I was using Joomla. If so, check your configuration.php file and see if you have your domain name provided in the property for "live_site". If you do, try changing this from 'www.ccisolutions.com' (or whatever is there) to the empty string '' (aka just two single quotes).
- If you are not using Joomla, see if there is a configuration file for the CMS you are using and look for something similar to the above.
- If there is not a configuration setting that is providing for this "duplication" of domain name, look at the .htaccess file itself to see why it redirects when you break the URL up, but not when it has a second domain string in the URL (e.g. the second "www.ccisolutions.com").
- Then look at the code for the CMS and see how it interprets your URLs. To me it looks like you are using some sort of MVC framework which is taking each piece of the URL and translating it into variables to determine what content to show (REST-like). When it is parsing the URL, it seems to be looking for the end of the domain name and then taking anything off the end to translate into content.
However you figure out the issue, I suggest looking at how your CMS is actually producing the canonical tag. Right now this URL (http://www.ccisolutions.com/StoreFront/category/www.ccisolutions.com/StoreFront/CEW.cat) is using the following canonical:
rel="canonical" href="on-disc-printing"/>
I don't think that is what you are looking for in your canonical tags.
I hope this helps and answers your questions.
-
Hi Dana,
I wrote the following after assuming , for no reason at all, that you didn't know much about SEO. However having looked at your profile I realized that I was wrong and that my tone is probably a little patronizing. That being said it's 1am over here and I really don't want to rewrite it so please accept my apologies.
If I had to guess (and it is a guess as I'm not technical) I would say it was some badly formed links.
You know how some of your error pages have an Origin parameter (like this one) that say where the page was generated? Well these URLs follow the same format as the error pages that you're finding. It looks like rather than using an absolute link (like http://www.ccisolutions.com/page) the onclick action is actually generating a relative link (so just /page).
When you use a relative link your site adds the partial URL (/page) onto the end of your domain to give you a full URL (http://www.ccisolutions.com + /page = http://www.ccisolutions.com/page). It looks like you're using relative links as if they were static ones. Which is why you have "www.ccisolutions" in each URL twice.
If I had to blame anything it would be whatever is powering your IAFDispatcher however as I haven't been able to replicate your problem I couldn't be certain. If you can track how these URLs were generated by looking at the preceding pages that are sending traffic/bots to them then you should be able to narrow it down to which links are broken.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Rel Canonicals not working properly.
We recently implemented rel=canonicals on a few of our pages to prevent query parameters from showing up in the SERPs. The two pages we added the tags to are no longer ranking. The pages used to rank very well for branded terms such as "morningstar direct" and "morningstar sustainability", but now don't show up at all. When you search for the urls specifically, for example "products/direct morningstar" the query parameter is still indexing. Does anyone know why this might be or what we can do to fix this issue? The two pages are www.morningstar.com/products/direct and https://www.morningstar.com/company/sustainability
Technical SEO | | jmigdal0 -
URL Parameters to Ignore
Hi Mozers, **We have a glossary of terms made up of a main page that lists out ALL of the terms, and then individual pages per alphabet letter that limit the results to that specific alphabet letter. These pages look like this: ** https://www.XXXX.XXX/publications/dictionaries/XXX-terms?expand=A https://www.XXXX.XXX/publications/dictionaries/XXX-terms?expand=B https://www.XXXX.XXX/publications/dictionaries/XXX-terms?expand=C https://www.XXXX.XXX/publications/dictionaries/XXX-terms?expand=D etc. If I'd like Google to remove all of these "expand=" pages from the index, such that only the main page is indexed, what is the exact parameter that I should ask Google to ignore in Search Console? "expand=" ? Just want to make sure! Thanks for the help!!!
Technical SEO | | yaelslater1 -
URL Parameters for Product Variation
Hey there Mozzers, So I have a website that uses URL Parameters for product variations. So for example when I am in the product "test" that product has 5 different variations. So the url is as followed /product/test /products/test?variant=10271886529 /products/test?variant=10271886530 etc. Does Google understand that this is the same page ? Does it automatically exclude the variable in the url?
Technical SEO | | AngelosS0 -
Why can no tool crawl this site?
I am trying to perform a crawl analysis on a client's website at https://www.bravosolution.com I have tried to crawl it with IIS for SEO, Sreaming Frog and Xenu and not one of them makes it further than the home page of the site. There is nothing I can see in the robots.txt that is blocking these agents. As far as I can see, Google is able to crawl the site although they have noticed a significant drop in organic traffic. Any advise would be very welcome Regards Danny
Technical SEO | | richdan0 -
Changing URL of posts
HI, I need to change the urls and permalink structure of my blogposts. How I have to deal all this with google? Do I have to re-submit the pages to google with fetch as google? Will google display duplicate content of the same article ( having changed the url) or will it automatically replace the old url with the new ones? Tx for your support guys!
Technical SEO | | tourtravel0 -
Duplicate content with same URL?
SEOmoz is saying that I have duplicate content on: http://www.XXXX.com/content.asp?ID=ID http://www.XXXX.com/CONTENT.ASP?ID=ID The only difference I see in the URL is that the "content.asp" is capitalized in the second URL. Should I be worried about this or is this an issue with the SEOmoz crawl? Thanks for any help. Mike
Technical SEO | | Mike.Goracke0 -
Has anyone used Micrositez? Or can recommend them?
Hi all, After using SEOmoz for a while (and I have learnt alot - it's a fantastic resource) I think it's about time our website is given the nudge by a professional, who unlike me isn't learning the ropes and trying to compete with the big boys! So easy question, has anyone used/can vouch for http://www.micrositezdigital.co.uk/ I've been in contact with them alot recently and spent hours on the phone. 1 side of me is impressed, the other is scared to death of the monthly price. Any help would be appreciated with this, It would nice to know how good they are especially in regards to their bespoke campaigns. Thanks, Allan
Technical SEO | | allan-chris0 -
Go Daddy can anyone explain the point of their auction site
Hi i am very confused. I found out that go daddy was selling in2town.com and they are the owners of the domain name so i went ahead and bid for the domain name. i then won it and was told i would have to wait six days to use it and now i have received this email from them. This message is to notify you that the current registrant of in2town.com has renewed their domain name and we have ended the auction per the terms of the legal agreement (Universal Terms of Service) that you acknowledged at the time of bidding. If you have already paid for this domain name, you will automatically receive a refund within the next 6 days. We're sorry you did not win this domain. However, we have thousands of other great names available at Go Daddy Auctions. Visit our Web site to search for your next domain. If you have any questions, Customer Support is available 24 hours a day, 7 days a week: – Email: auctions@godaddy.com
Technical SEO | | ClaireH-184886
– Phone: (480) 505-8892
– Online Support Sincerely,
Go Daddy Auctions Team0