Long urls created by filters (not with query parameters)
-
A website adds subfolders to a category URL for each filter that's selected. In a crawl of the website some of these URLs reach over 400 characters. For example, if I select shoe size 5, 5.5 and 6, white and blue colour, price $70-$100, heel and platform styles, the URL will be as follows:
There is a canonical that points to www.example.com/shoes/womens/ so it isn't a duplicate content issue.
But these URLs still get crawled. How would you handle this? It's not a great system so I'm tempted to tell them to start over with best practice recommendations, but maybe I should just tell them to block the "/filters/" folder from crawlers? For some products however, filtered content would be worth having in search indexes (e.g. colour).
-
I certainly know the feeling.
-
Completely bad Google day. Hacked universities set me off.
-
I would simply block the /filters/ folder for this client since those URLs aren't indexed (due to the canonical tag) and probably have zero links to contribute to the pagerank of the canonical page. All they're doing from a search engine's perspective is eating up crawl budget.
I understand about the color filter, however. There are several options:
- Don't worry about it. Right now it's not helping anyway since the color filter URLs rel canonical to the main category URL. If you are seeing traffic from search engines going directly into a color filter URL as a landing page from the SERPs then the canonical tag probably isn't working. If you're not seeing them as organic search landing pages, then what difference does it make, traffic-wise, if you block them?
- Create sub-categories for color if the pages are that important.
- Force the color filter to show up first in the URL and exempt it from the robots.txt block...
allow: ///filters/color/
disallow: ///filters/I'm not sure about what Travis is trying to say. Sounds like he's having a bad Google day.
-
Apparently you don't have to worry about any of that if you just hack a .edu site. XD (see screen shot - no worries, did the right thing and emailed their technical contact) Seriously though, that isn't recommended.
Mother of pearl, some of the keywords the FDU site ranks for are low competition gold. Run it through SEM Rush and have fun. I can't exactly make a strong case for any super technical theory with crap like that ranking.
The Googles.... makes my head hurt... I'm going to go cry now.
Something something... save crawl budget doing it other ways...
Something something... look at Zappos and Nike's left sidebar menus.
I quit.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Can I remove certain parameters from the canonical URL?
For example, https://www.jamestowndistributors.com/product/epoxy-and-adhesives?page=2&resultsPerPage=16 is the paginated URL of the category https://www.jamestowndistributors.com/product/epoxy-and-adhesives/. Can I remove the &resultsPerPage= variation from the canonical without it causing an issue? Even though the actual page URL has that parameter? I was thinking of using this: instead of: What is the best practice?
Intermediate & Advanced SEO | | laurengdicenso0 -
Long tail there are no long tail keywords....
Hi I am struggling trying to optimise product pages for a product area which doesn't have a lot of specific longtail product related searches. It's 'Lockers' I have more specific sub-category pages which drill down such as - Wire Mesh Lockers Charging Lockers Laptop Lockers Just to name a few, but to drill down more to product names doesn't offer much. Or, in some cases the products are so similar they focus on similar keywords, for example '2 tier metal lockers' applies to loads of different products. Do I do the best I can with product titles, then focus on sub-categories? Love to hear thoughts 🙂
Intermediate & Advanced SEO | | BeckyKey0 -
URL Parameters, Forms & SEO
Hi I have some pages on the site which have a quote form, in my site crawl I see these showing as duplicate content - my webmaster says this isn't the case, but I'm not sure. Landing page - https://www.key.co.uk/en/key/high-esd-chairs Page with form - https://www.key.co.uk/en/key/high-esd-chairs?quote-form - this also somehow has a canonical on it pointing to https://www.key.co.uk/en/key/high-esd-chairs?quote-form Which neither of us have added. I'm thinking we need to get the canonical needs to be updated to https://www.key.co.uk/en/key/high-esd-chairs Is it worth doing this for all these pages or am I worrying about nothing? Becky
Intermediate & Advanced SEO | | BeckyKey0 -
Product search URLs with parameters and pagination issues - how should I deal with them?
Hello Mozzers - I am looking at a site that deals with URLs that generate parameters (sadly unavoidable in the case of this website, with the resource they have available - none for redevelopment) - they deal with the URLs that include parameters with *robots.txt - e.g. Disallow: /red-wines/? ** Beyond that, they userel=canonical on every PAGINATED parameter page[such as https://wine****.com/red-wines/?region=rhone&minprice=10&pIndex=2] in search results.** I have never used this method on paginated "product results" pages - Surely this is the incorrect use of canonical because these parameter pages are not simply duplicates of the main /red-wines/ page? - perhaps they are using it in case the robots.txt directive isn't followed, as sometimes it isn't - to guard against the indexing of some of the parameter pages??? I note that Rand Fishkin has commented: "“a rel=canonical directive on paginated results pointing back to the top page in an attempt to flow link juice to that URL, because “you'll either misdirect the engines into thinking you have only a single page of results or convince them that your directives aren't worth following (as they find clearly unique content on those pages).” **- yet I see this time again on ecommerce sites, on paginated result - any idea why? ** Now the way I'd deal with this is: Meta robots tags on the parameter pages I don't want indexing (nofollow, noindex - this is not duplicate content so I would nofollow but perhaps I should follow?)
Intermediate & Advanced SEO | | McTaggart
Use rel="next" and rel="prev" links on paginated pages - that should be enough. Look forward to feedback and thanks in advance, Luke0 -
Duplicate URLs on eCommerce site caused by parameters
Hi there, We have a client with a large eCommerce site with about 1500 duplicate URLs caused by the parameters in the URLs (such as the sort parameter where the list of products are then sorted by price, age etc.) Example: www.example.com/cars/toyota First duplicate URL: www.example.com/cars/toyota?sort=price-ascending Second duplicate URL: www.example.com/cars/toyota?sort=price-descending Third duplicate URL: www.example.com/cars/toyota?sort=age-descending Originally we had advised to add a robots.txt file to block search engines from crawling the URLs with parameters but this hasn't been done. My question: If we add the robots.txt now and exclude all URLs with filters - how long will it take for Google to disregard the duplicate URLs? We could ask the developers to add canonical tags to all the duplicates but these are about 1500... Thanks in advance for any advice!
Intermediate & Advanced SEO | | Gabriele_Layoutweb0 -
404 broken URLs coming up in Google
When we do a search for our brand, we are get the following results in google.com.au (see image attachment). As outlined in red, there are listings in Google that result in 404 Page Not Found URLs. What can we do to enable google to do a recrawl or to ensure that these broken URLs are no longer listed in Google? Thanks for your help here! sBqpvtj
Intermediate & Advanced SEO | | Gavo0 -
URL tracking on offline material
Hi there, Hope someone can give some advice. We are doing some magazine advertising, the main purpose of the advert is to promote one of our new products, however the URL goes something like this: http://www.domain.com/products/new-product-libra-furniture/ which is just too long for anyone to remember, I think it should be simply domain.com/libra which redirects to the product page, however how can I track this in Google Analytics? if using a 301 that's impossible? Any advice would be grateful.
Intermediate & Advanced SEO | | Paul780 -
How long for new pages to rank
Hi Guys, Our website has some really good serps for our established keyword phrases some of which are quite competitive. We recently acquired and have begun selling some new brands through our online shop and launched new pages for these brands around 2 months ago. They are quite competitive ("merrell shoes" and "timberland boots" for example in google.co.uk) terms. Do you think we should get some keyword rich links built into these new pages from external sites such as blogs - or is there chances of ranking well driven more off our overall site authority/link profile? In other peoples experience, what is a typical realistic timeframe to start getting meaningful serps on new pages/keyword phrases (I know that is hard to answer - but ball parks figures appreciated). Thank you everyone in advance. Kind Regards (and happy thanksgiving to our US friends)
Intermediate & Advanced SEO | | ConradC
Conrad Cranfield0