Best-practice URL structures with multiple filter combinations
-
Hello,
We're putting together a large piece of content that will have some interactive filtering elements. There are two types of filters, topics and object types.
The architecture under the hood constrains us so that everything needs to be in URL parameters. If someone selects a single filter, this can look pretty clean:
www.domain.com/project?topic=firstTopic
or
www.domain.com/project?object=typeOneThe problems arise when people select multiple topics, potentially across two different filter types:
www.domain.com/project?topic=firstTopic-secondTopic-thirdTopic&object=typeOne-typeTwo
I've raised concerns around the structure in general, but it seems to be too late at this point so now I'm scratching my head thinking of how best to get these indexed. I have two main concerns:
- A ton of near-duplicate content and hundreds of URLs being created and indexed with various filter combinations added
- Over-reacting to the first point above and over-canonicalizing/no-indexing combination pages to the detriment of the content as a whole
Would the best approach be to index each single topic filter individually, and canonicalize any combinations to the 'view all' page? I don't have much experience with e-commerce SEO (which this problem seems to have the most in common with) so any advice is greatly appreciated.
Thanks!
-
Thanks for the detailed answer Jonathan. What you suggested was definitely in line with my thinking - indexing just the single topics at most and trying to either noindex or canonicalize all the thousands of possible variations. I definitely agree that all those random combinations of topics/objects hold no real value and at best will eat up crawl budget unnecessarily.
I can make sure Google treats these parameters as URLs via Search Console, they're unique to this piece of content; and I think I can noindex all the random combinations of filters (hopefully).
I'm still waiting to hear more from the dev team but I have a feeling that I won't be able to change the format to subdirectories instead of differentiating everything with query parameters - not the ideal situation but I'll have to make do.
Anyways, thanks again for your thoughtful reply!
Josh
-
Google is supposed to disregard everything after the ? in the query string when indexing. However, I know at times query strings will get indexed if the content on the query stringed URL appears different enough to Google. So I would agree with your motive to try to get these dynamic URLs simplified.
From what i have read on similar scenarios, and my first thought is, do these filtered view pages benefit searchers? Typically it benefits searchers to index maybe the category level of pages. In your instance, this may be the first topic. But once URLs start referencing very specific content that one user was filtering for, I would probably suggest a noIndex meta tag. There should be a scalable solution to this so you don't have to individual go into every filtered page possibility and add noIndex to the head.
If there is a specific filtered view you believe may benefit searches, or you have already seen a demand for, I would suggest making this a page using subfolders
www.domain.com/project/firstTopic/typeOne
and noIndexing all the crazy dynamically generated query string URLs. This should allow you to seize opportunities where you see search demand and alleviate any duplicate content risks.
If you don't want to noIndex, I would gauge the quality of these nitty gritty filtered pages, and if you see value in them, I would agree canonicalizing to the preceding category page sounds like a good solution.
I think this article does a good job explaining this. It suggests that if your filters are just narrowing content on the page rather than changing it, to noIndex or canonicalize (Although, the author does remind you that canonicalization is only a suggestion to Google and is not followed 100% of time for all scenarios).
I hope this helps, and if you don't see how these solutions would be implemented on your site, this issue may require some dev help.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Merger & Acquisition Best Practices
Our company (DA 40) recently acquired another company (DA 20). The domain for the acquired company is up, and I am being asked if we should keep paying for the domain. I have a couple of questions regarding this. Is it best practice to continue paying for the older domain and do 301 redirects to our website? If yes, do I have to add 301 redirects for individual matching pages? For example, if the old site has the topic XYZ, should a 301 go to our site if we have a similar page topic XYZ? What about contact pages, about us, etc? Do we redirect all non-matching topics to the new home page? Or in the case of their blogs, do we redirect their blogs to the new blog home page even though we are not keeping their old blogs? What if our company keeps acquiring other companies, are we to assume we have to keep paying for the domains of the acquired companies? Is there anyone out there that would say stop paying for the old domain and just review the company's inbound links, reach out to those sites, and make sure they link to the new site instead? Thanks for the help in advance.
Intermediate & Advanced SEO | | CharityHBS0 -
My url disappeared from Google but Search Console shows indexed. This url has been indexed for more than a year. Please help!
Super weird problem that I can't solve for last 5 hours. One of my urls: https://www.dcacar.com/lax-car-service.html Has been indexed for more than a year and also has an AMP version, few hours ago I realized that it had disappeared from serps. We were ranking on page 1 for several key terms. When I perform a search "site:dcacar.com " the url is no where to be found on all 5 pages. But when I check my Google Console it shows as indexed I requested to index again but nothing changed. All other 50 or so urls are not effected at all, this is the only url that has gone missing can someone solve this mystery for me please. Thanks a lot in advance.
Intermediate & Advanced SEO | | Davit19850 -
Process for moving existing articles to new structure (URLs, titles, etc)
I am in the midst of a major redesign of my site, including revamping existing articles . I have a couple of hundred articles and I am reviewing all aspects of these articles, including titles, URLs, content, etc. I am putting together a process as I move each article across to the new site and have SEO very much in mind. I'd appreciate any feedback on this. First off, let me be clear that I consider the quality of the content paramount. Anything suggested below is considered "supporting" (that content) from an SEO perspective. But, since I am moving this content across, I may as well take the opportunity to clean things up. The existing articles don't have particularly good SEO-related attributes, in terms of their titles, URLs, use of keywords and so on. So, I plan to do the following for each article. For illustrative purposes (our site serves the wedding industry), I will use an article about how to involve children at a wedding. Questionsunder each bullet. Use the "Keyword Difficulty" feature on Moz Pro to research a specific keyword for each article. In the example case I used "involving children in our wedding". Honestly, I am not really sure what to do with this feature 🙂 I've read everything from "focus on the long tail" to "don't fear highly competitive keywords". So, my current thinking is merely to use it as interesting information for they keyword I choose but not actually make any specific decisions from that ie. make sure the keyword is relevant to the article as the first priority and use the tool to check out search volume. Not sure what I should read into a zero for recent Bing searches. Is that really an important factor? I'm assuming the Google information is not available from Google (it would be displayed here otherwise, I'm guessing) Use a title that uses these keywords. In this case, I simply went with "Involving children in our wedding". Same for URL - /wedding-guests/involving-children-in-our-wedding If I have a reasonable, short and human-friendly term like this (I can do this with virtually every article quite easily), is there any reason why the URL and the title should not be the same? In short, the title and URL are both a relatively concise "mini-sentence" Make sure the meta description of the article is easy-to-read (for humans) and uses the keyword (sentence) Make sure that the theme (we are moving to WordPress) uses H1 for the page header/title and H2 for sections within the document Implement 301 redirects from the old URL (old site) to the new URL This seems like a pretty obvious approach for articles where the URL has changed (which will be most of them). But what do I do with articles that I am going to remove. Should I redirect (301) to a related article (so at least the visitor ends up on a page that is generally relevant) or just let this "fall through" as a non-existent page (401)? As I say, I have 200+ articles to go through I want to make sure I am taking this advantage to clean things up. Anything leaping out as missing/problematic? Thanks in advance Mark
Intermediate & Advanced SEO | | MarkWill0 -
Attack of the dummy urls -- what to do?
It occurs to me that a malicious program could set up thousands of links to dummy pages on a website: www.mysite.com/dynamicpage/dummy123 www.mysite.com/dynamicpage/dummy456 etc.. How is this normally handled? Does a developer have to look at all the parameters to see if they are valid and if not, automatically create a 301 redirect or 404 not found? This requires a table lookup of acceptable url parameters for all new visitors. I was thinking that bad url names would be rare so it would be ok to just stop the program with a message, until I realized someone could intentionally set up links to non existent pages on a site.
Intermediate & Advanced SEO | | friendoffood1 -
Best way to implement canonical tags on an ecommerce site with many filter options?
What would be the best way to add canonical tags to an ecommerce site with many filter options, for example, http://teacherexpress.scholastic.com? Should I include a canonical tag for all filter options under a category even though the pages don't have the same content? Thanks for reading!
Intermediate & Advanced SEO | | DA20130 -
Question about best approach to site structure
I am curious if anyone can share some advice. I am working on planning architecture for a tour company. The key piece of the content strategy will be providing details on each of the tour destinations, with associated profiles for each city within those destinations. Lots of content, which should be great for the SEO strategy. With regards to the architecture, I have a ‘destinations’ section on the Website where users can access each of the key destinations served by the tour company. My question is – from a planning perspective I can organize my folder structure in a few different ways. http://www.companyurl.com/destinations/touring-regions/cities/ or http://www.companyurl.com/destinations/ http://www.companyurl.com/touring-regionA/ http://www.companyurl.com/touring-regionB/cities-profile/ I am curious if anyone has an opinion on what might perform best in terms of the site structure from an SEO perspective. My fear is taking all of this rich content and placing it so many tiers down in the architecture of the site. Any advice that could be offered would be appreciated. Thanks.
Intermediate & Advanced SEO | | VERBInteractive0 -
Google Maps results doesn't show my site url but rather the maps url, why is this?
For several of my clients landing pages that show up in the Maps results the website url has been overwritten by the maps url (maps.google.com). Even though on my places page I have the correct website set up. Does anyone have any idea why they would be doing this and how I can correct it? Thanks kinldy in advance, Aaron. maps-url.png
Intermediate & Advanced SEO | | afranklin0 -
Don't want to lose page rank, what's the best way to restructure a url other than a 301 redirect?
Currently in the process of redesigning a site. What i want to know, is what is the best way for me to restructure the url w/out it losing its value (page rank) other than a 301 redirect?
Intermediate & Advanced SEO | | marig0