URL Parameters to Ignore
-
Hi Mozers,
**We have a glossary of terms made up of a main page that lists out ALL of the terms, and then individual pages per alphabet letter that limit the results to that specific alphabet letter. These pages look like this: **
https://www.XXXX.XXX/publications/dictionaries/XXX-terms?expand=A
https://www.XXXX.XXX/publications/dictionaries/XXX-terms?expand=B
https://www.XXXX.XXX/publications/dictionaries/XXX-terms?expand=C
https://www.XXXX.XXX/publications/dictionaries/XXX-terms?expand=D
etc.
If I'd like Google to remove all of these "expand=" pages from the index, such that only the main page is indexed, what is the exact parameter that I should ask Google to ignore in Search Console?
"expand=" ?
Just want to make sure! Thanks for the help!!!
-
I agree with what is said above, in addition you could also add the ignore parameter in GSC. As it 's basically adjusting the page content based on that. It's a bit unclear how much information that is really sending to the crawlers but it probably can't hurt.
-
Hi!
What billbill369 said is correct, but will only prevent google from crawling those pages.
My suggestion is to use canonical tags in every URL with a parameter pointing to the correct url (the one without parameters)For further reading:
SEO Best Practices for Canonical URLs + the Rel=Canonical Tag - Whiteboard Friday Consolidate duplicate URLs - Google Search Console HelpHope it helps.
Best luck.
GR. -
Useragent:* Disallow: /*?expand= This should work put it in your robots.txt
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
URL is invalid: Why?
Hello everyone, I am currently listing my company on business directories. For some websites however when I add my website URL, it comes up as URL is invalid. What could be the reason for this? I have tried different variations like www., http:// and https://. Kind Regards,
Technical SEO | | SMCCoachHire
Aqib0 -
Migration to new URL structure
Hi guys, Just wondering what your processes are when moving a large site to a completely new URL structure on the same domain. Do you 301 everything from old page to new page, or are your more selective - i.e. only 301 pages that have a certain page authority, for example. Thanks!
Technical SEO | | A_Q0 -
To include / at the end of a URL or not
Hi I have recently noticed my site works with / and the end of a URL and without. I wanted to know if there is any SEO impact on this? Will it be seen as 2 different pages? if so what is the best option to go for www.mydomain.com/page/ or www.mydomain.com/page Thanks E
Technical SEO | | Direct_Ram0 -
How to fix these unwanted URLs?
Right now i have wordpress, one page website, but google also show wp-content. KIndly check below in google. site:http://baltimoreelite.com/ How I can fix this issue?
Technical SEO | | marknorman0 -
%20 URL accessible, does this matter?
I have a rewrite on the CMS I work on. What happens here is that if someone creates a page on the website and uses spaces as the name then the CMS automatically replaces the spaces with -'s. I noticed this morning that the %20 URLs are accessible but not indexed at all. Only the - URLs are indexed. could this cause duplicate content or penalties? I know best practice is to have only ONE URL for a page but somehow the developer can't redirect the %20 URLs to the - URLs. Opinions?
Technical SEO | | DROIDSTERS0 -
Starting a Blog and URL Structure Advice
Hello SEOmoz Community, We are going to start a blog on our website and have a slight dilemma. Our site is a .Net site and the blog platform we've chosen (BlogEngine) only allows us to use the following url structure: www.domain.com/blog/post/post-name. We've looked at other .Net blog software and this one meets all of our needs except for the ideal URL Structure. We would like to remove the /post/ directory; however have not technically found a way to do it. We wanted to get some opinions on whether or not we should just start with this URL structure and not worry about the extra directory, or work to find another solution that eliminates this extra directory. Ideally we want to keep the posts as close to the root as possible for link juice distribution, and the extra directory could get in the way. Also, if anyone has any advice on a more flexible .Net blog platform, suggestions would be greatly appreciated. We thank you so much in advance for your time and help.
Technical SEO | | All-Star-Vacation-Homes0 -
Canonical URL
I previously set the canonical Url in google web masters to the non www version, when I check my on page opt, it tells me that I have a critical issue with this. Should I change it in google web masters back to the www version? if so is there the possibility of negative results? Or is there a better way to deal with this? Note, I have inbound links pointing to both types.
Technical SEO | | bronxpad0 -
What to do with extremely high number of URLs on your site?
Here is the situation: The site has tons of business and personal profiles, the information needed to be categorized as such directories were created in an attempt to keep the URL structure clean - so for example: www.abc.com/product/um/name-here/city-name/state/lastname:3458765 Each profile has a unique ID#, and for some reason there needed to be a category for a user in this case /um/ stands for user name. Webmaster tool steps to resolve state to use an rel=canonical which can be done for that directory /um/ but I am concerned about the bot not being able to find the other pages beyond that directory, like the profile name, city, state associated. So I guess my ultimate question is if I use rel=canonical will the rest of the content not get crawled or indexed as well?
Technical SEO | | TLO0