URL Parameters
-
Hi there, I have a magento sort by feature which has indexed loads of pages in Google with urls that have /shopby/ in them.Over 8k pages have been indexed like this. I cannot edit the robots within the page but have now disallowed the urls in robots.txt - i guess this will prevent new ones being indexed but not deindex current ones?
So I looked into URL parameters, I added 'shopby' as a parameter in webmaster tools and told Google not to crawl any urls with this in it, will this deindex the pages already indexed?
The only other way seems to be manually removing 8k urls, which i do not want to do.
Any advice much appreciated. Obviously I do not want these urls indexed as they are weak/duplicate sort by search pages, I fear the panda update would not be too kind on it long term?
-
That would be correct. What you have are "self referencing" canonical tags. That does the exact opposite of what you need it to do. It tells Google all of those pages are valid, where you need it to tell Google all of those pages are just copies of only ONE valid page.
-
Yes.
The idea of having a canonical is to point it to another page, many just don't get this
-
Hi guys. Well the site has been setup so every page has a unique canonical tag, the canonical tag being the url it is on.
I guess I need to find a way in magento to make all /shopby/ urls have the same canonical tag then it will deindex once Google recrawl?
-
Assuming you have your canonicals done correctly, the pages will disappear in time.
the pages you wont to de-index, should have a canonical tag that points to the original.
-
Hi there, the canonical tags are there but the pages are still indexed.
No links point to these pages, they are just sort by urls being generated off a widget.
-
I would not de index the page either with robots or WMT.
links in your site that point to any of these pages will now pour their link juice into un indexed pages.
use a canonical tag to fix the problem.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Unsolved URL Crawl Reports providing drastic differences: Is there something wrong?
A bit at a loss here. I ran a URL crawl report at the end of January on a website( https://www.welchforbes.com/ ). There were no major critical issues at the time. No updates were made on the website (that I'm aware of), but after running another crawl on March 14, the report was short about 90 pages on the site and suddenly had a ton of 403 errors. I ran a crawl again on March 15 to check if there was perhaps a discrepancy, and the report crawled even fewer pages and had completely different results again. Is there a reason the results are differing from report to report? Is there something about the reports that I'm not understanding or is there a serious issue within the website that needs to be addressed? Jan. 28 results:
Reporting & Analytics | | OliviaKantyka
Screen Shot 2022-03-16 at 3.00.52 PM.png March 14 results:
Screen Shot 2022-03-15 at 10.31.22 AM.png March 15 results:
Screen Shot 2022-03-15 at 4.06.42 PM.png0 -
Direct traffic coming to URLs with /rss_feedIP#
I'm doing a site audit for an organization that has a bunch of really messy old Drupal sites. In looking at their traffic, I see that a majority of it is coming to landing pages that look like this: http://clientsdomain.com/rss_feed173.8.208.97 plus other IP addresses. The bounce rate is 100% and time on site is less than a second. It looks like something that an RSS feed tool might use, but I've never seen something like it before. It creates its own landing page, hits the site, then appears to bounce. This is making their Analytics data look a whole lot worse than the site is actually doing, since the bounce rate is 100% on all that fake traffic. I have some experience with Drupal, but I've never seen anything like this in Drupal or any other CMS. Has anyone out there ever experienced something like this, where direct traffic comes to an rss feed landing page and bounces immediately?
Reporting & Analytics | | newwhy0 -
How to get multiple pages to appear under main url in search - photo attached
How do you get a site to have an organized site map under the main url when it is searched as in the example photo? SIte-map.png
Reporting & Analytics | | marketingmediamanagement0 -
How to setup Google Analytic goals that go to different urls?
I would like to setup my form's conversion rate as a goal in my Google Analytics account. The goal is complete when someone enters their information and hits submit. Logically, the "Thank you page" would be the destination url for the goal. My problem is that the thank you page is outside of the primary domain, for example, my site is: SiteExampl.com but the Thank you URL is thankyouurl.com Is it possible to setup a goal like this? If so how. Thanks in advance!
Reporting & Analytics | | christinarule0 -
Regex Filter To Exlude lower case urls
Buon Pormeriggio from Wetherby 22 degrees the summer continues! I need to set up a regex filter to knock out lowecase versions of http://www.sandersonweatherall.co.uk/Sales/ Thing is Analytics is returning this lowercase version which i want to regex filter out.So if Regex filter /Sales/$ returns what i eant how do i knock out urls beginning with lowe case s. Grazie,
Reporting & Analytics | | Nightwing
David0 -
Webmaster Not found URLs
Dear All, I would really like help with this. Due to some unknown reason (another thread is open for this reason), my google webmaster is showing 7000 not found URL's. Now, when i try to find out the day these broken URLs were detected, webmaster is showing dates between November 2011 to December 2nd, 2011. I havent found a single not found error showing after 2nd December 2011. So does that mean that the mistake has been solved? Because daily webmaster is adding 200-300 not found URLs. Along with this, my traffic has dropped drastically since 12th December and has still not recovered. Are these not found URLs the reason for this sudden traffic drop? If so, then i m ready to find someone for paid seo to remove this error. I would love to have some concrete answers for these questions. Thanksss
Reporting & Analytics | | hith2340 -
Strange 404 Error URL
Can anyone help determine how a URL like "www.mycompany.com/lago_www.bad-nsfw-content.com" would appear on the "not found" crawl error list in Google Webmaster Tools? The "www.bad-nsfw-content" site has nothing to do with our company and I don't how it would get associated with our site.
Reporting & Analytics | | pbhatt0 -
Tagging URLs Linkbuilding and anchor links
Hi, I am going to publish a press release on a number of different websites. First and foremost, I want to build anchor links back to website for specific keywords. Secondly I want to measure clickthrus from each site using parameter tracking in GA. I want to know if I put in a url with ?utm_source=xxx, will this have any impact upon my linkbuilding efforts? i.e. will search engines attribute the keyword to the long url with tracking or the url without tracking. I understand that everything from the ? mark is ignored. However, i just want to double check before I publish release. Thanks for your help. Mik
Reporting & Analytics | | increation0