URL Parameters
-
Hi there, I have a Magento sort-by feature that has caused loads of pages to be indexed in Google with URLs that have /shopby/ in them. Over 8k pages have been indexed like this. I cannot edit the robots meta tag within the page, but I have now disallowed the URLs in robots.txt. I guess this will prevent new ones being indexed but not deindex current ones?
So I looked into URL parameters. I added 'shopby' as a parameter in Webmaster Tools and told Google not to crawl any URLs containing it. Will this deindex the pages already indexed?
The only other way seems to be manually removing 8k URLs, which I do not want to do.
Any advice much appreciated. Obviously I do not want these URLs indexed, as they are weak/duplicate sort-by search pages, and I fear the Panda update would not be too kind to them long term.
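To make the robots.txt part of the question concrete, here is a minimal sketch using Python's standard-library parser. The rule text and the example.com URLs are assumptions, since the actual robots.txt is not shown in the thread, and the sketch assumes the path starts with /shopby/ because the standard-library parser only does simple prefix matching. It only answers "may a crawler fetch this URL?", which is a different question from "is this URL in the index?".

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rule; the real robots.txt from the thread is not shown.
ROBOTS_TXT = """\
User-agent: *
Disallow: /shopby/
"""


def is_crawlable(url: str, user_agent: str = "Googlebot") -> bool:
    """Return True if the hypothetical robots.txt above allows fetching url."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # Blocked from crawling, but nothing here removes it from the index.
    print(is_crawlable("https://example.com/shopby/color-red/"))
    # Ordinary pages stay crawlable.
    print(is_crawlable("https://example.com/shoes/"))
```

A disallowed URL that is already indexed can stay indexed, and because the crawler no longer fetches the page it cannot see a canonical or noindex tag placed on it either.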
-
That would be correct. What you have are "self referencing" canonical tags. That does the exact opposite of what you need it to do. It tells Google all of those pages are valid, where you need it to tell Google all of those pages are just copies of only ONE valid page.
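A quick way to check whether a page has a self-referencing canonical is to pull the tag out of the HTML and compare it with the page's own URL. This is only a rough sketch with the standard library; the class name, function name, and example.com URLs are made up for illustration, and the comparison is a plain string match that ignores a trailing slash.

```python
from html.parser import HTMLParser
from typing import Optional


class CanonicalFinder(HTMLParser):
    """Records the href of the first <link rel="canonical"> it sees."""

    def __init__(self) -> None:
        super().__init__()
        self.canonical: Optional[str] = None

    def handle_starttag(self, tag, attrs):
        if tag != "link" or self.canonical is not None:
            return
        attributes = dict(attrs)
        rel_values = (attributes.get("rel") or "").lower().split()
        if "canonical" in rel_values:
            self.canonical = attributes.get("href")


def is_self_referencing(page_url: str, html: str) -> bool:
    """True if the page's canonical tag points back at the page itself."""
    finder = CanonicalFinder()
    finder.feed(html)
    if finder.canonical is None:
        return False
    return finder.canonical.rstrip("/") == page_url.rstrip("/")


if __name__ == "__main__":
    url = "https://example.com/shoes/shopby/color-red/"  # hypothetical URL
    page = f'<head><link rel="canonical" href="{url}"></head>'
    print(is_self_referencing(url, page))
```

If this returns True for the /shopby/ pages, the canonical is telling Google each sorted page is its own original, which is the situation described above.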
-
Yes.
The idea of a canonical is to point it at another page; many people just don't get this.
-
Hi guys. Well, the site has been set up so every page has a unique canonical tag, the canonical tag being the URL it is on.
I guess I need to find a way in Magento to make all /shopby/ URLs have the same canonical tag, and then they will deindex once Google recrawls?
-
Assuming you have your canonicals done correctly, the pages will disappear in time.
The pages you want to de-index should have a canonical tag that points to the original.
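As a rough illustration of "points to the original", the sketch below maps a sort-by URL to the listing it was generated from and builds the tag for it. It assumes the original page is everything before the /shopby/ segment, which may not match how a given Magento setup builds its URLs. The function names and example URLs are hypothetical, and this is not Magento code, just the mapping a template or extension would need to produce.

```python
from html import escape
from urllib.parse import urlsplit, urlunsplit


def canonical_target(url: str, marker: str = "/shopby/") -> str:
    """Return the assumed original URL for a sort-by URL.

    Assumption: the original listing is the path before `marker`.
    URLs without the marker are returned unchanged.
    """
    parts = urlsplit(url)
    index = parts.path.find(marker)
    if index == -1:
        return url
    # Drop the sort-by segment, plus any query string and fragment.
    original_path = parts.path[:index] + "/"
    return urlunsplit((parts.scheme, parts.netloc, original_path, "", ""))


def canonical_link_tag(url: str) -> str:
    """Build the <link> element a sort-by page would carry in its <head>."""
    href = escape(canonical_target(url), quote=True)
    return f'<link rel="canonical" href="{href}" />'


if __name__ == "__main__":
    print(canonical_link_tag("https://example.com/shoes/shopby/color-red/?dir=asc"))
```

With every sorted variation of a listing carrying the same tag, Google can fold them into the one page once it recrawls them, which also means those URLs need to stay crawlable.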
-
Hi there, the canonical tags are there but the pages are still indexed.
No links point to these pages; they are just sort-by URLs generated by a widget.
-
I would not de-index the pages with either robots.txt or WMT.
Any links on your site that point to these pages will then pour their link juice into unindexed pages.
Use a canonical tag to fix the problem.