Staging site - Treated as duplicate?
-
Last week (exactly 8 days ago to be precise) my developer created a staging/test site to test some new features. The staging site duplicated the entire existing site on the same server.
To explain this better -My site address is - www.mysite.com
The path of the new staging site was www.mysite/staging
I realized this only today and have immediately restricted robot text and put a no index no follow on the entire duplicate server folder but I am sure that Google would have indexed the duplicate content by now?
So far I do not see any significant drop in traffic but should I be worried? and what if anything can I do at this stage?
-
Yes, it would show up in your analytics as an active user but the fact that the query returns no results means it's not been indexed. All good.
Peter
-
Hey Peter,
The Analytics code could have helped to get the site indexed. Or even a G +1/Facebook Like/share/Stumble/etc button clicked by error.
@Rajat
Doing the search Peter suggested should return any indexed page.
-
Got it. No, no results show up but interestingly when I go to www.mysite.com/staging, it does show up as 1active user on analytic report, which is what got me worried and made me realize of this problem.
-
Hi Rajat,
No what I mean is put the following query into the search box
site:<yourdomainname>/<yourstagingfolder></yourstagingfolder></yourdomainname>
where yourdomainname is your domain name (e.g. mysite.com) and yourstagingfolder is your staging folder (e.g. staging), so ike this:
site:mysite.com/staging
Peter
-
Thanks Pete. When I search for mysite.com/staging on google, I only see mysite.com as first result...and nothing at all on staging. Is that what you mean I should check?
-
Hi Rajat
The analytics code may have given some signals to Google of pages to index but to test it the staging server's pages are in Google use site:mysite.com/staging (NB. no spaces between site and the domain name).
Peter
-
Thanks Federico. That's re-assuring. Also, a related point, since the whole site was duplicated, so was the Google analytic code.
Does that have any impact?
Also, is there a way to check if the test server was in fact indexed or not?
-
Hi Rajat
I agree with Federico. Also, if there was no active link on mysite.com to mysite.com/staging then it's unlikely Google would have found it unless the staging site had been submitted to Google via a sitemap for indexing. You should be fine.
Peter
-
You have done the necessary steps (disallowing in robots plus setting a noindex tag). There's shouldn't be anything to worry about. If you want to be entirely sure, you can add some HTTP authentication to the folder so only those knowing the credentials can access (you could find that some robots may not follow the disallow flag or noindex tag).
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How long does google takes to crawl a single site ?
lately i have been thinking , when a crawler visits an already visited site or indexed site, whats the duration of its scanning?
Algorithm Updates | | Sam09schulz0 -
Anyone suspect that a site's total page count affects SEO?
I've been trying to find out the underlying reason why so many websites are ranked higher than mine despite seemingly having far worse links. I've spent a lot of time researching and have read through all the general advice about what could possibly be hurting my site's SEO, from page speed to h1 tags to broken links, and all the various on-page SEO optimization stuff....so the issue here isn't very obvious. From viewing all of my competitors, they seem to have a much higher number of web pages on their sites than mine does. My site currently has 20 pages or so and most of my competitors are well in the hundreds, so I'm wondering if this could potentially be part of the issue here. I know Google has never officially said that page number matters, but does anyone suspect that perhaps page count matters towards SEO and that competing sites with more total pages than you might have an advantage SEOwise?
Algorithm Updates | | ButtaC1 -
Placement of /p/ in URL structure for ecommerce site product URLs
Hi, We're a discussion about how to structure a clients ecommerce site product page URLs where 12345 represent the product SKU/number: https://domain.com/Item--i-12345 https://domain.com/product-name/p/12345 https://domain.com/p/12345 It's a toss up between the second and the third URL, but the SEO company is saying the third is best because of the placement with the /p/ and creating a silo for "products" that help search engines recognize it is a product. Does anyone have thoughts on this? Thanks!
Algorithm Updates | | AliMac260 -
Links from high Domain authority sites
I have a relatively uncompetitive niche ranking around number 6 for my keywords. Would getting a few links from some Moz DA 80-90 and DA 90-100 sites help my rankings a lot? Some of the pages linking to me from these sites might be deep in the site pretty far away from the home page with pagerank of "unranked" or a grayed out bar and these pages linking to me might not have many links at all other than from the internal links of the site itself and would have a Moz PA of 10 or 20. Would these pass much pagerank or authority to my site or would they not be worth going after? These links to my site would be in context on a blog. Thanks mozzers!
Algorithm Updates | | Ron100 -
Post penguin & panda update. what would be a good seo strategies for brand new sites
Hi there. I have the luxury of launching a few sites after the penguin and panda updates, so I can start from scratch and hopefully do it right. I will get SEO companies to help me with this so i just want to ask for advices on what would be a good strategies for a brand new site. my understand of the new updates is this content and user experience is important, like how long they spend, how many pages etc social media is important. we intent to engage FB and twitter alot. in New Zealand, not too many people use google+ so we will probbaly just concentrate on the first two hopefully we will try to get people to share our website via social media, apparent that is important should only concentrate on high quality backlinks with a good diverse set of alt tags, but concentrate on branding rather than keywords. Am i correct to say that so far? if that is the principle, what would be the strategy to implement these goals? Links to any articles would also be great please. Love learning. i just want to do this right and hopefully try to future proof the sites against updates as possible. i guess quality content and links will most likely to be safe. Thank you for your help.
Algorithm Updates | | btrinh0 -
Our Developer Site randomly drops 10+ places in Google searches for our Company Name. Why?
Hey everyone, At Betable, we have a player-facing site and a developer-facing site. We also have a developer-facing blog. We have this issue where our developer-facing site will randomly drop 10+ places in Google's Search results for the keyword "betable". This problem can be reproduced by others and in incognito mode, so it's not just one person's results. Furthermore, the developer-facing blog and our social media accounts all suddenly rank higher than the developer site. Even stranger, this problem randomly fixes itself after a few days. This has happened twice so far, and on each occasion there were no changes to the website that would have prompted a drop in rank. After the first drop, we did our best to neutralize any SEOMoz "red alerts" but to no avail, the drop happened again last week. Can someone help us understand what's going on? Are there ways to avoid this? Thanks, Tyler
Algorithm Updates | | Betable0 -
Large site with faceted navigation using rel=canonical, but Google still has issues
First off, I just wanted to mention I did post this on one other forum so I hope that is not completely against the rules here or anything. Just trying to get an idea from some of the pros at both sources. Hope this is received well. Now for the question..... "Googlebot found an extremely high number of URLs on your site:" Gotta love these messages in GWT. Anyway, I wanted to get some other opinions here so if anyone has experienced something similar or has any recommendations I would love to hear them. First off, the site is very large and utilizes faceted navigation to help visitors sift through results. I have implemented rel=canonical for many months now to have each page url that is created based on the faceted nav filters, push back to the main category page. However, I still get these damn messages from Google every month or so saying that they found too many pages on the site. My main concern obviously is wasting crawler time on all these pages that I am trying to do what they ask in these instances and tell them to ignore and find the content on page x. So at this point I am thinking about possibly using robots.txt file to handle these, but wanted to see what others around here thought before I dive into this arduous task. Plus I am a little ticked off that Google is not following a standard they helped bring to the table. Thanks for those who take the time to respond in advance.
Algorithm Updates | | PeteGregory0 -
Google seems to have penalised one section of our site? Is that possible?
We have a page rank 5 website and we launched a new site 6 months ago in February. Initially we had horrible urls with a bunch of numbers and stuff and we since changed them to lovely human readable urls. This had an excellent effect across the site except on one section of the site: http://www.allaboutcareers.com/careers/graduate-employers Although Google has indexed these pages and several have a PR 2 they do not appear in Google when previously they were on page 1 when we had the old urls. We figured we just needed some time for Google to get used to it, but it hasn't done anything. It is also worth mentioning we changed the page titles from: FIRM NAME | DOMAIN NAME then... FIRM NAME | Graduate Scheme, Jobs, Internships & Apprenticeships | DOMAIN NAME then.. FIRM NAME | Graduate Scheme, Jobs, Internships & Apprenticeships Do you think these are being penalised? There are two types of page: Example A: http://www.allaboutcareers.com/careers/graduates/addleshaw-goddard.htm Example B: http://www.allaboutcareers.com/careers/graduates/accenture.htm
Algorithm Updates | | jack860