Staging site - Treated as duplicate?
-
Last week (exactly 8 days ago to be precise) my developer created a staging/test site to test some new features. The staging site duplicated the entire existing site on the same server.
To explain this better: my site address is www.mysite.com
The path of the new staging site was www.mysite.com/staging
I realized this only today and immediately blocked the folder in robots.txt and put a noindex, nofollow on the entire duplicate folder on the server, but I assume Google will have indexed the duplicate content by now?
So far I do not see any significant drop in traffic, but should I be worried? And what, if anything, can I do at this stage?
-
Yes, it would show up in your analytics as an active user but the fact that the query returns no results means it's not been indexed. All good.
Peter
-
Hey Peter,
The Analytics code could have helped to get the site indexed. So could a Google +1/Facebook Like/share/Stumble/etc. button clicked by mistake.
@Rajat
Doing the search Peter suggested should return any indexed page.
-
Got it. No, no results show up, but interestingly, when I go to www.mysite.com/staging, it does show up as 1 active user in the Analytics report, which is what got me worried and made me aware of this problem.
-
Hi Rajat,
No, what I mean is: put the following query into the search box
site:<yourdomainname>/<yourstagingfolder>
where yourdomainname is your domain name (e.g. mysite.com) and yourstagingfolder is your staging folder (e.g. staging), so like this:
site:mysite.com/staging
Peter
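
For anyone who wants to build that query programmatically, here is a minimal sketch. The helper names `site_query` and `search_url` are made up for illustration, and the Google search URL format is assumed to be the usual `?q=` form; nothing here comes from an official API.

```python
from urllib.parse import quote_plus


def site_query(domain, folder=""):
    """Build a site: query with no space between 'site:' and the domain."""
    # Drop a scheme such as https:// if one was pasted in, and trim slashes.
    host = domain.split("://")[-1].strip("/")
    path = folder.strip("/")
    return f"site:{host}/{path}" if path else f"site:{host}"


def search_url(query):
    """Return a search URL for the query (assumed URL format)."""
    return "https://www.google.com/search?q=" + quote_plus(query)


query = site_query("mysite.com", "staging")
print(query)
print(search_url(query))
```

If the search returns no results, that suggests the staging pages have not been indexed.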
-
Thanks Pete. When I search for mysite.com/staging on Google, I only see mysite.com as the first result, and nothing at all on staging. Is that what you mean I should check?
-
Hi Rajat
The Analytics code may have given Google some signals about pages to index, but to test whether the staging server's pages are in Google, use site:mysite.com/staging (NB: no space between site: and the domain name).
Peter
-
Thanks Federico. That's reassuring. Also, a related point: since the whole site was duplicated, so was the Google Analytics code.
Does that have any impact?
Also, is there a way to check whether the test server was in fact indexed?
-
Hi Rajat
I agree with Federico. Also, if there was no active link on mysite.com to mysite.com/staging then it's unlikely Google would have found it unless the staging site had been submitted to Google via a sitemap for indexing. You should be fine.
Peter
-
You have taken the necessary steps (disallowing the folder in robots.txt and setting a noindex tag). There shouldn't be anything to worry about. If you want to be entirely sure, you can add HTTP authentication to the folder so that only those who know the credentials can access it (some robots may not respect the disallow rule or the noindex tag).
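
To double-check both steps, here is a minimal sketch using only the Python standard library. The robots.txt content, the URLs, and the helper names `is_blocked` and `has_noindex` are assumptions for illustration; substitute your real robots.txt and page HTML.

```python
from html.parser import HTMLParser
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content; replace with the text of your own file.
ROBOTS_TXT = """\
User-agent: *
Disallow: /staging/
"""


def is_blocked(robots_txt, url, agent="Googlebot"):
    """Return True if the robots.txt rules disallow this URL for the agent."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch(agent, url)


class RobotsMetaParser(HTMLParser):
    """Collect the directives from any <meta name="robots"> tag."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        if tag == "meta" and (attr.get("name") or "").lower() == "robots":
            content = attr.get("content") or ""
            self.directives.update(d.strip().lower() for d in content.split(","))


def has_noindex(html):
    """Return True if the page carries a robots meta tag with noindex."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return "noindex" in parser.directives


page = '<head><meta name="robots" content="noindex, nofollow"></head>'
print(is_blocked(ROBOTS_TXT, "https://www.mysite.com/staging/index.html"))
print(has_noindex(page))
```

Worth keeping in mind: a crawler that honors the Disallow rule does not fetch the page, so it never reads the noindex tag; the two checks are independent of each other.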