My automated build system is creating a duplicate website
-
Because of the tools my company is using for CI/CD (A CI/CD pipeline helps you automate steps in your software delivery process, such as initiating code builds, running automated tests, and deploying to a staging or production environment.) an extra URL is generated. The canonical for the generated site is that of our main website, but other than that it is the same website.
- Could this new URL compete with our website?
- Will Google count it against us since it is the same content BUT with canonical (it is not noindex-ed)?
- Does it matter?
- Surely others are using this method?
Answers/thoughts will be greatly appreciated. Thank you.
-
Do you have any control over the CI/CD pipeline URL?
If you control the domain enough so that you can be one to have validated and searched console them by all means. But it does not seem like you have the ability to control domain?
my correct?
https://support.google.com/webmasters/answer/7440203?hl=en
If the domain is 3ed party domain then you must trust the third-party or if you control the domain of pages which links or third-party domain URLs are embedded on you can add noindex nofollow
https://www.deepcrawl.com/blog/best-practice/noindex-disallow-nofollow/
I hope that helps,
Tom
-
Unfortunately, since URL is generated from the original site, I cannot change the robots.txt. It uses the same one as the main site. That would exclude adding a noindex meta tag, as well. Any other ideas?
Is there a way to add the duplicate URL to search console & tell google not to crawl?
Thank you.
-
I understand using CI cool
i agree get the bad content being made by CI blocked ASAP
“have an extra URL is generated. The canonical for the generated site is that of our main website, but other than that it is the same website.”
but it’s not the same content being made that will hurt you unless you’re pointing the canonicals to a similar page (get the automated content off your domain)
Remember to add using self pointing canonicals on the good pages you want to be indexed by Google or Search Engines
Hope this is of help,
Tom
-
To answer your questions:
- Technically it could compete with your current site as it's on its own domain, in reality, it's unlikely as you're canonicalizing the pages back to its original and making sure that the content itself through that way is attributed to your original site.
- What I would recommend is excluding the CI/CD site from the engines, through a robots.txt or a similar technique. That way you're making sure that the staging site itself isn't being crawled at all. In the end, I'd say there's very little upside of having that be the case currently.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Backlinks Tracking Websites/Tools/Software
I have multiple websites that I need to keep track of their backlinks. How do you guys keep track of your backlinks? What are some cool tools that you use ?
Reporting & Analytics | | AngelosS0 -
Specific Industry Website Conversion Rates: Lighting
Hi All, There's loads of info around on general retail conversion rates, but does anyone have any experience with online lighting shops and typical conversion rates? This is a highly price driven shopper, and from my experience so far they bounce around looking for the best price... We've recently taken ownership of this new site, and I'm not sure I can relate general metrics to this site... although there is lots of work to do on here! Cheers in advance.
Reporting & Analytics | | b4cab0 -
Tracking time spent on a section of a website in Google Analytics
Hi, I've been asked by a client to track time spent or number of pages visited on a specific section of their website using Google Analytics but can't see how to do this. For example, they have a "golf" section within their site and want to measure how many people either visit 5 page or more within the golf section or spend at least 6 minutes browsing the various golf section pages. Can anyone advise how if this can be done, and if so, how I go about it. Thanks
Reporting & Analytics | | geckonm0 -
Creating Meta Tags for a Web Hosting company
My client has recently had his site rewritten by a company which supplies a website/hosting/seo service. The client still wants to use me for establishing his site locally and for SEM. My problem is that the code is not accessible to me. They have set up GA tracking which feeds into the reports dashboard. There is no way of actually access the raw data. When I asked for access to the GA account I got this message from bOline solutions: "Your SEO consultant will need to create a new GA account and then add the meta tag to the reports tab of your site" Because I am still very much learning GA I do not know what meta tag they are referring too. I'm therefore stuck as to how to create it. I've started learning about "Google Tag Manager" and am working through "Digital Analytics Fundementals", I've a feeling this is just a question of terminology - can anyone more experience in the subject help me?
Reporting & Analytics | | catherine-2793880 -
Mobile Website Analytics Code and Button Tracking / Event Tracking
How to track the action on the mobile version. Action by pressing the "add comment" this code: onclick = "_gaq.push (['_trackEvent', 'comments', 'pressed'])
Reporting & Analytics | | meteorr
- Not suitable for mobile version Please help.0 -
Duplicate Content From My Own Site?!
When I ran the SEO Moz report it says that I have a ton of duplicate content. The first one I looked at was my home page. http://www.kisswedding.com/ http://www.kisswedding.com/index.html http://kisswedding.com/index.html All of the above 3 have varying internal links, page authority, and link root domains. Only the first has any external links. All of the others only seem to have 1 other duplicate page. It's a difference between the www and the non-www version. I have a verified acct for www.kisswedding.com in google webmaster tools. The non-www version is in there too but has not been verified. Under settings for the verified account (www.kisswedding.com), "Don't set a preferred domain" is checked off. Is that my mistake. And if so, which should I select? The www version or the non-www version? Thanks!
Reporting & Analytics | | annasus0 -
Using Clients GA account for their over all website to Optimize Their Blog? or is there a better way to compare apples to apples
I would like to optimize my clients blog. If I am using their GA account for their over all website but put in the sub folder for SEOMOz, will it detect that I only want to optimize the blog ex. http://xxx.com/blog or will I be seeing the GA for the entire site on SEOMOZ?
Reporting & Analytics | | CliffordC0 -
Strange Visitors To Website
OK, not quite sure how this is happening, but......... I am having referral traffic from online game sites. Actually quite a bit of it and it seems to be raising my bounce rate a bit. B Suggestions anyone? Below is my website: http://www.allianceconcretepumps.com Thank You!!
Reporting & Analytics | | APICDA0