My automated build system is creating a duplicate website
-
Because of the tools my company is using for CI/CD (A CI/CD pipeline helps you automate steps in your software delivery process, such as initiating code builds, running automated tests, and deploying to a staging or production environment.) an extra URL is generated. The canonical for the generated site is that of our main website, but other than that it is the same website.
- Could this new URL compete with our website?
- Will Google count it against us since it is the same content BUT with canonical (it is not noindex-ed)?
- Does it matter?
- Surely others are using this method?
Answers/thoughts will be greatly appreciated. Thank you.
-
Do you have any control over the CI/CD pipeline URL?
If you control the domain enough so that you can be one to have validated and searched console them by all means. But it does not seem like you have the ability to control domain?
my correct?
https://support.google.com/webmasters/answer/7440203?hl=en
If the domain is 3ed party domain then you must trust the third-party or if you control the domain of pages which links or third-party domain URLs are embedded on you can add noindex nofollow
https://www.deepcrawl.com/blog/best-practice/noindex-disallow-nofollow/
I hope that helps,
Tom
-
Unfortunately, since URL is generated from the original site, I cannot change the robots.txt. It uses the same one as the main site. That would exclude adding a noindex meta tag, as well. Any other ideas?
Is there a way to add the duplicate URL to search console & tell google not to crawl?
Thank you.
-
I understand using CI cool
i agree get the bad content being made by CI blocked ASAP
“have an extra URL is generated. The canonical for the generated site is that of our main website, but other than that it is the same website.”
but it’s not the same content being made that will hurt you unless you’re pointing the canonicals to a similar page (get the automated content off your domain)
Remember to add using self pointing canonicals on the good pages you want to be indexed by Google or Search Engines
Hope this is of help,
Tom
-
To answer your questions:
- Technically it could compete with your current site as it's on its own domain, in reality, it's unlikely as you're canonicalizing the pages back to its original and making sure that the content itself through that way is attributed to your original site.
- What I would recommend is excluding the CI/CD site from the engines, through a robots.txt or a similar technique. That way you're making sure that the staging site itself isn't being crawled at all. In the end, I'd say there's very little upside of having that be the case currently.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Is there an easy way to switch hundreds of websites to https in GSC?
My company has hundreds of websites setup in Google Search Console but will soon be moving them all to secure domains. Is there an easy way to make the switch in GSC or do we have to change the address one by one?
Reporting & Analytics | | MJTrevens0 -
Links not pointing to my website?
Aloha everyone! I've asked questions to the Moz community before and have always been happy to see how responsive and helpful the community is as a SEO learning new comer! Many thanks in advance for any ideas or insights to solving this problem. So here is my question, I own a wedding photography business here on Kauai and am looking to rank for a few things, Kauai wedding photographer, Kauai Wedding videographer, and Kauai family photographer. That aside I've been pouring more time into getting back links from people I've shot with here on Kauai (local business's websites etc) and have had links dropped in on their sites to, in theory, bump my rankings up from back linking my website. Here's my website http://www.balihaiphoto.com/ Here's a few sites but are not showing up as backlinks/pushing my site higher on SEO results (screen shot as well) http://www.haleleakeiki.org/ (bottom of page footer linking to BaliHaiPhoto.com) and another site here http://www.wishingwellshaveice.com/ with the same thing, footer link pointing to my website. Both sites have been indexed and the shave ice one gets a good amount of traffic, does anyone know why this isn't showing as a back link or if its passing on any link juice my way? Should I do something diffrent here? Let me know what you guys think! Aloha from Kauai, Jon Gibb qsDOM
Reporting & Analytics | | Trey30 -
What is The Bounce Rate of Single Page Website?
Hi All, I just want to clear some of my confusion regarding bounce rate. Bounce rate depends upon time. If yes than how? What will be the bounce rate for single page website. Single page website will have same bounce rate and exit rate?
Reporting & Analytics | | RuchiPardal0 -
Alexa ranking certification will be usefull to handle the website ?
I have job portal site , i have idea to try alexa certification , Alexa certification will be useful ?
Reporting & Analytics | | jobtardis0 -
Duplicate content warnings
I have a ton of duplicate content warnings for my site poker-coaching.net, but I can't see where there are duplicate URLs. I cannot find any function where I could check the original URL vs a list of other URLs where the duplicate content is?
Reporting & Analytics | | CatfishTPA0 -
Duplicate Url with Google shopping feed
In webmaster tool I have many duplicate url tagged as google_shopping Obviously i'm tagging the url with the goog url builder Url: elettrodomestici.yeppon.it/cura-corpo/tagliacapelli/remington-tagliacapelli-funzionamento-rete-ricaricabile-lame-in-acciaio-inox-hc5150-garanzia/ Duplicate url: elettrodomestici.yeppon.it/cura-corpo/tagliacapelli/remington-tagliacapelli-funzionamento-rete-ricaricabile-lame-in-acciaio-inox-hc5150-garanzia/?utm_source=google_shopping&utm_medium=web&utm_content=Elettrodomestici+e+Clima+%3E+Cura+del+corpo+%3E+Tagliacapelli&utm_campaign=google_shopping How can I solve it? Thanks
Reporting & Analytics | | yeppon0 -
Open explorer is not finding my website information yet?
Majestic SEO and even yahoo sitemap see's it... why not SEOMOZ? The website is a few months old already. www.lackofsleephq.com
Reporting & Analytics | | sleepmaster0