Looking for a Way to Standardize Content for Thousands of Pages w/o Getting Duplicate Content Penalties
-
Hi All,
I'll preface this by saying that we like to engage in as much white hat SEO as possible. I'm certainly not asking for any shady advice, but we have a lot of local pages to optimize :).
So, we are an IT and management training course provider. We have 34 locations across the US and each of our 34 locations offers the same courses. Each of our locations has its own page on our website. However, in order to really hone the local SEO game by course topic area and city, we are creating dynamic custom pages that list our course offerings/dates for each individual topic and city. Right now, our pages are dynamic and being crawled and ranking well within Google. We conducted a very small-scale test on this in our Washington DC and New York areas with our SharePoint course offerings and it was a great success. We are ranking well on "sharepoint training in new york/dc" etc. for two custom pages.
So, with 34 locations across the states and 21 course topic areas, that's well over 700 pages of content to maintain - A LOT more than just the two we tested. Our engineers have offered to create a standard title tag, meta description, h1, h2, etc, but with some varying components. This is from our engineer specifically:
"Regarding pages with the specific topic areas, do you have a specific format for the Meta Description and the Custom Paragraph? Since these are dynamic pages, it would work better and be a lot easier to maintain if we could standardize a format that all the pages would use for the Meta and Paragraph. For example, if we made the Paragraph:
“Our [Topic Area] training is easy to find in the [City, State] area.” As a note, other content such as directions and course dates will always vary from city to city so content won't be the same everywhere, just slightly the same.
It works better this way because HTFU is actually a single page, and we are just passing the venue code to the page to dynamically build the page based on that venue code. So they aren’t technically individual pages, although they seem like that on the web. If we don’t standardize the text, then someone will have to maintain custom text for all active venue codes for all cities for all topics. So you could be talking about over a thousand records to maintain depending on what you want customized.
Another option is to have several standardized paragraphs, such as:
“Our [Topic Area] training is easy to find in the [City, State] area.” Followed by other content specific to the location
“Find your [Topic Area] training course in [City, State] with ease.” Followed by other content specific to the location
Then we could randomize what is displayed. The key is to have a standardized format so additional work doesn’t have to be done to maintain custom formats/text for individual pages."
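To make the engineer's proposal concrete, here is a minimal Python sketch of filling the two standardized paragraphs from a venue code and topic. The function name, the sample venue code, and the idea of picking the variant from a hash (instead of a truly random pick) are assumptions for illustration, not anything the engineer specified. Hashing keeps the same variant on the same page every time it is requested, so a crawler does not see the text change between visits.

```python
import hashlib

# The two sentences are taken from the engineer's note; the list name is hypothetical.
TEMPLATES = [
    "Our {topic} training is easy to find in the {city}, {state} area.",
    "Find your {topic} training course in {city}, {state} with ease.",
]


def intro_paragraph(venue_code: str, topic: str, city: str, state: str) -> str:
    """Pick one template per (venue code, topic) pair and fill in the blanks.

    Assumption: the choice is derived from a hash of the venue code and topic,
    so it looks varied across pages but never changes for a given page.
    """
    key = f"{venue_code}:{topic}".encode("utf-8")
    index = int(hashlib.sha256(key).hexdigest(), 16) % len(TEMPLATES)
    return TEMPLATES[index].format(topic=topic, city=city, state=state)


if __name__ == "__main__":
    # "NYC01" is a made-up venue code used only for this example.
    print(intro_paragraph("NYC01", "SharePoint", "New York", "NY"))
```

Adding a third or fourth paragraph only means appending to the list, which keeps the maintenance burden the engineer was worried about at a handful of strings, not a thousand records.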
So, Mozzers, my question to you all is: can we standardize with slight variations specific to each location and topic area without getting dinged for spam or duplicate content? Oftentimes I ask myself, "If Matt Cutts were standing here, would he approve?" For this, I am leaning towards "yes," but I always need a gut check.
Sorry for the long message. Hopefully someone can help. Thank you!
Pedram
-
Duplicate content isn't the issue so much as whether each page has enough unique content to be seen as valuable on its own.
A single template paragraph probably isn't enough, but if you can include other information such as address, driving directions, phone number, photos of the facility, class sizes, school hours, etc. that should be enough unique content for each location.
You can even make the schedule an image or iframe if the duplicate content issue is a concern. Or if the schedule is identical for every location anyway, create a single schedule page, and link to it from each of the locations.
-
Trenton, thanks for the quick reply. We actually did try that suggestion with little success. I'd assume it didn't work out because of all the locations. There was no "honed" location specific content, which spurred the test I launched initially and is now showing results. Also, the pages are dynamic, meaning that content is always going to change as the courses pass in date. So, that does make it somewhat unique in and of itself.
I think I will launch the standardized, generated content but I will watch it closely. I'll certainly not just flood the web with all of our pages at once - maybe put them on our site 20-50 a month and see how that goes. This is a long term strategy, so the patience will be worth it.
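For a sense of how long that pace takes, here is a quick back-of-the-envelope calculation in Python. It assumes every one of the 34 locations gets a page for each of the 21 topic areas and ignores the two test pages that are already live; the function name is made up for the example.

```python
import math

LOCATIONS = 34
TOPIC_AREAS = 21

# One page per location and topic area combination.
total_pages = LOCATIONS * TOPIC_AREAS  # 714


def months_to_publish(total: int, pages_per_month: int) -> int:
    """Whole months needed to publish `total` pages at a fixed monthly pace."""
    return math.ceil(total / pages_per_month)


if __name__ == "__main__":
    for pace in (20, 50):
        print(f"{pace} pages/month -> {months_to_publish(total_pages, pace)} months")
```

At 50 pages a month the full set takes 15 months, and at 20 a month it takes 36, so the rollout really is a long-term plan.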
Thanks for your help. Everyone's input has really helped here. It's definitely a sticky topic. I'll try to update you all on how it goes after a few months.
-
I totally understand what you're trying to do. What I'm trying to say is that there may be another way to get this location-specific information to your users. Perhaps if you had one "sharepoint training" page, you could include all the locations there, with a schedule that changes if you hover or click on a location, but keeps you on the same page. This would likely be much safer with Google and would reduce the amount of work significantly. However, you may be losing potential SEO value without the individual pages for each location. Again, it's a balance: if you are able to create the pages without them being seen as duplicate content, then you're safe. If you can't make them unique, try to think about another method.
-
Definitely not trying to game Google. We offer classes all around the country at different schedules and times. However, if a student would like to take a class that is offered in New York but lives in, say, Atlanta, they have the option of taking the New York class by going to a local center that broadcasts the class live online, as if they were in the actual classroom, with the ability to interact with the instructor and students via our patented technology. Thus, the schedules for our courses are all basically the same because students can take them from almost anywhere. This is where the content comes into play.
Say someone is searching for "sharepoint training in new york," they'd be taken to a custom page for all of our SharePoint course trainings in the New York area for the next couple of months. The page will have location specific content. Now, if someone searches for "sharepoint training washington dc," they'd be taken to a custom page for all of our SharePoint courses in DC. However, the schedule would be the same as the one seen for NY, simply because of the student's option to take a course locally even if the course is offered somewhere else - that's the only "duplicate content" I'm worried about, even though each page does have location specific content for each topic area. Hopefully that all makes sense.
My objective is to really let a user know we have courses they're looking for in the areas we are located. It's not like we're creating pages for Houston, for example, even though a student could technically take a course from Houston, we simply are not located there.
-
Head over to oDesk and hire someone part time to write unique and relevant stuff for those pages. The standardization will help get you started, but doing work like this will take you to the next level. We just did this for about 1800 product pages and have seen significant organic traffic gains and have reduced or eliminated the thin content on those pages.
-
I would agree with the other two commenters here: you don't need to worry about duplicate meta descriptions, but each page needs to be unique to a certain extent. I'll try to add something different to this discussion: If we're talking to Google and Matt Cutts, and we're interested in white-hat only techniques, then I don't think he would suggest you create so many different pages if they aren't going to be very different. If you have many pages that aren't very different, then what value is that giving to the user? Or are you actually attempting to game Google (black-hat) by creating all these pages strictly for SEO purposes? If so, perhaps you should reevaluate your strategy.
However, if each and every location and topic is different and contains unique content such as completely different schedules and topic content, then I don't think you should have much to worry about. Just make sure that the actual content of each page is unique. Once you start creating dozens of duplicate pages, it may make more sense to try and figure out a simpler way to build out your site. You can try to balance and compare the risk of duplicate content to the benefit of having so many pages. Just focus on different content for each location and topic and you should be fine. In fact, Moz will tell you if you have duplicate content in your Crawl Diagnostics.
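If you want a rough internal check before publishing, one option is to compare the text of two generated pages yourself. The Python sketch below uses Jaccard similarity over three-word shingles, which is a common near-duplicate heuristic. It is not how Google or Moz's Crawl Diagnostics score duplication, and both function names are just illustrative.

```python
def shingles(text: str, size: int = 3) -> set[tuple[str, ...]]:
    """Return the set of overlapping word n-grams ("shingles") in the text."""
    words = text.lower().split()
    if len(words) < size:
        # Text too short for a full shingle: treat it as one unit (or nothing).
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + size]) for i in range(len(words) - size + 1)}


def jaccard(a: str, b: str, size: int = 3) -> float:
    """Share of shingles two texts have in common: 0.0 = none, 1.0 = identical."""
    sa, sb = shingles(a, size), shingles(b, size)
    if not sa and not sb:
        return 1.0
    return len(sa & sb) / len(sa | sb)


if __name__ == "__main__":
    ny = "Our SharePoint training is easy to find in the New York, NY area."
    dc = "Our SharePoint training is easy to find in the Washington, DC area."
    print(f"similarity: {jaccard(ny, dc):.2f}")
```

Running this over the full page text (template paragraph, directions, schedule) for pairs of locations would show which pages are carried almost entirely by the shared template and most need extra location-specific content.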
-
Takeshi,
I think you are OK. While it is always better to write completely unique content, I would say in this scenario you are fine.
I would implement this and watch your rankings as well as other indicators to make sure.
Ron
-
There is no problem with standardizing meta tags, titles, h1s, etc. This is standard practice for large sites.
What can be problematic is if ALL the content on your pages is just templatized/madlibs. Having some randomized content is obviously better than nothing, but it's not going to do well if that's all the content you have on those pages. Having some standardized paragraphs with words filled in is fine, but make sure you have unique content on all of those pages as well.
If you have user reviews, that can be one good way to get some free UGC onto your pages. 700 pages also isn't terribly many; you can get some decent unique content written for that amount for under $10k. If that's out of your budget, start by focusing on the highest value pages, and calculate how many pages it makes sense to write unique content for based on ROI.