Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
Duplicate Content with ?Page ID's in WordPress
-
Hi there,
I'm trying to figure out the best way to solve a duplicate content problem that I have due to Page ID's that WordPress automatically assigns to pages. I know that in order for me to resolve this I have to use canonical urls but the problem for me is I can't figure out the URL structure.
Moz is showing me thousands of duplicate content errors that are mostly related to Page IDs
For example, this is how a page's url should look like on my site
Moz is telling me there are 50 duplicate content errors for this page. The page ID for this page is 82 so the duplicate content errors appear as follows
and so on. For 47 more pages. The problem repeats itself with other pages as well.
My permalinks are set to "Post Name" so I know that's not an issue.
What can I do to resolve this? How can I use canonical URLs to solve this problem. Any help will be greatly appreciated.
-
this might do it as well
A flexible pattern URL mapping is a way of redirecting all URLs that match a particular pattern, to equivalent destination URLs, using a single mapping. It does this by allowing you to parse out and name portions of the requested URL to substitute into the destination URL. These types of URL mappings are useful when you are changing the structure or format of your URLs, but want to make sure you can redirect requests for pages under their old URL structure to their new URLs.
An example of a flexible pattern URL is the following:
/myblog/:post-year/:post-month/*rest-of-url?id=:post-id
Each portion of the URL above that starts with a colon (:) creates a named component that is matched until the next delimiter character (./=&?), and any portion that starts with an asterisk (*) creates a named component that is matched until the end of the URL (up to the query string).
The named components can then be used in the URL mapping's destination, with each name included inside of curly braces. For example, the named components defined in the flexible pattern URL above could be used to create the following destination:
/newblog/{post-year}/{post-month}/post-{post-id}/{rest-of-url}
To demonstrate how this flexible pattern URL mapping would work, let's consider the following example requested URL and where it would be redirected. The named components in the requested and destination URLs are highlighted.
Requested URL: http://www.mydomain.com/myblog/2013/12/marketing/inbound-marketing-rocks?id=98765
Redirected to: http://www.mydomain.com/newblog/2013/12/post-98765/marketing/inbound-marketing-rocks
With this pattern-based URL mapping we were able to retain all of the important, identifying parts of the original URL and insert them into the new URL structure. In addition, with this particular mapping, we were able to:
- capture the variable-length {rest-of-url} component (i.e. marketing/inbound-marketing-rocks) to be used in the destination url, by using an asterisk (*) at the beginning of that component's definition
- move the {post-id} component from the query string in the original URL into the middle of the URL in the destination
-
you may want to remove the flash file from the website. It is not compatible with iPhones and flash is really something I would stay away from on websites.
This is flash
http://www.spamedica.com/wp-content/themes/spamedia/flash/spamedica_flash.swf
See why that's not good
http://www.browserstack.com/screenshots/46bca267a16b8726a26c7438e76317ef51b877be
Try
http://www.browserstack.com/responsive
Here's an example of the exact same thing using the gif
http://cl.ly/image/2t2d470b3A2F/Screen Recording 2014-09-23 at 12.58 PM.gif
iPhone
The http://www.siteground.com/ your host is great for shared hosting
your IP
181.224.137.194 - 126 other sites hosted on this server
I honestly would give them a call and have them correct your htaccess file
However if you want something that's going to be faster and focused 100% on WordPress use manage WordPress hosting provider. Like GetFlywheel you get your own VPS only one site per an IP no shared resources and this type of stuff is never a problem for them to fix.
You can migrate 100% free in addition.
Overall I would say that your site needs some TLC
http://tools.pingdom.com/fpt/#!/exa05N/http://www.spamedica.com/
you have over 100 requests and your server does a good job of catching after it's been loaded but not that much prior to so the 1st visit will be slow.
Most important part is it needs to be responsive and work properly on mobile devices.
that includes using CSS 3 and HTML 5 to substitute for flash so render properly on all devices. In addition to allowing people to browse it on all devices.
http://www.spamedica.com/
Tested from New York City, New York, USA on September 23 at 14:00:29
<dl class="first">
<dt>Page size</dt>
<dd>1.9MB</dd>
</dl>
<dl>
<dt>Load time</dt>
<dd>2.31s</dd>
</dl>
<dl>
<dt>Requests</dt>
<dd>112</dd>
</dl>
<dl class="last">
<dt>Perf. grade</dt>
<dd>69/100</dd>
</dl>
Sincerely,
Thomas
-
You can check it with
https://yoast.com/wp-content/permalink-helper.php
https://yoast.com/change-wordpress-permalink-structure/
If you’re on Apache and you decide to do the redirect, having been on a
/yyyy/mm/dd/%postname%/
structure before, you might benefit from this simple redirect which you could throw into your .htaccess file:|
1
|RedirectMatch 301 /\d{4}/\d{2}/\d{2}/(.*) http://example.com/$1
| -
PS try to save your link structure to the way you want it and make sure you click the save button.
if it does not work there a problem that would require access to WordPress to fix
http://wordpress.stackexchange.com/questions/39665/custom-htaccess-rewrite-rule-for-page
-
Rewrite the URLs your hosting company for a plug-in like
https://yoast.com/wordpress/plugins/seo-premium/
Yoast makes it better be best redirect plug-in for WordPress I know of.
You can use The redirection plug-in
https://wordpress.org/plugins/redirection/
Hosting
http://getflywheel.com/help/do-you-support-htaccess-files/
You will want to clean up what WordPress did not.
Via 301 redirects it is most likely an error in your HTaccess file
Your WordPress install is not up to date as well you may want to ensure that is updated to 4.0.
Have http://getflywheel.com/tour/ do it
This is something I would recommend a company like Getflywheel.com at $15 a month you get manage WordPress your own all SSD VPS and all the issues with WordPress to take care of by them. http://getflywheel.com is a bargain.
Hope that helps,
Thomas
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate content with tagging and categories
Hello, Moz is showing that a site has duplicate content - which appears to be because of tags and categories. It is a relatively new site, with only a few blog publications so far. This means that the same articles are displayed under a number of different tags and categories... Is this something I should worry about, or just wait until I have more content? The 'tag' and 'category' pages are not really pages I would expect or aim for anyone to find in google results anyway. Would be glad to here any advice / opinions on this Thanks!
On-Page Optimization | | wearehappymedia1 -
Duplicate page titles and hreflang tags
Moz is flagging a lot of pages on our site which have duplicate page titles. 99% of these are international pages which hreflang tags in the sitemap. Do I need to worry about this? I assumed that it wasn't an issue given the use of hreflang. And if that's the case, why is Moz flagging them as an issue? Thanks.
On-Page Optimization | | ahyde0 -
Why do I have 2 different URL's for the same page - is this good practice?
Hi GuysMy father is currently using a programmer to build his new site. Knowing a little about SEO etc, I was a little suspicious of the work carried out. **Anyone with good programming and SEO knowledge, please offer your advice!**This page http://www.thewoodgalleries.co.uk/gallery-range-wood-flooring/ which is soon to be http://www.thewoodgalleries.co.uk/engineered-wood/ you'll see has a number of different products. The products on this particular page have been built into colour categories like thishttp://www.thewoodgalleries.co.uk/engineered-wood/lights-greys http://www.thewoodgalleries.co.uk/engineered-wood/beiges http://www.thewoodgalleries.co.uk/engineered-wood/browns http://www.thewoodgalleries.co.uk/engineered-wood/darks-blacks This is fine. Eventually when we add to our selection of woods, we'll easily segment each product into "colour categories" for users to easily navigate to. My question is - Why do I have 2 different URL's for the same page - is this good practice? Please see below... Visible URL - http://www.thewoodgalleries.co.uk/engineered-wood/browns/cipressa/Below is the permalink seen in Word Press for this page also.Permalink: http://www.thewoodgalleries.co.uk/engineered-wood/browns-engineered-wood/cipressa/and in the Word Press snippet shows the same permalink urlCipressa | Engineered Brown Wood | The Wood Gallerieswww.thewoodgalleries.co.uk/engineered-wood/browns-engineered-wood/cipressa/ Buy Cipressa Engineered Brown Wood, available at The Wood Galleries, London. Provides an Exceptional Foundation for Elegant Décor, Extravagant .. If this is completely ok and has no negative search impact - then I'm happy. If not what should I advise to my programmer to do? Your help would be very much appreciated. Regards Faye
On-Page Optimization | | Faye2340 -
Noindex child pages (whose content is included on parent pages)?
I'm sorry if there have been questions close to this before... I've using WordPress less like a blogging platform and more like a CMS for years now... For content management purposes we organize a lot of content around Parent/Child page (and custom-post-type) relationships; the Child pages are included as tabbed content on the Parent page. Should I be noindexing these child pages, since their content is already on the site, in full, on their Parent pages (ie. duplicate content)? Or does it not matter, since the crawlers may not go to all of the tabbed content? None of the pages have shown up in Moz's "High Priority Issues" as duplicate content but it still seems like I'm making the Parent pages suffer needlessly... Anything obvious I'm not taking into consideration? By the by, this is my first post here @ Moz, which I'm loving; this site and the forums are such a great resource! Anyways, thanks in advance!
On-Page Optimization | | rsigg0 -
Page content length...does it matter?
As I begin developing my website's content, does it matter how long or short the actual text found in the is? I heard someone say before "a minimum of 250 words", but is that true? If so, what is the maximum length I should use?
On-Page Optimization | | wlw20090 -
Page title getting cut off in SERPS even though it's under 70 characters?
I re-wrote the page title of a home page for a site I'm working on and made sure it's under 70 characters (68 to be exact) to comply with best practices and make sure it doesn't get cut-off in the SERPS. It's still getting cut-off though and right when it gets to the brand/website name. Does a "-" have anything to do with it? Does that translate to an elipsis? Format: keywords - website/brand.com Can anybody tell me why this would be happening?
On-Page Optimization | | MichaelWeisbaum0 -
Would it be bad to change the canonical URL to the most recent page that has duplicate content, or should we just 301 redirect to the new page?
Is it bad to change the canonical URL in the tag, meaning does it lose it's stats? If we add a new page that may have duplicate content, but we want that page to be indexed over the older pages, should we just change the canonical page or redirect from the original canonical page? Thanks so much! -Amy
On-Page Optimization | | MeghanPrudencio0 -
Avoiding "Duplicate Page Title" and "Duplicate Page Content" - Best Practices?
We have a website with a searchable database of recipes. You can search the database using an online form with dropdown options for: Course (starter, main, salad, etc)
On-Page Optimization | | smaavie
Cooking Method (fry, bake, boil, steam, etc)
Preparation Time (Under 30 min, 30min to 1 hour, Over 1 hour) Here are some examples of how URLs may look when searching for a recipe: find-a-recipe.php?course=starter
find-a-recipe.php?course=main&preperation-time=30min+to+1+hour
find-a-recipe.php?cooking-method=fry&preperation-time=over+1+hour There is also pagination of search results, so the URL could also have the variable "start", e.g. find-a-recipe.php?course=salad&start=30 There can be any combination of these variables, meaning there are hundreds of possible search results URL variations. This all works well on the site, however it gives multiple "Duplicate Page Title" and "Duplicate Page Content" errors when crawled by SEOmoz. I've seached online and found several possible solutions for this, such as: Setting canonical tag Adding these URL variables to Google Webmasters to tell Google to ignore them Change the Title tag in the head dynamically based on what URL variables are present However I am not sure which of these would be best. As far as I can tell the canonical tag should be used when you have the same page available at two seperate URLs, but this isn't the case here as the search results are always different. Adding these URL variables to Google webmasters won't fix the problem in other search engines, and will presumably continue to get these errors in our SEOmoz crawl reports. Changing the title tag each time can lead to very long title tags, and it doesn't address the problem of duplicate page content. I had hoped there would be a standard solution for problems like this, as I imagine others will have come across this before, but I cannot find the ideal solution. Any help would be much appreciated. Kind Regards5