Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
Duplicate Content with ?Page ID's in WordPress
-
Hi there,
I'm trying to figure out the best way to solve a duplicate content problem that I have due to Page ID's that WordPress automatically assigns to pages. I know that in order for me to resolve this I have to use canonical urls but the problem for me is I can't figure out the URL structure.
Moz is showing me thousands of duplicate content errors that are mostly related to Page IDs
For example, this is how a page's url should look like on my site
Moz is telling me there are 50 duplicate content errors for this page. The page ID for this page is 82 so the duplicate content errors appear as follows
and so on. For 47 more pages. The problem repeats itself with other pages as well.
My permalinks are set to "Post Name" so I know that's not an issue.
What can I do to resolve this? How can I use canonical URLs to solve this problem. Any help will be greatly appreciated.
-
this might do it as well
A flexible pattern URL mapping is a way of redirecting all URLs that match a particular pattern, to equivalent destination URLs, using a single mapping. It does this by allowing you to parse out and name portions of the requested URL to substitute into the destination URL. These types of URL mappings are useful when you are changing the structure or format of your URLs, but want to make sure you can redirect requests for pages under their old URL structure to their new URLs.
An example of a flexible pattern URL is the following:
/myblog/:post-year/:post-month/*rest-of-url?id=:post-id
Each portion of the URL above that starts with a colon (:) creates a named component that is matched until the next delimiter character (./=&?), and any portion that starts with an asterisk (*) creates a named component that is matched until the end of the URL (up to the query string).
The named components can then be used in the URL mapping's destination, with each name included inside of curly braces. For example, the named components defined in the flexible pattern URL above could be used to create the following destination:
/newblog/{post-year}/{post-month}/post-{post-id}/{rest-of-url}
To demonstrate how this flexible pattern URL mapping would work, let's consider the following example requested URL and where it would be redirected. The named components in the requested and destination URLs are highlighted.
Requested URL: http://www.mydomain.com/myblog/2013/12/marketing/inbound-marketing-rocks?id=98765
Redirected to: http://www.mydomain.com/newblog/2013/12/post-98765/marketing/inbound-marketing-rocks
With this pattern-based URL mapping we were able to retain all of the important, identifying parts of the original URL and insert them into the new URL structure. In addition, with this particular mapping, we were able to:
- capture the variable-length {rest-of-url} component (i.e. marketing/inbound-marketing-rocks) to be used in the destination url, by using an asterisk (*) at the beginning of that component's definition
- move the {post-id} component from the query string in the original URL into the middle of the URL in the destination
-
you may want to remove the flash file from the website. It is not compatible with iPhones and flash is really something I would stay away from on websites.
This is flash
http://www.spamedica.com/wp-content/themes/spamedia/flash/spamedica_flash.swf
See why that's not good
http://www.browserstack.com/screenshots/46bca267a16b8726a26c7438e76317ef51b877be
Try
http://www.browserstack.com/responsive
Here's an example of the exact same thing using the gif
http://cl.ly/image/2t2d470b3A2F/Screen Recording 2014-09-23 at 12.58 PM.gif
iPhone
The http://www.siteground.com/ your host is great for shared hosting
your IP
181.224.137.194 - 126 other sites hosted on this server
I honestly would give them a call and have them correct your htaccess file
However if you want something that's going to be faster and focused 100% on WordPress use manage WordPress hosting provider. Like GetFlywheel you get your own VPS only one site per an IP no shared resources and this type of stuff is never a problem for them to fix.
You can migrate 100% free in addition.
Overall I would say that your site needs some TLC
http://tools.pingdom.com/fpt/#!/exa05N/http://www.spamedica.com/
you have over 100 requests and your server does a good job of catching after it's been loaded but not that much prior to so the 1st visit will be slow.
Most important part is it needs to be responsive and work properly on mobile devices.
that includes using CSS 3 and HTML 5 to substitute for flash so render properly on all devices. In addition to allowing people to browse it on all devices.
http://www.spamedica.com/
Tested from New York City, New York, USA on September 23 at 14:00:29
<dl class="first">
<dt>Page size</dt>
<dd>1.9MB</dd>
</dl>
<dl>
<dt>Load time</dt>
<dd>2.31s</dd>
</dl>
<dl>
<dt>Requests</dt>
<dd>112</dd>
</dl>
<dl class="last">
<dt>Perf. grade</dt>
<dd>69/100</dd>
</dl>
Sincerely,
Thomas
-
You can check it with
https://yoast.com/wp-content/permalink-helper.php
https://yoast.com/change-wordpress-permalink-structure/
If you’re on Apache and you decide to do the redirect, having been on a
/yyyy/mm/dd/%postname%/
structure before, you might benefit from this simple redirect which you could throw into your .htaccess file:|
1
|RedirectMatch 301 /\d{4}/\d{2}/\d{2}/(.*) http://example.com/$1
| -
PS try to save your link structure to the way you want it and make sure you click the save button.
if it does not work there a problem that would require access to WordPress to fix
http://wordpress.stackexchange.com/questions/39665/custom-htaccess-rewrite-rule-for-page
-
Rewrite the URLs your hosting company for a plug-in like
https://yoast.com/wordpress/plugins/seo-premium/
Yoast makes it better be best redirect plug-in for WordPress I know of.
You can use The redirection plug-in
https://wordpress.org/plugins/redirection/
Hosting
http://getflywheel.com/help/do-you-support-htaccess-files/
You will want to clean up what WordPress did not.
Via 301 redirects it is most likely an error in your HTaccess file
Your WordPress install is not up to date as well you may want to ensure that is updated to 4.0.
Have http://getflywheel.com/tour/ do it
This is something I would recommend a company like Getflywheel.com at $15 a month you get manage WordPress your own all SSD VPS and all the issues with WordPress to take care of by them. http://getflywheel.com is a bargain.
Hope that helps,
Thomas
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate URL's in Sitemap? Is that a problem?
I submitted a sitemap to on Search Console - but noticed that there are duplicate URLs, is that a problem for Google?
On-Page Optimization | | Luciana_BAH0 -
Content hidden behind a 'read all/more..' etc etc button
Hi Anyone know latest thinking re 'hidden content' such as body copy behind a 'read more' type button/link in light of John Muellers comments toward end of last year (that they discount hidden copy etc) & follow up posts on Search Engine Round Table & Moz etc etc ? Lots of people were testing it and finding such content was still being crawled & indexed so presumed not a big deal after all but if Google said they discount it surely we now want to reveal/unhide such body copy if it contains text important to the pages seo efforts. Do you think it could be the case that G is still crawling & indexing such content BUT any contribution that copy may have had to the pages seo efforts is now lost if hidden. So to get its contribution to SEO back one needs to reveal it, have fully displayed ? OR no need to worry and can keep such copy behind a 'read more' button/link ? All Best Dan
On-Page Optimization | | Dan-Lawrence0 -
Why do I have 2 different URL's for the same page - is this good practice?
Hi GuysMy father is currently using a programmer to build his new site. Knowing a little about SEO etc, I was a little suspicious of the work carried out. **Anyone with good programming and SEO knowledge, please offer your advice!**This page http://www.thewoodgalleries.co.uk/gallery-range-wood-flooring/ which is soon to be http://www.thewoodgalleries.co.uk/engineered-wood/ you'll see has a number of different products. The products on this particular page have been built into colour categories like thishttp://www.thewoodgalleries.co.uk/engineered-wood/lights-greys http://www.thewoodgalleries.co.uk/engineered-wood/beiges http://www.thewoodgalleries.co.uk/engineered-wood/browns http://www.thewoodgalleries.co.uk/engineered-wood/darks-blacks This is fine. Eventually when we add to our selection of woods, we'll easily segment each product into "colour categories" for users to easily navigate to. My question is - Why do I have 2 different URL's for the same page - is this good practice? Please see below... Visible URL - http://www.thewoodgalleries.co.uk/engineered-wood/browns/cipressa/Below is the permalink seen in Word Press for this page also.Permalink: http://www.thewoodgalleries.co.uk/engineered-wood/browns-engineered-wood/cipressa/and in the Word Press snippet shows the same permalink urlCipressa | Engineered Brown Wood | The Wood Gallerieswww.thewoodgalleries.co.uk/engineered-wood/browns-engineered-wood/cipressa/ Buy Cipressa Engineered Brown Wood, available at The Wood Galleries, London. Provides an Exceptional Foundation for Elegant Décor, Extravagant .. If this is completely ok and has no negative search impact - then I'm happy. If not what should I advise to my programmer to do? Your help would be very much appreciated. Regards Faye
On-Page Optimization | | Faye2340 -
Schema and Rich Snippets What's the difference?
Sorry if this is a daft question but... what is the difference between Rich snippets and Schema markup? Are they one and the same? They seem to be used interchaneably and I'm confused. If someone could give a brief sentence or two about the differences between them that would be great. Thanks
On-Page Optimization | | AL123al1 -
Duplicate content penalty
when moz crawls my site they say I have 2x the pages that I really have & they say I am being penalized for duplicate content. I know years ago I had my old domain resolve over to my new domain. Its the only thing that makes sense as to the duplicate content but would search engines really penalize me for that? It is technically only on 1 site. My business took a significant sales hit starting early July 2013, I know google did and algorithm update that did have SEO aspects. I need to resolve the problem so I can stay in business
On-Page Optimization | | cheaptubes0 -
Duplicate Content for Men's and Women's Version of Site
So, we're a service where you can book different hairdressing services from a number of different salons (site being worked on). We're doing both a male and female version of the site on the same domain which users are can select between on the homepage. The differences are largely cosmetic (allowing the designers to be more creative and have a bit of fun and to also have dedicated male grooming landing pages), but I was wondering about duplicate pages. While most of the pages on each version of the site will be unique (i.e. [male service] in [location] vs [female service] in [location] with the female taking precedent when there are duplicates), what should we do about the likes of the "About" page? Pages like this would both be unique in wording but essentially offer the same information and does it make sense to to index two different "About" pages, even if the titles vary? My question is whether, for these duplicate pages, you would set the more popular one as the preferred version canonically, leave them both to be indexed or noindex the lesser version entirely? Hope this makes sense, thanks!
On-Page Optimization | | LeahHutcheon0 -
Duplicate Content from on Competitor's site?
I've recently discovered large blocks of content on a competitors site that has been copy and pasted from a client's site. From what I know, this will only hurt the competitor and not my client since my guy was the original. Is this true? Is there any risk to my client? Should we take action? Dino
On-Page Optimization | | Dino640 -
How to properly remove pages and a category from Google's index
I want to remove this category http://www.webdesign.org/web-design-news-all/ and all the pages in that category (e.g. http://www.webdesign.org/web-design-news-all/7386.html ) from Google's index. I used the following string in the "Reomval URS" section in Google Webmaster Tools: http://www.webdesign.org/web-design-news-all/* is that correct or I better use http://www.webdesign.org/web-design-news-all/ ? Thanks in advance.
On-Page Optimization | | VinceWicks0