Duplicate content issue
-
Hi everyone,
I have an issue determining what type of duplicate content I have.
www.example.com/index.php?mact=Calendar,m57663,default,1&m57663return_id=116&m57663detailpage=&m57663year=2011&m57663month=6&m57663day=19&m57663display=list&m57663return_link=1&m57663detail=1&m57663lang=en_GB&m57663returnid=116&page=116
Since I am not an coding expert, to me it looks like it is a URL parameter duplicate content. Is it?
At the same time "return_id" would makes me think it is a session id duplicate content. I am confused about how to determine different types of duplicate content, even by reading articles on Seomoz about it: http://www.seomoz.org/learn-seo/duplicate-content.
Could someone help me on how to recognize different types of duplicate content?
Thank you!
-
Thank you guys for being so helpful!!:)
-
Hello Jeff, I would like to say first that lots of sites have duplicate content problems. For the most part, this is not a huge issue. When search engines find duplicate content they choose one of the pages to list in the index, and then will ignore the other. This assumes, of course, that the nature of the duplicate content is not so bad that it would lead to the search engine wanting to ban you. This can happen if a review of your situation causes them to believe that you are deliberately trying to rank multiple times for the same search terms.
Here is a link that fixes the problem of duplicate content :
http://www.seomoz.org/blog/duplicate-content-in-a-post-panda-world
-
Let me try.
1. The answer to your first question is that it only matters if you're trying to figure out how to handle it programmaticaly. In this case you might have to ask the developer if this is being done by a session id. To me it looks more like a URL parameter, but without a live example I wouldnt know, could you provide the website in question? If not try visiting the website once, clear your cache and then visit again and see if the number after "return_id" changes. if it changes that is a session id. If it stays the same have a friend visit the website in the same manor and see if the number stays the same, if it changes then there's a good chance that this is a session id.
No matter if it's a session id adding it or not "return_id" is technically a URL parameter that is triggered by a session id.
2. The second question is still a bit vague, so let me see if this is correct. are you asking how to treat the duplicate content once you know what is causing it? If so, then follow these rules.
If the content changes significantly in the presence of the session id or parameter then this is not duplicate content. If the content does change do the following:
- make sure to use rel canonical for the root URL. In your example that would be: www.example.com/index.php?mact=Calendar
- set the URL parameters in Google and Bings webmaster tools to treat the parameter correctly.
- When the parameter or session id is present add the noindex, follow robots tag. this will allow the bots to spider through and pass on link juice in the event that someone links to your parameter versions
I think you have a larger issue, which is that your website's code is using the index.php to generate all of the pages, in the example that is calendar. This is a common mistake that programmers make since they work to do things as quickly and efficiently as possible. Its far easier to keep all of the code in the one file than to create several different dynamic files that work with each other.
If you dont have the ability to break this down and generate out different pages you might be able to use URL Rewrites to make browsers and bots think the URLs are actually different.
-
Thank you for your answers but I guess I didn't formulate properly my question.
My 1st question was: What kind of duplicate content is it?
- session id
- or url parameter
My second question is: How do you differentiate them? What do you look at when a duplicate content is a session id one or a url parameter issue?
-
You can determine if you have duplicate content several ways. search in google site:example.com and see how many pages google knows at your website. Also, when you are on page with this crazy url, open source code and see if a page has rel="canonical" tag. In your page that would be the best solution to signal robot that this is the same page as your index.php page.
Also, you can try Xenu. good and fast program to run your site on duplicates.
Hope it helps, you can show your website so we can take a look.
-
Hi Jeff,
index.php is the same as index.php?something=something&anotherthing=somethinglese
Each page should have a different url like index.php and page.php instead of always using index.php
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Unsolved Duplicate Content
We have multiple collections being flagged as duplicate content - but I can't find where these duplications are coming from? The duplicate content has no introductory text, and no meta description. Please see examples:- This is the correct collection page:-
Technical SEO | | Caroline_Ardmoor
https://www.ardmoor.co.uk/collections/deerhunter This is the incorrect collection page:-
https://www.ardmoor.co.uk/collections/vendors How do I stop this incorrect page from showing?0 -
Setting up addon domains properly (bonus duplicate content issue inside)
A new client of mine is using 1and1 hosting from back in the dark ages. Turns out, her primary domain and her main website (different domain) are exactly the same. She likes to have the domains names of her books, but her intention is to have it redirect to her main site. Unfortunately, 1and1's control panel is light years behind cpanel, so when she set up her new domains it just pointed everything to the same directory. I just want to make sure I don't make this up, so please correct me if I'm wrong about something. I'm assuming this is a major duplicate content deal, so I plan to create a new directory for each add-on domain. Since her main site is an add-on itself, I'll have to move all the files into it's new home directory. Then I'll create an htaccess file for each domain and redirect it to her main site. Right so far? My major concern is with the duplicate content. She's had two sites being exactly the same for years. Will there be any issues leftover after I set everything up properly? Is there anything else I need to do? Thanks for the help guys! I'm fairly new to this community and love the opportunity to learn from the best!
Technical SEO | | Mattymar0 -
Mobile and hidden content - Any issue for SEO?
In reference to mobile - am I walking a fine SEO line when it comes to hidden content on mobile? On the responsive variations of sites we are working on some content is hidden (that displays on the desktop version of the site) so that pages on mobile can display correctly. Is this negative for SEO? Appreciate any feedback Cheers.
Technical SEO | | Oxfordcomma0 -
Duplicate Content
We have a ton of duplicate content/title errors on our reports, many of them showing errors of: http://www.mysite.com/(page title) and http://mysite.com/(page title) Our site has been set up so that mysite.com 301 redirects to www.mysite.com (we did this a couple years ago). Is it possible that I set up my campaign the wrong way in SEOMoz? I'm thinking it must be a user error when I set up the campaign since we already have the 301 Redirect. Any advice is appreciated!
Technical SEO | | Ditigal_Taylor0 -
Duplicate Page Content Report
In Crawl Diagnostics Summary, I have 2000 duplicate page content. When I click the link, my Wordpress return "page not found" and I see it's not indexed by Google, and I could not find the issue in Google Webmaster. So where does this link come from?
Technical SEO | | smallwebsite0 -
Duplicate content
I have two page, where the second makes a duplicate content from the first Example:www.mysite.com/mypagewww.mysite.com/mysecondpageIf i insert still making duplicate content?Best regards,Wendel
Technical SEO | | peopleinteractive0 -
Lots of duplicate content warnings
I have a site that says that I have 2,500 warnings. It is a real estate website and of course we use feeds. it says I have a lot of duplicate content. One thing is a page called "Request an appointment" and that is a url for each listing. Since there are 800 listings on my site. How could I solve this problem so that this doesn't show up as duplicate content since I use the same "Request an Appointment" verbeage on each of those? I guess my developer who used php to do it, created a dedicated url to each. Any help would be greatly appreciated.
Technical SEO | | SeaC0 -
Duplicate Pages Issue
I noticed a problem and I was wondering if anyone knows how to fix it. I was a sitemap for 1oxygen.com, a site that has around 50 pages. The sitemap generator come back with over a 2000 pages. Here is two of the results: http://www.1oxygen.com/portableconcentrators/portableconcentrators/portableconcentrators/services/rentals.htm
Technical SEO | | chuck-layton
http://www.1oxygen.com/portableconcentrators/portableconcentrators/1oxygen/portableconcentrators/portableconcentrators/portableconcentrators/oxusportableconcentrator.htm These are actaully pages somehow. In my FTP there in the first /portableconentrators/ folder there is about 12 html documents and no other folders. It looks like it is creating a page for every possible folder combination. I have no idea why you those pages above actually work, help please???0