There are over 300 pages on our client's site flagged as having duplicate page content. Before we embark on a programming solution using canonical tags, our developers are requesting a list of the originating sites/links/sources for these odd URLs.
How can we find a list of the originating URLs? If you can provide a list of originating sources, that would be helpful.
For example, the following pages are showing (as a sample) as duplicate content:
www.crittenton.com/Video/View.aspx?id=87&VideoID=11
www.crittenton.com/Video/View.aspx?id=87&VideoID=12
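To make the pattern easier to see, here is a minimal Python sketch (our own illustration, not something the CMS or any crawl tool produces) that splits the two sample URLs into path and query parameters. The helper name `split_url` is hypothetical, and the `http://` scheme is an assumption, since the reported URLs do not include one.

```python
from urllib.parse import urlsplit, parse_qs

# The two sample URLs reported as duplicates (copied from the list above).
sample_urls = [
    "www.crittenton.com/Video/View.aspx?id=87&VideoID=11",
    "www.crittenton.com/Video/View.aspx?id=87&VideoID=12",
]

def split_url(url):
    """Return (path, {param: value}) for a URL (hypothetical helper)."""
    if "://" not in url:
        # Assumption: the report omits the scheme, so add one for parsing.
        url = "http://" + url
    parts = urlsplit(url)
    params = {key: values[0] for key, values in parse_qs(parts.query).items()}
    return parts.path, params

parsed = [split_url(u) for u in sample_urls]
for path, params in parsed:
    print(path, params)

# Parameters that carry the same value on every sample URL.
first_params = parsed[0][1]
shared = {
    key: value
    for key, value in first_params.items()
    if all(p.get(key) == value for _, p in parsed)
}
print("shared:", shared)
```

On these two samples, both URLs share the same path and the same `id` value and differ only in `VideoID`. Grouping the full list of 300+ URLs the same way might help show which parameter combinations are unexpected.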
"How did you get all those duplicate URLs? I have tried to google the 'contact us', 'news', and 'video' pages, but I didn't get all those duplicate pages. The page id=87 on most of the duplicate pages is not supposed to be there. I was wondering how the visitors got to all those duplicate pages. Please advise."
Note that the CMS does not create this type of hybrid URL. We are as curious as you are about where, why, and how these are being created. Thanks.