Bizarre PDF URL string
-
Hey folks,
I'm getting literally hundreds of Duplicate Title and Duplicate Content errors for a site, and most of them are a result of the same issue. The site uses javascript container pages a lot, but each gets their own URL. Unfortunately, it seems like each page is also loading all the content for all the other pages, or something.
For instance, I have a section of the site under /for-institutions/, and then there are 5 container pages under that. Each container page has it's own URL, so when you select it, you get the URL /for-institutions/products/ or /for-institutions/services/ etc. However, the institutions container page doesn't change, just the content within.
In my SEO results, I'm getting the following:
/for-institutions/$%7Bpdf%7D/
/for-institutions/$%7Bpdf%7D/$%7Bpdf%7D/ etc, each as a duplicate title and content page. How can I eliminate this? Is there a regular expression that rewrites URL segments beginning with $ ?
For your reference: The page is set up so that any URL that doesn't exist just refers to the subdirectory. /for-institutions/$%7Bpdf%7D/ displays /for-institutions/, but does not rewrite the URL. So too if I were to enter /for-institutions/dog.
-
I didn't develop the site and I don't currently have FTP access to actually edit the templates, or I could probably eliminate that myself. Thanks! I will figure out exactly what's going on with your help now. I appreciate it.
-
Did you get a chance to have your developer look at your code? Here's a snippet -
There's another one right above that I suspect also includes some items that aren't parsing.
<fieldset class="question">
<legend>${question}</legend>
{{each answers}}<label for="q${$data.id}a${$index}">${text}</label>
{{/each}}
</fieldset>
It's more than likely template code specific to the MonkCMS that isn't getting processed - perhaps there's not a matching variable / object setup in the system.
You will notice that both sections are currently hidden, so maybe you can just do without them altogether?
-
I'm not familiar with the Monk CMS, but looking at your code, you have some links to:
${pdf} - the %7B and %7D are basically URL encodes for the braces.
My guess is that ${pdf} is supposed to be parsing into a real URL, but it's not happening...
-
-
Can you provide a full URL for the website?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Best URL when adding an SSL certificate . . .
Our (small) company is a little late to the party on this, and we've only just realised that we're better off with an SSL certificate for our website. (Yes I know, I know, but we dropped SEO some time ago after getting severely bitten by a certain Penguin, and are only just making tentative step back to it after those intervening years, so we're running to get back up to date with these things.) This has now been implemented, but our web guy has dropped the 'www' element during the process. Our http://domain.com address has always historically been redicrected to our main http://www.domain.com address. Now our web guy has implemented the SSL cert, our website URL is appearing as https://domain.com, and he has redirected the http://www.domain.com to that new URL. Obviously all our historic (and more recent) link building has been to the http://www.domain.com address. Is this an issue, should the new Https URL keep the 'www', or does it make no difference what so ever? Conversely could it actually be of benefit dropping the 'www.' because our keyword specific product URL's are now 4 characters closer to the http and 4 digits shorter? Finally, on the links we have control of (professional trade associations etc) do we need to ask them to change the links to the new Https address, or does the transition from Http to Https make no difference?
Web Design | | Wookii0 -
Can the design still be considered adaptive if the URL is different?
I was under the impression our site had a mobile dedicated design, but my developers are telling me we have an adaptive design. The mobile site is set up different and has different content and the url is as follows: www.site.com/MobileView/MobileHome.aspx Can it still be considered adaptive if the URL is not the exact same? Hopefully this make sense and I appreciate anyone's input!
Web Design | | AliMac260 -
Googlebot Reports All URLs as Unreachable
Webmaster Tools is reporting that all of our site's URLs - located www.zuken.com - are "unreachable." The URLs display correctly in all browsers. We recently switched hosting providers, but they assure us there is no security setting that would be causing this issue. Any ideas? Is this a glitch with Webmaster Tools?
Web Design | | Zuken0 -
Switched from Wix to Wordpress dreaded hashtag URL
Recently took over managing a site for a non-profit which was using the dreaded Wix. Switched over to Wordpress but now Google still has the old URL's with the hashtag. Can't forward them in .htaccess and don't want to add javascript for fear of slowing down load time. I found a solution that seems like it will take hours and hours of work. I found the solution at http://www.thedriversgarage.com/web-technology/redirecting-hashbang-urls-wix-urls/ but it seems like it would take hours with all the URL's. I submitted an XML sitemap in Google webmaster tools. My question is, how serious could this effect SEO for my site? Google accepted the new sitemap but still has the old URL's in SERP. How long does this generally take to remove? Will the hashtag URL's penalize the site for duplicate content? If so is there a way to tell Google the homepage without hashtags is the page with original content? Sort of like the rel=canonical tag which I know wont work as the hashtag URL's all redirect to the homepage so they will all have the tag. Does Google ignore the hashtag? Could there even be a benefit to this, possibly the homepage getting more page authority due to the redirects? How serious is this? Thanks in advancing.
Web Design | | limited70 -
Pulling old site-map and URL structure of a site
Hey guys how do I pull an old sitemap or URL structure of a site ! This company I am helping out . Build a new site without any 301 redirect ! It's been about 2 months and hosting company sent me. SQL database file said we basically need to build another site ! Wondering if there are any other ways to see what exact urls were existent before their change over
Web Design | | BizDetox0 -
URLs appear in Google Webmaster Tools that I can't find on my own site?!?
Hi, I have a Magento e-commerce site (clothing) and when I had a look through some of the sections in Google Webmaster Tools I found URLs that I can't find on my site. For example, a product url maybe http://www.example.co.uk/product-url/ which is fine. In that product there maybe three sizes of the product (Small, Medium, Large) and for some reason Googlebot is sometimes finding a url like: http://www.example.co.uk/product-url/1202/ has been found and when clicked on is a live url (Status code: 200) with is one of the sizes (medium). However I have ran a site crawl in Screaming Frog and other crawl tests and can't seem to find where Googlebot is finding these URLs. I think I need to: 1. Find how Googlebot is finding these urls? 2. Find out how to keep out of index (e.g. robots.txt, canonical etc.... Any help would be much appreciated and I'm happy to share the URL with members if they think they can have a look and help with this problem. I can share specific URLs which might make the issue seem clearer, let me know? Thanks, Darrell
Web Design | | clickyleap0 -
Website URL Structures - Which does Google prefer or does it matter?
Which URL structure does google prefer..............OR DOES IT REALLY MATTER? Option A www.example.com/services/service#1 - this is the default that wordpress uses Option B www.example.com/service#1
Web Design | | webestate0 -
Long URLs due to foreign characters
I have a site which provides forum sections for various languages. When foreign characters are used in the post title, each letter is replace by a three character replacement such as %93. This conversion makes the URLs long. The site's software automatically uses the thread's title in the URL. It is never a problem except in these instances. Any suggestions on how to handle this issue?
Web Design | | RyanKent0