Single URL not indexed
-
Hi everyone!
Some days ago, I noticed that one of our URLs (http://www.access.de/karriereplanung/webinare) is no longer in the Google index.
We never had any form of penalty, link warning etc. Our traffic by Google is constantly growing every month. This single page does not have an external link pointing to it - only internal links.
The page has been indexed all the time. The HTTP status code is 200, there is no noindex or something in the code. I submitted the URL on GWMT to let Google send it to the index. It was crawled successfully by Google, sent to the index 5 days ago - nothing happened, still not indexed.
Do you have any suggestions why this page is no longer indexed? It is well linked internally and one click away from the home page. There is still the PR of 5 showing, I always thought that pages with PR are indexed.......
-
Hi Nick,
first of all, thanx for your responses.
I already did the "fetch as Googlebot" thing 5 days ago. The page was successfully crawled and has been sent to the index successfully, according to Google Webmaster Tools. But in these 5 days, nothing changed.
I like your suggestions with the extra text. We will add some and do the "fetch as Googlebot" again and see what happens.
And you are absolutely right when it comes to the "value" of this page. It didn't send that much traffic, just a little. It is no big deal for us if this page doesn't get back into the index - but as someone doing SEO I want to figure out the problem Google seems to have with this page - just to test and learn for future problems
-
Replying to myself because I just noticed something I was wrong about.
I thought that the first box at the top was an excerpt of the page it links to, but it looks like it IS actually unique.
So you probably don't need to add anything, though expanding on that text in the first box might be a good idea.
Try to get a link to that page and see if that helps.
-
The thing is those words do appear elsewhere on the site, and Google can probably figure out that what is on this particular page is excerpts and links to the originals.
This normally isn't a huge problem, though. Lots of sites and blogs have category and tag pages that fit that description and ARE indexed (though many are not).
Before messing around with adding text which you may not really need to add, try doing a Fetch as Googlebot of the page in Google Webmaster Tools and hit the submit button when the fetch is complete. It may be that the page just got dropped by accident. If it doesn't return to the index after a few days, try adding a little totally unique content. Just a sentence or two about what these links are should be enough. I have done this on a few sites with lots of thin tag or category pages and it doesn't take a lot of text to get them into the index.
Partner link pages are also typically thin, but they may be indexed anyway if the links are useful, or ignored if it is simply a link exchange page that doesn't really have any value other than swapping links (which isn't much value). Like most things related to Google search, there isn't always a specific thing that will make the difference.
What you may want to consider is whether or not you want or need that page to appear in search, and if you think it could or should actually rank well for anything. If it doesn't matter, I wouldn't be too concerned unless there are many pages on the site that are not indexed.
-
Quite strange - I see someone visiting this URL in the Google-Analytics real-time-report.
Traffic source is direct, and Google labels this site as "/empty". Any ideas why?
-
Hi Nick,
I knwo the page is not full of content - but if you count the words, they are almost 300. And we do not have pages with the same content or links on our domain.
It could be a solution to add more text, but what about pages with partner links, for example? They normally have no content and lots of external links - so they should also be seen an "thin pages"?!
-
It may be worth generating and submitting an XML sitemap, with this page relatively high up in the map, and submitting it to Google. This then might prompt Google to crawl the page and index it.
ScreamingFrog is a free tool that generates an XML sitemap for you, while there are also free generators out there as well with just a quick google search.
-
Hi Tom,
well, honestly, we do not have a sitemap...
And no, there are no other pages with similar content on our domain.
As you said it: quite odd!
-
It may have been dropped because it was seen as "thin" content. Since most of the page is excerpts from and links to other pages, it is likely being ignored - especially if there are other pages that have the same excerpts and links. If you can add unique, some descriptive text to the page, it may do better.
And about the PageRank: The PR you can see in the Toolbar or other PR checks is usually very out of date. It could be that prior to your page's disappearance, it had a high PR and really does not now. While the visible PR can be used to get a pretty good idea of how Google ranks a page, I wouldn't give it much thought. Plenty of low PR pages rank very well for whatever search terms they are targeting, and lots of high PR pages don't rank very well.
-
That is quite odd - checked all those things from my end and found the same, but still not indexed.
My only other check at this stage would be to ask if its in the .xml sitemap that you have submitted in Google Webmaster Tools? And whether or not this page features similar content to any other pages on your site?
You've probably checked both already, but thought I'd ask just to be sure.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Unsolved Orphaned unwanted urls from the cms
Hi
Technical SEO | | MattHopkins
I am working on quite an old cms, and there are bunch of urls that don't make any sense.
https://www.trentfurniture.co.uk/products/all-outdoor-furniture/all-outdoor-furniture/1
https://www.trentfurniture.co.uk/products/all-chairs/all-chairs/1
https://www.trentfurniture.co.uk/products/all-industries/all-chairs/1
https://www.trentfurniture.co.uk/products/all-chairs/all-industries/1
https://www.trentfurniture.co.uk/products/all-chairs/banqueting-furniture/1
https://www.trentfurniture.co.uk/products/all-chairs/bar-furniture/1
https://www.trentfurniture.co.uk/products/all-chairs/bentwood-furniture/1
For example there are no internal links. And fortunately not much traffic at all. But I can't see in the cms why they are generating? I've tried to check the html code to check why, what's the reason? But all I can think of is the structure....? something odd the cms writes?
Anyone have any ideas please? And would I redirect all these? Just thinking there could be a better solution/fix, rather than redirects since there are no links or traffic.....Like the devs solve why they are generating.....Unfortunately I get very slow responses from the devs as a 3rd pty company, hence on here ;0). (Some of those are indexed too)... :0) Thanks in advance....0 -
Use existing page with bad URL or brand new URL?
Hello, We will be updating an existing page with more helpful information with the goal of reaching more potential customers through SEO and also attaching a SEM campaign to the specific landing page. The current URL of the page scores 25 on Page Authority, and has 2 links to it from blog articles (PA 35, 31). The current content needs to be rewritten to be more helpful and also needs some additional information. The downsides are that it has an "bad" URL- no target keyword and uses underscores. Which of the following choices would you make? 1. Update this old "bad" URL with new content. Benefit from the existing PA. -or- 2. Start with a new optimized URL, reusing some of the old content and utilizing a 301 redirect from the previous page? Thank you!
Technical SEO | | XLMarketing0 -
Duplicate pages in Google index despite canonical tag and URL Parameter in GWMT
Good morning Moz... This is a weird one. It seems to be a "bug" with Google, honest... We migrated our site www.three-clearance.co.uk to a Drupal platform over the new year. The old site used URL-based tracking for heat map purposes, so for instance www.three-clearance.co.uk/apple-phones.html ..could be reached via www.three-clearance.co.uk/apple-phones.html?ref=menu or www.three-clearance.co.uk/apple-phones.html?ref=sidebar and so on. GWMT was told of the ref parameter and the canonical meta tag used to indicate our preference. As expected we encountered no duplicate content issues and everything was good. This is the chain of events: Site migrated to new platform following best practice, as far as I can attest to. Only known issue was that the verification for both google analytics (meta tag) and GWMT (HTML file) didn't transfer as expected so between relaunch on the 22nd Dec and the fix on 2nd Jan we have no GA data, and presumably there was a period where GWMT became unverified. URL structure and URIs were maintained 100% (which may be a problem, now) Yesterday I discovered 200-ish 'duplicate meta titles' and 'duplicate meta descriptions' in GWMT. Uh oh, thought I. Expand the report out and the duplicates are in fact ?ref= versions of the same root URL. Double uh oh, thought I. Run, not walk, to google and do some Fu: http://is.gd/yJ3U24 (9 versions of the same page, in the index, the only variation being the ?ref= URI) Checked BING and it has indexed each root URL once, as it should. Situation now: Site no longer uses ?ref= parameter, although of course there still exists some external backlinks that use it. This was intentional and happened when we migrated. I 'reset' the URL parameter in GWMT yesterday, given that there's no "delete" option. The "URLs monitored" count went from 900 to 0, but today is at over 1,000 (another wtf moment) I also resubmitted the XML sitemap and fetched 5 'hub' pages as Google, including the homepage and HTML site-map page. The ?ref= URls in the index have the disadvantage of actually working, given that we transferred the URL structure and of course the webserver just ignores the nonsense arguments and serves the page. So I assume Google assumes the pages still exist, and won't drop them from the index but will instead apply a dupe content penalty. Or maybe call us a spam farm. Who knows. Options that occurred to me (other than maybe making our canonical tags bold or locating a Google bug submission form 😄 ) include A) robots.txt-ing .?ref=. but to me this says "you can't see these pages", not "these pages don't exist", so isn't correct B) Hand-removing the URLs from the index through a page removal request per indexed URL C) Apply 301 to each indexed URL (hello BING dirty sitemap penalty) D) Post on SEOMoz because I genuinely can't understand this. Even if the gap in verification caused GWMT to forget that we had set ?ref= as a URL parameter, the parameter was no longer in use because the verification only went missing when we relaunched the site without this tracking. Google is seemingly 100% ignoring our canonical tags as well as the GWMT URL setting - I have no idea why and can't think of the best way to correct the situation. Do you? 🙂 Edited To Add: As of this morning the "edit/reset" buttons have disappeared from GWMT URL Parameters page, along with the option to add a new one. There's no messages explaining why and of course the Google help page doesn't mention disappearing buttons (it doesn't even explain what 'reset' does, or why there's no 'remove' option).
Technical SEO | | Tinhat0 -
Overly Dynamic URLs
I have a site that I use to time fitness events and I like to post the results using query strings. I create a link to each event's results/gallery/etc. I don't need these pages crawled and I don't want them to hurt my seo. Can I put a "do not crawl" meta on them or will that hurt my overall positioning? What are my other options?
Technical SEO | | bobbabuoy0 -
Backlinks Indexing
Is there a way of indexing my backlinks?? I have a lot backlinks but Google can't find them
Technical SEO | | CodePlus0 -
Page not being indexed
Hi all, On our site we have a lot of bookmaker reviews, and we are ranking pretty good for most bookmaker names as keywords, however a single bookmaker seems to have been shunned by Google. For a search "betsafe" in Denmark, this page does not appear among the top 50: http://www.betxpert.com/bookmakere/betsafe All of our other review pages rank in top 10-20 for the bookmaker name as keyword. What to do if Google has "banned" a page? Best regards, Rasmus
Technical SEO | | rasmusbang0 -
How do I fix these duplicate URLs?
HI guys, I ran a report on my site and it shows some duplicate titles (example below). Do I need to add something to the htaccess file or another file to fix this? I understand that the search engines should only see 1 URL for the page. 2 pages have "Bikes for sale | used bikes | second hand bicycles" title pauslwebsite.com/bikes/ paulswebsite.com/bikes/index.asp Thanks
Technical SEO | | paulmund0 -
Why is a 301 redirected url still getting indexed?
We recently fixed a redirect issue in a website, and although it appears that the redirection is working fine, the url in question keeps on getting crawled, indexed and cached by google. The redirect was done a month ago, and google shows cached version of it, even for a couple of days ago. Manual checking shows that its being redirected, and also a couple of online tools i checked report a 301 redirect. Do you have any idea why this could be happening? The website I'm talking about is www.hotelmajestic.gr and its being redirected to www.hotel-majestic.gr
Technical SEO | | dim_d0