Single URL not indexed
-
Hi everyone!
Some days ago, I noticed that one of our URLs (http://www.access.de/karriereplanung/webinare) is no longer in the Google index.
We never had any form of penalty, link warning etc. Our traffic by Google is constantly growing every month. This single page does not have an external link pointing to it - only internal links.
The page has been indexed all the time. The HTTP status code is 200, there is no noindex or something in the code. I submitted the URL on GWMT to let Google send it to the index. It was crawled successfully by Google, sent to the index 5 days ago - nothing happened, still not indexed.
Do you have any suggestions why this page is no longer indexed? It is well linked internally and one click away from the home page. There is still the PR of 5 showing, I always thought that pages with PR are indexed.......
-
Hi Nick,
first of all, thanx for your responses.
I already did the "fetch as Googlebot" thing 5 days ago. The page was successfully crawled and has been sent to the index successfully, according to Google Webmaster Tools. But in these 5 days, nothing changed.
I like your suggestions with the extra text. We will add some and do the "fetch as Googlebot" again and see what happens.
And you are absolutely right when it comes to the "value" of this page. It didn't send that much traffic, just a little. It is no big deal for us if this page doesn't get back into the index - but as someone doing SEO I want to figure out the problem Google seems to have with this page - just to test and learn for future problems
-
Replying to myself because I just noticed something I was wrong about.
I thought that the first box at the top was an excerpt of the page it links to, but it looks like it IS actually unique.
So you probably don't need to add anything, though expanding on that text in the first box might be a good idea.
Try to get a link to that page and see if that helps.
-
The thing is those words do appear elsewhere on the site, and Google can probably figure out that what is on this particular page is excerpts and links to the originals.
This normally isn't a huge problem, though. Lots of sites and blogs have category and tag pages that fit that description and ARE indexed (though many are not).
Before messing around with adding text which you may not really need to add, try doing a Fetch as Googlebot of the page in Google Webmaster Tools and hit the submit button when the fetch is complete. It may be that the page just got dropped by accident. If it doesn't return to the index after a few days, try adding a little totally unique content. Just a sentence or two about what these links are should be enough. I have done this on a few sites with lots of thin tag or category pages and it doesn't take a lot of text to get them into the index.
Partner link pages are also typically thin, but they may be indexed anyway if the links are useful, or ignored if it is simply a link exchange page that doesn't really have any value other than swapping links (which isn't much value). Like most things related to Google search, there isn't always a specific thing that will make the difference.
What you may want to consider is whether or not you want or need that page to appear in search, and if you think it could or should actually rank well for anything. If it doesn't matter, I wouldn't be too concerned unless there are many pages on the site that are not indexed.
-
Quite strange - I see someone visiting this URL in the Google-Analytics real-time-report.
Traffic source is direct, and Google labels this site as "/empty". Any ideas why?
-
Hi Nick,
I knwo the page is not full of content - but if you count the words, they are almost 300. And we do not have pages with the same content or links on our domain.
It could be a solution to add more text, but what about pages with partner links, for example? They normally have no content and lots of external links - so they should also be seen an "thin pages"?!
-
It may be worth generating and submitting an XML sitemap, with this page relatively high up in the map, and submitting it to Google. This then might prompt Google to crawl the page and index it.
ScreamingFrog is a free tool that generates an XML sitemap for you, while there are also free generators out there as well with just a quick google search.
-
Hi Tom,
well, honestly, we do not have a sitemap...
And no, there are no other pages with similar content on our domain.
As you said it: quite odd!
-
It may have been dropped because it was seen as "thin" content. Since most of the page is excerpts from and links to other pages, it is likely being ignored - especially if there are other pages that have the same excerpts and links. If you can add unique, some descriptive text to the page, it may do better.
And about the PageRank: The PR you can see in the Toolbar or other PR checks is usually very out of date. It could be that prior to your page's disappearance, it had a high PR and really does not now. While the visible PR can be used to get a pretty good idea of how Google ranks a page, I wouldn't give it much thought. Plenty of low PR pages rank very well for whatever search terms they are targeting, and lots of high PR pages don't rank very well.
-
That is quite odd - checked all those things from my end and found the same, but still not indexed.
My only other check at this stage would be to ask if its in the .xml sitemap that you have submitted in Google Webmaster Tools? And whether or not this page features similar content to any other pages on your site?
You've probably checked both already, but thought I'd ask just to be sure.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Redirect indexed lightbox URLs?
Hello all, So I'm doing some technical SEO work on a client website and wanted to crowdsource some thoughts and suggestions. Without giving away the website name, here is the situation: The website has a dedicated /resources/ page. The bulk of the Resources are industry definitions, all encapsulated in colored boxes. When you click on the box, the definition opens in a lightbox with its own unique URL (Ex: /resources/?resource=augmented-reality). The information for these colored lightbox definitions is pulled from a normal resources page (Ex: /resources/augmented-reality/). Both of these URLs are indexed, leading to a lot of duplicate indexed content. How would you approach this? **Things to Consider: ** -Website is built on Wordpress with a custom theme.
Technical SEO | | Alces
-I have no idea how to even find settings for the lightbox (will be asking the client today).
-Right now my thought is to simply disallow the lightbox URL in robots.txt and hope Google will stop crawling and eventually drop from the index.
-I've considered adding the main resource page canonical to the lightbox URL, but it appears to be dynamically created and thus there is no place to access (outside of the FTP, I imagine?). I'm most rusty with stuff like this, so figured I'd appeal to the masses for some assistance. Thanks! -Brad0 -
Category URL Pagination where URLs don't change between pages
Hello, I am working on an e-commerce site where there are categories with multiple pages. In order to avoid pagination issues I was thinking of using rel=next and rel=prev and cannonical tags. I noticed a site where the URL doesn't change between pages, so whether you're on page 1,2, or 3 of the same category, the URL doesn't change. Would this be a cleaner way of dealing with pagination?
Technical SEO | | whiteonlySEO0 -
No index on subdomains
Hi, We have a subdomain that is appearing in the search results - I want to hide this as it looks really bad. If I were to add the no index tag to the sub domain would URL would this affect the whole domain or just that sub domain? The main domain is vitally important - it is just that sub domain I need to hide. Many thanks
Technical SEO | | Creditsafe0 -
Link building with AddThis URL
We've begun using AddThis for tracking our social sharing. AddThis has been adding the snippet to the end of the URLs on our pages and we've been finding that people linking to us are linking to the URL with the snippet. AddThis says this isn't a problem for SEO. Is this correct? Here is an example: https://www.harborcompliance.com/information/how-to-start-a-non-profit-organization-in-colorado.php#.UunCfPldVig I want to make sure this is not affecting our SEO in any way, particularly that Google would see this as an affiliate or paid link since it has the "#". I may be crazy but I just want to make sure!
Technical SEO | | Harbor_Compliance0 -
How to determine which pages are not indexed
Is there a way to determine which pages of a website are not being indexed by the search engines? I know Google Webmasters has a sitemap area where it tells you how many urls have been submitted and how many are indexed out of those submitted. However, it doesn't necessarily show which urls aren't being indexed.
Technical SEO | | priceseo1 -
What to do with 302 redirects being indexed
Hi there, Our site's forums include permalinks that for some reason uses an intermediary URL that 302 redirects to the URL with the permalink anchor. For example: http://en.tradimo.com/learn/chart-analysis/time-frames/ In the comments, there is a permalink to the following URL; en.tradimo.com/co/50c450005f2b949e3200001b/ (there is no content here, and never has been). This URL 302 redirects to the following final URL: http://en.tradimo.com/learn/chart-analysis/time-frames/?offset=0&limit=20#50c450005f2b949e3200001b The problem is, Google is indexing the redirect URL (en.tradimo.com/co/50c450005f2b949e3200001b/) and showing duplicate content even though we are using the nofollow tag on these links. Ideally, we would directly use the last link rather than redirecting. Alternatively, I'd say a 301 redirect would be preferable. But if both aren't available, is there a way to get these pages out of the index? Is the canonical tag the best way? I really wish I could just add /co/ to the robots.txt file, but I think they would still be in the index, right? Thanks for your help!
Technical SEO | | etruvian0 -
Rel=canonical + no index
We have been doing an a/b test of our hp and although we placed a rel=canonical tag on the testing page it is still being indexed. In fact at one point google even had it showing as a sitelink . We have this problem through out our website. My question is: What is the best practice for duplicate pages? 1. put only a rel= canonical pointing to the "wanted original page" 2. put a rel= canonical (pointing to the wanted original page) and a no index on the duplicate version Has anyone seen any detrimental effect doing # 2? Thanks
Technical SEO | | Morris770