Single URL not indexed
-
Hi everyone!
Some days ago, I noticed that one of our URLs (http://www.access.de/karriereplanung/webinare) is no longer in the Google index.
We never had any form of penalty, link warning etc. Our traffic by Google is constantly growing every month. This single page does not have an external link pointing to it - only internal links.
The page has been indexed all the time. The HTTP status code is 200, there is no noindex or something in the code. I submitted the URL on GWMT to let Google send it to the index. It was crawled successfully by Google, sent to the index 5 days ago - nothing happened, still not indexed.
Do you have any suggestions why this page is no longer indexed? It is well linked internally and one click away from the home page. There is still the PR of 5 showing, I always thought that pages with PR are indexed.......
-
Hi Nick,
first of all, thanx for your responses.
I already did the "fetch as Googlebot" thing 5 days ago. The page was successfully crawled and has been sent to the index successfully, according to Google Webmaster Tools. But in these 5 days, nothing changed.
I like your suggestions with the extra text. We will add some and do the "fetch as Googlebot" again and see what happens.
And you are absolutely right when it comes to the "value" of this page. It didn't send that much traffic, just a little. It is no big deal for us if this page doesn't get back into the index - but as someone doing SEO I want to figure out the problem Google seems to have with this page - just to test and learn for future problems
-
Replying to myself because I just noticed something I was wrong about.
I thought that the first box at the top was an excerpt of the page it links to, but it looks like it IS actually unique.
So you probably don't need to add anything, though expanding on that text in the first box might be a good idea.
Try to get a link to that page and see if that helps.
-
The thing is those words do appear elsewhere on the site, and Google can probably figure out that what is on this particular page is excerpts and links to the originals.
This normally isn't a huge problem, though. Lots of sites and blogs have category and tag pages that fit that description and ARE indexed (though many are not).
Before messing around with adding text which you may not really need to add, try doing a Fetch as Googlebot of the page in Google Webmaster Tools and hit the submit button when the fetch is complete. It may be that the page just got dropped by accident. If it doesn't return to the index after a few days, try adding a little totally unique content. Just a sentence or two about what these links are should be enough. I have done this on a few sites with lots of thin tag or category pages and it doesn't take a lot of text to get them into the index.
Partner link pages are also typically thin, but they may be indexed anyway if the links are useful, or ignored if it is simply a link exchange page that doesn't really have any value other than swapping links (which isn't much value). Like most things related to Google search, there isn't always a specific thing that will make the difference.
What you may want to consider is whether or not you want or need that page to appear in search, and if you think it could or should actually rank well for anything. If it doesn't matter, I wouldn't be too concerned unless there are many pages on the site that are not indexed.
-
Quite strange - I see someone visiting this URL in the Google-Analytics real-time-report.
Traffic source is direct, and Google labels this site as "/empty". Any ideas why?
-
Hi Nick,
I knwo the page is not full of content - but if you count the words, they are almost 300. And we do not have pages with the same content or links on our domain.
It could be a solution to add more text, but what about pages with partner links, for example? They normally have no content and lots of external links - so they should also be seen an "thin pages"?!
-
It may be worth generating and submitting an XML sitemap, with this page relatively high up in the map, and submitting it to Google. This then might prompt Google to crawl the page and index it.
ScreamingFrog is a free tool that generates an XML sitemap for you, while there are also free generators out there as well with just a quick google search.
-
Hi Tom,
well, honestly, we do not have a sitemap...
And no, there are no other pages with similar content on our domain.
As you said it: quite odd!
-
It may have been dropped because it was seen as "thin" content. Since most of the page is excerpts from and links to other pages, it is likely being ignored - especially if there are other pages that have the same excerpts and links. If you can add unique, some descriptive text to the page, it may do better.
And about the PageRank: The PR you can see in the Toolbar or other PR checks is usually very out of date. It could be that prior to your page's disappearance, it had a high PR and really does not now. While the visible PR can be used to get a pretty good idea of how Google ranks a page, I wouldn't give it much thought. Plenty of low PR pages rank very well for whatever search terms they are targeting, and lots of high PR pages don't rank very well.
-
That is quite odd - checked all those things from my end and found the same, but still not indexed.
My only other check at this stage would be to ask if its in the .xml sitemap that you have submitted in Google Webmaster Tools? And whether or not this page features similar content to any other pages on your site?
You've probably checked both already, but thought I'd ask just to be sure.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Sudden Indexation of "Index of /wp-content/uploads/"
Hi all, I have suddenly noticed a massive jump in indexed pages. After performing a "site:" search, it was revealed that the sudden jump was due to the indexation of many pages beginning with the serp title "Index of /wp-content/uploads/" for many uploaded pieces of content & plugins. This has appeared approximately one month after switching to https. I have also noticed a decline in Bing rankings. Does anyone know what is causing/how to fix this? To be clear, these pages are **not **normal /wp-content/uploads/ but rather "index of" pages, being included in Google. Thank you.
Technical SEO | | Tom3_150 -
Mobile website indexing
Hi we have a mobile version of our website at mobile.gardening-services-edinburgh.com its been live for 5, maybe 6 months, it has its own mobile-sitemap.xml have tried submitting this sitemap to google and for some reason it does not index these pages any ideas, most welcome
Technical SEO | | McSEO0 -
URL Format
Often we have web platforms that have a default URL structure that looks something like this www.widgetcompany.co.uk/widget-gallery/coloured-widgets/red-widgets This format is quite well structured but would it just be more effective to be www.widgetcompany.co.uk/red-widgets? I realise that it may depend on a lot of factors but generally is it better to have the shorter URL if targeting the key phrase "red widgets" One thing, it certainly looks a bit keyword stuffy with all those "widgets"
Technical SEO | | vital_hike0 -
No Index PDFs
Our products have about 4 PDFs a piece, which really inflates our indexed pages. I was wondering if I could add REL=No Index to the PDF's URL? All of the files are on a file server, so they are embedded with links on our product pages. I know I could add a No Follow attribute, but I was wondering if any one knew if the No Index would work the same or if that is even possible. Thanks!
Technical SEO | | MonicaOConnor0 -
My beta site (beta.website.com) has been inadvertently indexed. Its cached pages are taking traffic away from our real website (website.com). Should I just "NO INDEX" the entire beta site and if so, what's the best way to do this? Please advise.
My beta site (beta.website.com) has been inadvertently indexed. Its cached pages are taking traffic away from our real website (website.com). Should I just "NO INDEX" the entire beta site and if so, what's the best way to do this? Are there any other precautions I should be taking? Please advise.
Technical SEO | | BVREID0 -
AJAX and Bing Indexation
Hello. I've been going back and forth with Bing technical support regarding a crawling issue on our website (which I have to say is pretty helpful - you do get a personal, thoughtful response pretty quickly from Bing). Currently our website is set with a java redirect to send users/crawlers to an AJAX version of our website. For example, they come into - mysite.com/category..and get redirected to mysite.com/category#!category. This is to provide an AJAX search overlay which improves UEx. We are finding that Bing gets 'hung up' on these AJAX pages, despite AJAX protocol being in place. They say that if the AJAX redirect is removed, they would index and crawl the non-AJAX url correctly - at which point our indexation would (theoretically) improve. I'm wondering if it's possible (or advisable) to direct the robots to crawl the non-AJAX version, while users get the AJAX version. I'm assuming that it's the classic - the bots want to see exactly what the users see - but I wanted to post here for some feedback. The reality of the situation is the AJAX overlay is in place and our rankings in Bing have plummeted as a result.
Technical SEO | | Blenny0 -
Crawl reveals hundreds of urls with multiple urls in the url string
The latest crawl of my site revealed hundreds of duplicate page content and duplicate page title errors. When I looked it was from a large number of urls with urls appended to them at the end. For example: http://www.test-site.com/page1.html/page14.html or http://www.test-site.com/page4.html/page12.html/page16.html some of them go on for a hundred characters. I am totally stymied, as are the people at my ISP and the person who talked to me on the phone from SEOMoz. Does anyone know what's going on? Thanks So much for any help you can offer! Jean
Technical SEO | | JeanYates0 -
URL Structure Question
Hey folks, I have a weird problem and currently no idea how to fix it. We have a lot of pages showing up as duplicates although they are the same page, the only difference is the url structure. They seem to show up like: http://www.example.com/page/ and http://www.example.com/page What would I need to do to force the URLs into one format or the other to avoid having that one page counting as two? The same issue pops up with upper and lower case: http://www.example.com/Page and http://www.example.com/page Is there any solution to this or would I need to forward them with 301s or similar? Thanks, Mike
Technical SEO | | Malarowski0