VisitSweden indexing error
-
Hi all
I just got a new site up about weekend travel for VisitSweden, the official tourism office of Sweden. Everything went fine except for some issues with indexing.
The site can be found here at weekend.visitsweden.com/no/
For some weird reason the "frontpage" of the site does not get indexed. What I have done myself to find the issue:
- Added sitemaps.xml
- Configured and added site to webmaster tools
- Checked 301s so they are not faulty
By doing a simple site:weekend.visitsweden.com/no/ you can see that the frontpage is simply not in the index. Also, by doing a cache:weekend.visitsweden.com/no/ I see that Google tries to index the page without the trailing slash after /no for some reason.
http://webcache.googleusercontent.com/search?q=cache:http://weekend.visitsweden.com/no/
Any smart ideas to get this fixed or where to start looking?
All help greatly appreciated
Kind regards
Fredrik
-
Oh my God, Fred! The weekend sub-domain has been completely blocked from being crawled by a robots.txt file sitting in the root of the sub-domain:
http://weekend.visitsweden.com/robots.txt
User-agent: *
Disallow: /
Please remove the '/' from there **immediately**.
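To see why that single '/' matters, here is a minimal Python sketch using the standard library's `urllib.robotparser`. The two robots.txt bodies are pasted in as strings rather than fetched, and the helper name `is_crawlable` and the "Googlebot" user agent string are only illustrative assumptions, not anything taken from the site itself.

```python
from urllib.robotparser import RobotFileParser


def is_crawlable(robots_txt: str, url: str, user_agent: str = "Googlebot") -> bool:
    """Return True if the given robots.txt body lets user_agent fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines, so no network request is made here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# The rules quoted above: every path is disallowed for every crawler.
BLOCKING = "User-agent: *\nDisallow: /\n"
# The same file with the '/' removed: an empty Disallow blocks nothing.
OPEN = "User-agent: *\nDisallow:\n"

FRONT_PAGE = "http://weekend.visitsweden.com/no/"

if __name__ == "__main__":
    print(is_crawlable(BLOCKING, FRONT_PAGE))  # False
    print(is_crawlable(OPEN, FRONT_PAGE))      # True
```

With `Disallow: /` nothing on the sub-domain may be crawled; with an empty `Disallow:` everything may.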
-
Hi Fred,
I just copied my first response:
Here is your redirection setup:
http://weekend.visitsweden.com/ being redirected via 301 to
http://weekend.visitsweden.com/no being redirected via 301 to
http://weekend.visitsweden.com/no/
So I would suggest removing the interim URL without the trailing slash after 'no'. Let the original homepage, http://weekend.visitsweden.com/, be redirected straight to http://weekend.visitsweden.com/no/ (the one with the trailing slash) via 301.
So your redirection setup should be as follows:
http://weekend.visitsweden.com/ - via 301 to - http://weekend.visitsweden.com/no/
Essentially, we are eliminating the extra hop in the redirect chain here. Please try this and post back.
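The difference between the two setups can be sketched in Python. This is only a model: the redirect rules are written as plain dictionaries instead of being read from the live server, and the function name `redirect_chain` is made up for illustration.

```python
from typing import Dict, List


def redirect_chain(start: str, redirects: Dict[str, str]) -> List[str]:
    """Follow the redirect rules from start and return every URL visited, in order."""
    chain = [start]
    while chain[-1] in redirects:
        target = redirects[chain[-1]]
        if target in chain:
            # A genuine loop: a URL redirects back to one already visited.
            raise ValueError("redirect loop at " + target)
        chain.append(target)
    return chain


ROOT = "http://weekend.visitsweden.com/"
NO_SLASH = "http://weekend.visitsweden.com/no"
WITH_SLASH = "http://weekend.visitsweden.com/no/"

# Setup described above: the homepage reaches /no/ in two 301 hops.
BEFORE = {ROOT: NO_SLASH, NO_SLASH: WITH_SLASH}
# Suggested setup: the homepage goes to /no/ in a single 301 hop.
AFTER = {ROOT: WITH_SLASH, NO_SLASH: WITH_SLASH}

if __name__ == "__main__":
    print(len(redirect_chain(ROOT, BEFORE)) - 1)  # 2 hops
    print(len(redirect_chain(ROOT, AFTER)) - 1)   # 1 hop
```

Against the real site, the same hop count can be read from the `Location` header of each 301 response.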
Best regards,
Devanur Rafi
-
Hi
Again thanks for your quick response. Unfortunately we still have the same issue even though we have performed many checks and tests. Any more smart ideas on how this can be corrected?
Regards
Fredrik
-
Hi Fred,
Please wait at least two weeks for the change to be reflected in Google. How long this takes depends on how popular your site is in terms of link profile, DA, PA, etc. I still see "http://weekend.visitsweden.com/no" (without the trailing slash) in Google's index. Let us wait for some time; there is nothing to worry about.
-
Hi again
The weirdest thing is that it does not seem to update. When I do a site:weekend.visitsweden.com/no/ the page is still nowhere to be found.
https://www.google.no/?gws_rd=ssl#q=site:weekend.visitsweden.com%2Fno%2F
Any ideas?
Again thanks
Fredrik
-
Hi Fred, now it's perfect. It should soon be reflected in Google and you will be able to see it in a site: search. Good luck, my friend.
Best regards,
Devanur Rafi
-
Thanks for the great input! I have now made the changes as per your suggestion.
Does it look better now?
Again thanks
Fredrik
-
Dear Fred,
Here is your redirection setup:
http://weekend.visitsweden.com/ being redirected via 301 to
http://weekend.visitsweden.com/no being redirected via 301 to
http://weekend.visitsweden.com/no/
So I would suggest removing the interim URL without the trailing slash after 'no'. Let the original homepage, http://weekend.visitsweden.com/, be redirected straight to http://weekend.visitsweden.com/no/ (the one with the trailing slash) via 301.
So your redirection setup should be as follows:
http://weekend.visitsweden.com/ - via 301 to - http://weekend.visitsweden.com/no/
This should fix the issue. Essentially, we are eliminating the extra hop in the redirect chain here.
By the way, both the URLs, with and without trailing slash appear in Google when searched with the following queries:
Best regards,
Devanur Rafi