Existing Pages in Google Index and Changing URLs
-
Hi!!
I am launching a newly recoded site this week and had a another noobie question.
The URL structure has changed slightly and I have installed a 301 redirect to take care of that. I am wondering how Google will handle my "old" pages? Will they just fall out of the index? Or does the 301 redirect tell Google to rewrite the URLs in the index?
I am just concerned I may see an "old" page and a "new" page with the same content in the index. Just want to make sure I have covered all my bases.
Thanks!!
Lynn
-
Hi!! Thanks Mike! I didn't realize I was passing the SIDs (as not in the URL) but it makes sense I am. Will take this to a private question and let you know what I hear back.
Thanks for your help!
Lynn
-
I would be happy to help if I knew the answer, but I don't. I don't have session IDs in my URLs (I use cookie-based session management instead, mostly because I wanted clean URLs for bookmarking and SEO). Perhaps someone else who uses session IDs in URLs could answer (or else Google "session IDs in urls" and see what comes up. I found this one: http://www.searchengineguide.com/stoney-degeyter/why-session-ids-and-search-engines-dont.php )
-
Hi! I am in Google Webmaster Tools but haven't played with it extensively since I set it up and added my domain.
Looking at it seeing some crawl errors. Most of them have SID in them. Why would it be trying to crawl a session ID?
That brings up another question. The shopper is able to narrow down a category by manufacturer and price. These links will be crawled and indexed as well. Do I want them to be???
Anything you can offer would be appreciated. If it's too in-depth (meaning will take you too much time) can take this to a private question.
Thank you!
Lynn
-
Hi!! The only thing that has changed is the removal of /shop/ from the product pages URLs. Here is the 301 installed. I was told all was well with it. Would love another set of eyeballs if you can confirm it looks good. I am actually ranking for some things so am paranoid I am going to mess the site move up. Thanks for the info. I really appreciate it.
############################################
enable rewrites
Options +FollowSymLinks
RewriteEngine on
#RedirectMatch 301 ^/shop?/$ http://hiphound.com/
RedirectMatch 301 ^/shop?/$ http://hiphound.com
###########################################
-
Crawl rate depends on your site size, your site's rate of change, how fast you serve pages, and I'm sure a couple of other factors. If you're not yet on Google Webmaster Tools then you should be (it's free). It will show you pages/day that the googlebot is crawling your site.
-
Thank you!! Great article!
Follow-up - how long does it take for the URLs to be rewriten in the Google index? Is that done on the next crawl?
Thanks! I really appreciate the help.
Lynn
-
If you have set up the 301 correctly then if a user tries to visit the old page either via typing the old URL or via the search engine then they will be directed to the new content. When the site is reindexed the old results should fall out of the index.
-
You should be okay with 301s. See http://www.atlantaanalytics.com/practicing-web-analytics/how-does-google-analytics-handle-301-and-302-redirects/
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Should we change our URLs for SEO benefit?
Hi, I'm currently covering a maternity marketing role at i-escape and one our main objectives is to increase organic traffic to the website. i-escape has a selection of hand-picked boutique hotels, villas, lodges, guesthouses and apartments for people to discover and book. At the moment each hotel page URL follows this structure: https://www.i-escape.com/hotelname We'd like to change this to include some searchable words in the URL dependent on the type of hotel. For example: https://www.i-escape.com/boutique-hotels/hotelname or https://www.i-escape.com/boutique-apartments/hotelname If we do go ahead, we know we need to make sure all old style URLs canonically redirect to the new style. Is having the keyword in the URL important enough for us to change over 1500 URLs on the website? We have quite a high quality links pointing to these hotel pages URLs. Also, will this help us with navigation/user journeys/crawls as there will be a /boutique-hotels/hotelname rather than just /hotelname? Thanks so much all! Clair
Technical SEO | | iescape0 -
How to check if an individual page is indexed by Google?
So my understanding is that you can use site: [page url without http] to check if a page is indexed by Google, is this 100% reliable though? Just recently Ive worked on a few pages that have not shown up when Ive checked them using site: but they do show up when using info: and also show their cached versions, also the rest of the site and pages above it (the url I was checking was quite deep) are indexed just fine. What does this mean? thank you p.s I do not have WMT or GA access for these sites
Technical SEO | | linklander0 -
3,511 Pages Indexed and 3,331 Pages Blocked by Robots
Morning, So I checked our site's index status on WMT, and I'm being told that Google is indexing 3,511 pages and the robots are blocking 3,331. This seems slightly odd as we're only disallowing 24 pages on the robots.txt file. In light of this, I have the following queries: Do these figures mean that Google is indexing 3,511 pages and blocking 3,331 other pages? Or does it mean that it's blocking 3,331 pages of the 3,511 indexed? As there are only 24 URLs being disallowed on robots.text, why are 3,331 pages being blocked? Will these be variations of the URLs we've submitted? Currently, we don't have a sitemap. I know, I know, it's pretty unforgivable but the old one didn't really work and the developers are working on the new one. Once submitted, will this help? I think I know the answer to this, but is there any way to ascertain which pages are being blocked? Thanks in advance! Lewis
Technical SEO | | PeaSoupDigital0 -
Should change some pages with key stuffing?
Hello, i have a website with 1 years old and when i started, when i created the pages, theses have key stuffing (20-30-40 same words in meta descriptions and text, sometimes 15, sometimes 20 and sometimes 40). Since i saw this (about 4 months), i change that, doing new pages with 5-10 same keywords. Some pages with many keywords (20-30-40) work very fine and i would not lose the position in google, but i don't want to be penalized for that. Then, my question is: Should change the old pages with key stuffing or let them? Thanks so much.
Technical SEO | | pompero990 -
Number of indexed pages dropped dramatically
The number of indexed pages for my site was 1100 yesterday and today is 344 Anybody has any idea what can cause this. Thank you Sina
Technical SEO | | SinaKashani0 -
How to know which pages are indexed by Google?
So apparently we have some sites that are just duplicates of our original main site but aiming at different markets/cities. They have completely different urls but are the same content as our main site with different market/city changed. How do I know for sure which ones are indexed. I enter the url into Google and its not there. Even if I put in " around " it. Is there another way to query google for my site? Is there a website that will tell you which ones are indexed? This is probably a dumb question.
Technical SEO | | greenhornet770 -
De-indexing millions of pages - would this work?
Hi all, We run an e-commerce site with a catalogue of around 5 million products. Unfortunately, we have let Googlebot crawl and index tens of millions of search URLs, the majority of which are very thin of content or duplicates of other URLs. In short: we are in deep. Our bloated Google-index is hampering our real content to rank; Googlebot does not bother crawling our real content (product pages specifically) and hammers the life out of our servers. Since having Googlebot crawl and de-index tens of millions of old URLs would probably take years (?), my plan is this: 301 redirect all old SERP URLs to a new SERP URL. If new URL should not be indexed, add meta robots noindex tag on new URL. When it is evident that Google has indexed most "high quality" new URLs, robots.txt disallow crawling of old SERP URLs. Then directory style remove all old SERP URLs in GWT URL Removal Tool This would be an example of an old URL:
Technical SEO | | TalkInThePark
www.site.com/cgi-bin/weirdapplicationname.cgi?word=bmw&what=1.2&how=2 This would be an example of a new URL:
www.site.com/search?q=bmw&category=cars&color=blue I have to specific questions: Would Google both de-index the old URL and not index the new URL after 301 redirecting the old URL to the new URL (which is noindexed) as described in point 2 above? What risks are associated with removing tens of millions of URLs directory style in GWT URL Removal Tool? I have done this before but then I removed "only" some useless 50 000 "add to cart"-URLs.Google says themselves that you should not remove duplicate/thin content this way and that using this tool tools this way "may cause problems for your site". And yes, these tens of millions of SERP URLs is a result of a faceted navigation/search function let loose all to long.
And no, we cannot wait for Googlebot to crawl all these millions of URLs in order to discover the 301. By then we would be out of business. Best regards,
TalkInThePark0 -
Google picking up wrong page title
Hi, When searching for "Tottenham Forum" on google.co.uk (link below) http://www.google.co.uk/search?q=tottenham+forum&ie=utf-8&oe=utf-8&aq=t&rls=org.mozilla:en-GB:official&client=firefox-a The site I manage (THFCTalk.com) is listed as 4th in the search results, but was hacked a few months ago and the search results lists the page title as "Free Shipping. Order Cialis Online. - Online Pharmacy" when the actual page title of THFCTalk is not actually set at that. Any idea how to fix this so Google updates this header on the search results? - as it is surely putting people off from clicking on our search result
Technical SEO | | WalesDragon0