SEOMOZ crawl all my pages
-
SEOMOZ crawl all my pages including ".do" (all web pages after sign up )
. Coz of this it finishes all my 10.000 crawl page quota and be exposed to dublicate pages.
Google is not crawling pages that user reach after sign up. Because these are private pages for customers I guess
The main question is how we can limit SEOMOZ crawl bot. If the bot can stay out of ".do" java extensions it'll perfect to starting SEO analysis.
Do you know think about it?
Cheers
Example;
.do java extension (after sign up page) (Google can't crawl)
Normal Page (Google can crawl)
http://magaza.turkcell.com.tr/telefon/Apple-iPhone-3GS-8GB/1001694/.html
-
Hi There,
Thanks for writing in and sorry for the confusion.
It actually isn't possible for the SEOmoz crawler to access pages that require a user login. I went to those URLs and I was able to access the pages with out be logged in as a user, so they don't require user sign up to access them. Since these pages are linked to by other pages on your site and our crawler is not being blocked from these pages and the pages don't actually require a user to be signed in to access them, we will crawl them. I can't say why Google wouldn't be crawling those pages, but there is definitely nothing in place that would stop our crawler from accessing them.
If you would like to stop our crawler from accessing those pages in the future, you may consider adding a disallow directive in your robots.txt file using the user-agent rogerbot.
I hope this helps. Let me know if you have any other questions.
Chiaryn
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How long will old pages stay in Google's cache index. We have a new site that is two months old but we are seeing old pages even though we used 301 redirects.
Two months ago we launched a new website (same domain) and implemented 301 re-directs for all of the pages. Two months later we are still seeing old pages in Google's cache index. So how long should I tell the client this should take for them all to be removed in search?
Intermediate & Advanced SEO | | Liamis0 -
Does you page need to be unique to rank
What I mean by unique is : Let's imagine I want to rank one "seo ranking factors." In order to compete do I need to have (in terms of design) that is totally different than everything out there or can I rank with a page that is presented in a very similar way than everything out there but with different content. Thank you,
Intermediate & Advanced SEO | | seoanalytics0 -
Duplicate Page getting indexed and not the main page!
Main Page: www.domain.com/service
Intermediate & Advanced SEO | | Ishrat-Khan
Duplicate Page: www.domain.com/products-handler.php/?cat=service 1. My page was getting indexed properly in 2015 as: www.domain.com/service
2. Redesigning done in Aug 2016, a new URL pattern surfaced for my pages with parameter "products-handler"
3. One of my product landing pages had got 301-permanent redirected on the "products-handler" page
MAIN PAGE: www.domain.com/service GETTING REDIRECTED TO: www.domain.com/products-handler.php/?cat=service
4. This redirection was appearing until Nov 2016.
5. I took over the website in 2017, the main page was getting indexed and deindexed on and off.
6. This June it suddenly started showing an index of this page "domain.com/products-handler.php/?cat=service"
7. These "products-handler.php" pages were creating sitewide internal duplicacy, hence I blocked them in robots.
8. Then my page (Main Page: www.domain.com/service) got totally off the Google index Q1) What could be the possible reasons for the creation of these pages?
Q2) How can 301 get placed from main to duplicate URL?
Q3) When I have submitted my main URL multiple times in Search Console, why it doesn't get indexed?
Q4) How can I make Google understand that these URLs are not my preferred URLs?
Q5) How can I permanently remove these (products-handler.php) URLs? All the suggestions and discussions are welcome! Thanks in advance! 🙂0 -
Is it a problem to use a 301 redirect to a 404 error page, instead of serving directly a 404 page?
We are building URLs dynamically with apache rewrite.
Intermediate & Advanced SEO | | lcourse
When we detect that an URL is matching some valid patterns, we serve a script which then may detect that the combination of parameters in the URL does not exist. If this happens we produce a 301 redirect to another URL which serves a 404 error page, So my doubt is the following: Do I have to worry about not serving directly an 404, but redirecting (301) to a 404 page? Will this lead to the erroneous original URL staying longer in the google index than if I would serve directly a 404? Some context. It is a site with about 200.000 web pages and we have currently 90.000 404 errors reported in webmaster tools (even though only 600 detected last month).0 -
Big discrepancies between pages in Google's index and pages in sitemap
Hi, I'm noticing a huge difference in the number of pages in Googles index (using 'site:' search) versus the number of pages indexed by Google in Webmaster tools. (ie 20,600 in 'site:' search vs 5,100 submitted via the dynamic sitemap.) Anyone know possible causes for this and how i can fix? It's an ecommerce site but i can't see any issues with duplicate content - they employ a very good canonical tag strategy. Could it be that Google has decided to ignore the canonical tag? Any help appreciated, Karen
Intermediate & Advanced SEO | | Digirank0 -
410 pages
Do you need to optimize a 410 page like you do for 404 pages? What does a visitor see when a page is 410 compared to a 404?
Intermediate & Advanced SEO | | WebServiceConsulting.com0 -
Tips for improving this page
I have made a content placeholder for a keyword that will gain significant search volume in the future. Until then I am trying to optimize the page to rank when the game launches and the keyword gains volume. http://hiddentriforce.com/a-link-between-worlds/walkthrough/ Is there anything I can do to improve the optimization for the phrase 'a link between worlds walkthrough' A lot of my competitors are already setting up similar placeholder pages and doing the same thing. I have 2 fairly large gaming sites that will place a banner for my walkthrough on their site. I did not pay for the links. I do free writing/ other services in exchange for this. I have been sharing the link socially. It has almost 200 likes and a handful of shares, tweets, g+ votes
Intermediate & Advanced SEO | | Atomicx0 -
Why Is This Page Not Ranking?
Hi Mozzers, I can't rank (the page is nowhere on the Google grid that I can find) and I've not been able to move the needle at all on it. The page is http://www.lumber2.com/Western-Saddle-Pads-s/98.htm for keyword "western saddle pads." I'm inclined to think I'm cannabalizing the category with the products so I removed the word saddle from the majority of the product names on page. However, saddle pad or saddle pads is in the meta title for most if not all of the products. Do you think I'm cannabalizing with the product titles or is there something else going on? Thanks for any help.
Intermediate & Advanced SEO | | AWCthreads0