Google has now added new particulars that specify the three classes its Google crawlers fall into, they embody Googlebot, special-case crawlers and user-triggered fetchers.
As well as, Google now lists a JSON formatted file containing the checklist of IP addresses every of those completely different crawler varieties use.
Forms of Google crawlers. On the high of this Googlebot web page, Google listed these three crawler varieties:
- Googlebot – The primary crawler for Google’s search merchandise. Google says this crawler at all times respects robots.txt guidelines.
- Particular-case crawlers – Crawlers that carry out particular capabilities (comparable to AdsBot), which can or might not respect robots.txt guidelines.
- Consumer-triggered fetchers – Instruments and product capabilities the place the end-user triggers a fetch. For instance, Google Website Verifier acts on the request of a person or some Google Search Console instruments will ship Google to fetch the web page primarily based on an motion a person takes.
IP addresses. Google additionally listed the IP deal with ranges and reverse DNS masks for every sort:
What’s new. Right here is the part of the web page that was up to date; the remainder of the web page is usually unchanged.
Why we care. I consider Google made this transformation after they noticed among the reactions to the GoogleOther robotic they introduced the opposite day. This now explains how Google crawlers act, after they respect the robots.txt and how one can determine them higher.
Now, if you need to not block Google’s most important crawler, Googlebot, however you determine to dam the others, you possibly can higher determine these crawlers extra precisely.