SPOKE 02 · BYTE-LEVEL SEO

Most SEOs treat Googlebot as a single user agent. It’s actually a coordinated ecosystem of specialized crawlers, each with its own behavior, budget, and purpose.

If you optimize your site only for “Googlebot,” you are optimizing for one bot in a fleet of six. Each crawler in the ecosystem reads your site for a different reason, and treats your byte budgets, render rules, and content differently.

The Googlebot fleet

📱

Googlebot Smartphone

PRIMARY CRAWLER

The dominant indexer since mobile-first indexing. Simulates a Pixel-class Android device. What it sees is what gets indexed.

🖥️

Googlebot Desktop

SECONDARY

Crawls less frequently. Used for desktop-specific signals and to verify parity. If your mobile and desktop diverge, this catches it.

🖼️

Googlebot Image

SPECIALIST

Fetches and analyzes image content for Google Images and AI overviews. Lazy-loaded images often go uncrawled here.

🎬

Googlebot Video

SPECIALIST

Crawls video files and video schema for Video SERP results. Treats video sitemaps as authoritative.

💰

AdsBot Google

QUALITY CHECK

Independent of organic. Validates landing page quality for Google Ads. Blocking it in robots.txt can crash Quality Scores.

🔍

Inspection Tool

ON-DEMAND

Triggered manually via Search Console URL Inspection. Uses a slightly different render path. May behave differently from real crawlers.

CONFIRMED BY GOOGLE · MARCH 31, 2026

Google admits “Googlebot” is a historical misnomer

“Back in the early 2000s, Google had one product, so we had one crawler. The name ‘Googlebot’ stuck. Dozens of other clients, Google Shopping, AdSense, and more, all route their crawl requests through this same underlying infrastructure under different crawler names.”

Gary Illyes, Google Search team, “Inside Googlebot,” March 31, 2026

Three implications most SEOs miss:

  • Crawling is a shared SaaS platform inside Google. Each client (Search, Ads, Shopping, AdSense) sets its own user-agent, byte limit, and robots.txt token. Blocking one user-agent does not block the others.
  • For crawlers without a specified byte limit, the default is 15MB. Googlebot’s 2MB is more restrictive than most of the rest of the ecosystem.
  • Image and video crawler limits vary by product. A favicon fetch has a much smaller limit than an Image Search fetch.

Reference: developers.google.com/search/blog/2026/03/crawler-blog-post, on the same day Google also published an updated location for the crawler IP range JSON files.

Why each bot matters separately

Treating “Googlebot” as one entity is the single most common mistake in technical SEO.

Bot User-Agent string Purpose
Googlebot Smartphone Mozilla/5.0 (Linux; Android 6.0.1; Nexus 5X)… Primary mobile-first indexing
Googlebot Desktop Mozilla/5.0 AppleWebKit/537.36 (compatible; Googlebot/2.1)… Desktop parity verification
Googlebot Image Googlebot-Image/1.0 Image SERP + AI vision input
Googlebot Video Googlebot-Video/1.0 Video SERP + transcripts
AdsBot Google AdsBot-Google (+http://www.google.com/adsbot.html) Google Ads quality scoring
Inspection Tool Chrome-Lighthouse, GoogleOther variants On-demand verification
Apurv Singh

PRACTITIONER NOTE

“I worked on a D2C site where the dev team had blocked AdsBot in robots.txt ‘to save crawl budget.’ Their Google Ads Quality Scores dropped from 8 to 4 across the account within a week. Cost-per-click jumped 60 percent. Nobody connected the two events for a month. AdsBot is independent of Googlebot, treat the ecosystem as separate.”

Apurv Singh, Founder HQ Digital

How to verify each bot is reaching your site

Check your server access logs

Grep for each user-agent over the last 30 days. You should see hits from all six. If any bot is missing for 30+ days, something is blocking it, firewall, robots.txt, or rate limiting.

grep -i “googlebot\|adsbot” access.log | awk ‘{print $12}’ | sort | uniq -c

Use the Crawl Stats report

Search Console → Settings → Crawl Stats. Filter by Googlebot type. You will see hit volume, response times, and any drops. A sudden zero for any bot is a red flag.

Reverse DNS verification

Many spoofed crawlers pretend to be Googlebot. Always verify by doing reverse DNS lookup on the IP, legitimate Googlebot IPs resolve to googlebot.com or google.com.

CONTINUE THE SERIES

Next: How WRS Renders Your Pages

Web Rendering Service is a headless Chrome that runs after the initial crawl. Understand its limits before you build with React or Next.js.

Read Spoke 03 →