1592+ crawlers perfilados
Directorio de Crawlers
Todos los crawlers web y bots de IA que rastreamos. Qué hace cada uno, quién lo opera y cómo proteger su contenido.
AwarioSmartBot — What It Is and How to Handle It
AwarioSmartBot is a web crawlers sent by Awario to discover and collect new and updated web data (that is further used by Internet marketers from all over the world).
AI CrawlerAI Training
Library Of Congress Web Archiving — What It Is and How to Handle It
The Library of Congress Web Archive manages, preserves, and provides access to archived web content selected by subject experts from across the Library, so that it will be available for researchers today and in the future. More information on the programme here: https://www.loc.gov/programs/web-archiving/about-this-program/ And information about crawling policy here: https://www.loc.gov/programs/web-archiving/for-site-owners/
ResearchResearch
FlipboardProxy — What It Is and How to Handle It
FlipboardProxy crawls and fetches content from publisher websites to format articles for display in the Flipboard social magazine app.
PreviewPreview
Coveo Bot — What It Is and How to Handle It
Coveo provide services to website, customer service and commerce solutions so they can feature relevant experiences to their end users; said services are based on a unified index which crawls websites when configured so by our customers.
Search EngineSearch Engine
Telegram Bot — What It Is and How to Handle It
Telegram Bot is a link preview crawler that fetches page metadata to generate rich previews when URLs are shared.
PreviewPreview
CaliberBot — What It Is and How to Handle It
Caliperbot crawls Conductor clients' and prospects' websites for HTML feature extraction to power Content Analytics features within our Searchlight web application.
SEO ToolSEO Tool
Ghost Inspector — What It Is and How to Handle It
Ghost Inspector is an automated browser testing bot that crawls websites and web applications to perform end-to-end testing, monitoring functionality and detecting bugs through scheduled test runs.
MonitoringMonitoring
Qwantbot — What It Is and How to Handle It
Qwantbot is the web crawler for Qwant, the French privacy-focused search engine. Based in Europe, it indexes websites while prioritizing user privacy and data protection, offering search results without tracking users.
Search EngineSearch Engine
Cxense — What It Is and How to Handle It
Cxense crawler collected content for Cxense's content recommendation and advertising platform (now part of Piano).
SEO ToolSEO Tool
SearchAtlas Bot — What It Is and How to Handle It
SearchAtlas Bot crawls customer websites to audit technical SEO issues, analyze content optimization opportunities, and provide AI-powered SEO recommendations for improving search engine rankings.
SEO ToolSEO Tool
MonSpark — What It Is and How to Handle It
MonSpark monitors website availability, network conditions, SSL certificate validity, and server performance, providing uptime checks and page speed tests to ensure optimal website functionality.
MonitoringMonitoring
HubSpot Crawler — What It Is and How to Handle It
HubSpot offers a full platform of marketing, sales, customer service, and CRM software — plus the methodology, resources, and support — to help businesses grow better. Get started with free tools, and upgrade as you grow.
OtherOther
Oh Dear — What It Is and How to Handle It
Oh Dear is a monitoring crawler that checks website availability, performance, and health.
MonitoringMonitoring
BrightEdge Bot — What It Is and How to Handle It
Autopilot is an SEO marketing automation tool that includes features for internal linking and image optimization. We crawl customer sites so that we can determine the best links to use on the site and to find images that need to be optimized.
AdvertisingAdvertising
Sentry Uptime Monitoring — What It Is and How to Handle It
Sentry Uptime Monitoring is a monitoring crawler that checks website availability, performance, and health.
MonitoringMonitoring
Yahoo Ad Monitoring — What It Is and How to Handle It
Yahoo Ad monitoring crawls webpages where Yahoo advertisements are served to monitor content quality, ad placement compliance, and advertiser policy adherence.
AdvertisingAdvertising
Mediatoolkitbot — What It Is and How to Handle It
Mediatoolkitbot is Determ's media monitoring crawler that scans the open internet searching for brand mentions, keywords, and phrases that users track, helping marketers identify relevant opportunities and monitor brand reputation.
AggregatorAggregator
IAS crawler — What It Is and How to Handle It
ias_crawler from Integral Ad Science performs digital ad verification, brand safety monitoring, and fraud detection across digital advertising channels to ensure quality media measurement.
AdvertisingAdvertising
Google Feed Fetcher — What It Is and How to Handle It
Google Feed Fetcher is a feed reader that fetches and processes RSS, Atom, and other content feeds.
Feed ReaderFeed Reader
Feedly — What It Is and How to Handle It
Feedly is an RSS reader bot that fetches and aggregates content from blogs, news sites, and newsletters. It provides threat intelligence, market intelligence, and news reading services by crawling and collecting articles from various sources.
Feed ReaderFeed Reader
Freshping — What It Is and How to Handle It
Freshping is a website uptime monitoring service by Freshworks that checks site availability and response times.
AccessibilityAccessibility
NixStatsMonitoringBot — What It Is and How to Handle It
NixStatsMonitoringBot is a monitoring crawler that checks website availability, performance, and health.
MonitoringMonitoring
Mojeek — What It Is and How to Handle It
Mojeek is an independent UK-based search engine crawler with its own index, focused on privacy and unbiased results.
Search EngineSearch Engine
Innologica — What It Is and How to Handle It
Innologica is a feed reader that fetches and processes RSS, Atom, and other content feeds.
Feed ReaderFeed Reader
Yahoo Slurp — What It Is and How to Handle It
Yahoo! Slurp crawls and indexes web content for Yahoo's search engine, collecting data to power search results and web discovery features.
Search EngineSearch Engine
VaultPress — What It Is and How to Handle It
VaultPress is a subscription service developed by Automattic, the company behind WordPress, that offers automated daily and real-time backups of WordPress websites onto WordPress.com's cloud servers. It is known for its ease of use, secure backups, and proactive security scanning.
OtherOther
PayPal — What It Is and How to Handle It
The PayPal webhooks is part of Paypal's Instant Payment Notification message service, automatically notifying merchants of events related to Paypal transactions.
OtherOther
LinkedInBot — What It Is and How to Handle It
LinkedInBot fetches page metadata to generate link previews when users share URLs on LinkedIn, extracting titles, descriptions, and images.
PreviewPreview
DuckAssistbot — What It Is and How to Handle It
DuckAssistBot is a web crawler that scans websites to collect content for DuckDuckGo's AI-assisted answers feature, which generates brief responses to search queries using natural language technology and Wikipedia sources.
AI CrawlerAI Assistant
Monsidobot — What It Is and How to Handle It
Monsidobot is a website scanning crawler that analyzes web content for accessibility compliance, quality assurance, and SEO issues as part of Acquia Optimize platform.
MonitoringMonitoring