# ===================================================================== # Robots.txt for wellnesscoachfl.com # Mathew Jadan, BCND - Holistic Practitioner (The Pure Rx) # Serving: Royal Oak, MI & Davie, FL | USA, Canada, UK, Australia # ===================================================================== # COMPREHENSIVE AI CRAWLER PERMISSIONS # 163 AI Crawlers from AppWT Analytics AIDetector.php # Updated: April 8, 2026 # ===================================================================== # ===================================================================== # USA TIER 1: DOMINANT (OpenAI, Google, Microsoft — 75%+ market share) # ===================================================================== # OpenAI (ChatGPT, GPT-4o, GPT-5, SearchGPT, Operator) User-agent: GPTBot User-agent: ChatGPT-User User-agent: ChatGPT-Agent User-agent: OAI-SearchBot User-agent: ChatGPT Allow: / # Google (Gemini, Bard, NotebookLM, Vertex AI, Deep Research) User-agent: Google-Extended User-agent: Googlebot-Extended User-agent: GoogleOther User-agent: GoogleOther-Image User-agent: GoogleOther-Video User-agent: Google-CloudVertexBot User-agent: Google-NotebookLM User-agent: Google-Firebase User-agent: Google-Agent User-agent: GoogleAgent-Mariner User-agent: Gemini-Deep-Research Allow: / # Microsoft (Bing, Copilot) User-agent: Bingbot User-agent: BingPreview User-agent: AzureAI-SearchBot Allow: / # ===================================================================== # USA TIER 2: MAJOR (Anthropic, Perplexity, Meta, xAI) # ===================================================================== # Anthropic (Claude) User-agent: ClaudeBot User-agent: Claude-Web User-agent: Claude-User User-agent: Claude-SearchBot User-agent: anthropic-ai Allow: / # Perplexity User-agent: PerplexityBot User-agent: Perplexity-User Allow: / # Meta (Meta AI, Facebook, Instagram) User-agent: Meta-ExternalAgent User-agent: Meta-ExternalFetcher User-agent: FacebookBot User-agent: FacebookExternalHit User-agent: facebookexternalhit Allow: / # xAI (Grok) User-agent: xAI-Grok User-agent: x.AI Allow: / # ===================================================================== # USA TIER 3: APPLE, AMAZON, VOICE ASSISTANTS # ===================================================================== # Apple (Siri, Apple Intelligence) User-agent: Applebot User-agent: Applebot-Extended User-agent: SiriBot Allow: / # Amazon (Alexa, Q, Bedrock, BuyForMe) User-agent: Amazonbot User-agent: AlexaBot User-agent: amazon-kendra User-agent: AmazonBuyForMe User-agent: Amzn-SearchBot User-agent: Amzn-User User-agent: bedrockbot Allow: / # Microsoft Voice User-agent: CortanaBot Allow: / # ===================================================================== # USA TIER 4: AI SEARCH ENGINES & ASSISTANTS # ===================================================================== # You.com User-agent: YouBot Allow: / # Brave User-agent: BraveBot Allow: / # DuckDuckGo User-agent: DuckAssistBot User-agent: DuckDuckBot Allow: / # Phind User-agent: PhindBot Allow: / # iAsk User-agent: iaskBot Allow: / # Exa User-agent: ExaBot Allow: / # Pi / Inflection User-agent: PiBot Allow: / # Andi User-agent: Andibot Allow: / # Kagi User-agent: KagiBot Allow: / # Arc / The Browser Company User-agent: ArcBot Allow: / # ===================================================================== # USA TIER 5: AI PLATFORMS, RESEARCH & DEVELOPMENT TOOLS # ===================================================================== # Diffbot User-agent: Diffbot Allow: / # Common Crawl (feeds many LLMs) User-agent: CCBot Allow: / # Hugging Face User-agent: HuggingFaceBot Allow: / # GitHub Copilot User-agent: GitHubCopilotBot Allow: / # Devin (Cognition AI agent) User-agent: Devin Allow: / # Cloudflare User-agent: Cloudflare-AutoRAG Allow: / # Firecrawl User-agent: FirecrawlAgent Allow: / # Crawl4AI User-agent: Crawl4AI Allow: / # Crawlspace User-agent: Crawlspace Allow: / # Apify User-agent: ApifyBot User-agent: ApifyWebsiteContentCrawler Allow: / # Atlassian (Jira/Confluence AI) User-agent: atlassian-bot Allow: / # OpenAI Agents (Nova Act, Operator) User-agent: Nova-Act Allow: / # Anomura User-agent: Anomura Allow: / # BuddyBot User-agent: BuddyBot Allow: / # Brightbot User-agent: Brightbot Allow: / # bigsur.ai User-agent: bigsur.ai Allow: / # FriendlyCrawler User-agent: FriendlyCrawler Allow: / # Cotoyogi User-agent: Cotoyogi Allow: / # Datenbank Crawler User-agent: Datenbank-Crawler Allow: / # ===================================================================== # USA TIER 6: DATA, ANALYTICS & SEO AI CRAWLERS # ===================================================================== # Semrush User-agent: SemrushBot User-agent: SemrushBot-OCOB Allow: / # DataForSEO User-agent: DataForSeoBot Allow: / # Moz User-agent: DotBot Allow: / # Ahrefs User-agent: AhrefsBot Allow: / # Majestic User-agent: MJ12bot Allow: / # Meltwater User-agent: MeltwaterBot Allow: / # Awario User-agent: Awario User-agent: AwarioBot User-agent: AwarioSmartBot User-agent: AwarioRssBot Allow: / # Echobox User-agent: EchoboxBot Allow: / # Factset User-agent: Factset_spyderbot Allow: / # Sentibot User-agent: Sentibot Allow: / # Turnitin User-agent: TurnitinBot Allow: / # Webz.io User-agent: Omgilibot User-agent: Omgili User-agent: webzio-extended Allow: / # Veritone User-agent: VeritoneBot Allow: / # ICC Crawler User-agent: ICC-Crawler Allow: / # ===================================================================== # USA TIER 7: AI IMAGE, VIDEO & CONTENT GENERATION # ===================================================================== # ImagesiftBot User-agent: ImagesiftBot Allow: / # img2dataset (LAION) User-agent: img2dataset Allow: / # Stability AI User-agent: StabilityBot Allow: / # ===================================================================== # USA TIER 8: SOCIAL & MESSAGING AI # ===================================================================== # LinkedIn User-agent: LinkedInBot Allow: / # Slack User-agent: Slackbot Allow: / # Snap User-agent: SnapBot Allow: / # Reddit User-agent: Redditbot Allow: / # Telegram User-agent: TelegramBot Allow: / # WhatsApp User-agent: WhatsApp Allow: / # Discord User-agent: Discordbot Allow: / # Pinterest User-agent: Pinterestbot Allow: / # ===================================================================== # CANADA # ===================================================================== # Cohere User-agent: cohere-ai User-agent: CohereBot User-agent: cohere-training-data-crawler Allow: / # Aranet (Canadian AI search) User-agent: Aranet-SearchBot Allow: / # ===================================================================== # EUROPE # ===================================================================== # Mistral AI (France) User-agent: MistralBot Allow: / # Aleph Alpha (Germany) User-agent: AlephAlphaBot Allow: / # Channel3 (EU) User-agent: Channel3Bot Allow: / # Ecosia (Germany — AI search) User-agent: EcosiaBot Allow: / # Qwant (France) User-agent: QwantBot Allow: / # ===================================================================== # CHINA # ===================================================================== # DeepSeek User-agent: DeepSeekBot Allow: / # ByteDance (Doubao, TikTok) User-agent: Bytespider User-agent: ByteDance User-agent: TikTokSpider Allow: / # Baidu (ERNIE) User-agent: Baiduspider User-agent: Baiduspider-render Allow: / # Alibaba (Qwen / Tongyi Qianwen) User-agent: AlibabaBot User-agent: QwenBot Allow: / # Huawei (PanGu) User-agent: PanguBot Allow: / # Tencent User-agent: TencentTraveler User-agent: Sosospider Allow: / # Sogou User-agent: Sogou User-agent: SogouSpider Allow: / # 360 (Qihoo) User-agent: 360Spider Allow: / # Yisou User-agent: YisouSpider Allow: / # ChatGLM (Zhipu AI) User-agent: ChatGLM-Spider Allow: / # Kimi (Moonshot AI) User-agent: KimiBot Allow: / # ===================================================================== # ASIA-PACIFIC (Korea, Japan, India, Southeast Asia) # ===================================================================== # Naver (South Korea — Clova X) User-agent: Yeti User-agent: NaverBot Allow: / # Samsung (South Korea) User-agent: SamsungBot Allow: / # LINE (Japan) User-agent: LINEBot Allow: / # Rakuten (Japan) User-agent: RakutenBot Allow: / # Coccoc (Vietnam) User-agent: CocCocBot Allow: / # Seekr (Asia-Pacific AI) User-agent: SeekrBot Allow: / # ===================================================================== # MIDDLE EAST & AFRICA # ===================================================================== # Falcon AI (UAE — Technology Innovation Institute) User-agent: FalconBot Allow: / # Jais (UAE — Inception/G42) User-agent: JaisBot Allow: / # PetalBot (Huawei — serves Middle East & Africa markets) User-agent: PetalBot Allow: / # ===================================================================== # AI DATA COLLECTION, TRAINING & RESEARCH # ===================================================================== # Timpi (decentralized search) User-agent: Timpibot Allow: / # VelenPublicWebCrawler User-agent: VelenPublicWebCrawler Allow: / # Kangaroo LLM User-agent: Kangaroo Bot Allow: / # AI2 (Allen Institute for AI) User-agent: AI2Bot User-agent: AI2Bot-Dolma User-agent: Ai2Bot-DeepResearchEval Allow: / # ISSCyberRiskCrawler User-agent: ISSCyberRiskCrawler Allow: / # aiHitBot User-agent: aiHitBot Allow: / # Echobot User-agent: Echobot Allow: / # NeevaBot (Neeva AI search — acquired by Snowflake) User-agent: NeevaBot Allow: / # Scrapy-based AI crawlers User-agent: Scrapy Allow: / # OpenBot User-agent: OpenBot Allow: / # Nicecrawler User-agent: Nicecrawler Allow: / # Iframely User-agent: Iframely Allow: / # Embedly User-agent: Embedly Allow: / # PaperLiBot User-agent: PaperLiBot Allow: / # Quora Link Preview User-agent: QuoraBot Allow: / # Notion User-agent: Notion Allow: / # ===================================================================== # WILDCARD CATCH-ALL (Future Crawlers) # ===================================================================== # Catch any AI crawler not listed above User-agent: * Allow: / Crawl-delay: 2 # ===================================================================== # ALLOW IMPORTANT FILES (site-specific) # ===================================================================== Allow: /css/ Allow: /js/ Allow: /images/ Allow: /manifest.json Allow: /.well-known/ # ===================================================================== # SITEMAPS # ===================================================================== Sitemap: https://wellnesscoachfl.com/sitemap.xml Sitemap: https://wellnesscoachfl.com/sitemap_index.xml # AI DISCOVERY & TRANSPARENCY FILES llms.txt: https://wellnesscoachfl.com/llms.txt llms-full.txt: https://wellnesscoachfl.com/llms-full.txt # Host declaration Host: https://wellnesscoachfl.com # ===================================================================== # Mathew Jadan, Holistic Practitioner (The Pure Rx) # Locations: Royal Oak, MI & Davie, FL # Serving: USA, Canada, UK, Australia # Phone: (586) 747-4578 | (954) 361-1508 # =====================================================================