SEO2Blue
# This file controls how LLMs and other generative AI models
# are permitted to use the content on seo2blue.com.

# --- General Directives ---
# Disallow all known general-purpose LLM trainers from using content for training.
# This is often done to protect proprietary content or prevent misuse.
# Caution: as written, the rule below blocks every compliant crawler,
# search engines included, not only LLM trainers.
# User-Agent: *
# Disallow: /

# --- Specific LLM/AI Bot Directives ---

# 1. Example: Allow a specific AI agent (like a citation bot) to crawl,
#    but disallow it from using the content for training.
# User-Agent: CitationBot-AI
# Allow: /
# Disallow-Model-Training: /

# 2. Example: Explicitly allow a known, benevolent AI tool
#    (e.g., a tool for accessibility or summarization) to access and use all content.
# User-Agent: GoodAI-Summarizer
# Allow: /
# Allow-Model-Training: /

# 3. Example: Disallow all LLM training *only* on the 'Contact' page,
#    to prevent private details (if they were present) from being scraped.
# User-Agent: *
# Disallow-Model-Training: /contact

# --- General Model Training Directive ---
# A broad directive that applies to all crawlers that support the 'Model-Training'
# directives (these are not part of the core robots.txt standard, so crawlers
# that do not recognize them will ignore them).
# This rule says: Do NOT use any content on the site for model training.
User-Agent: *
Disallow-Model-Training: /

# This directive could be used instead if you ONLY want to block specific directories.
# User-Agent: *
# Disallow-Model-Training: /private-research/
# Disallow-Model-Training: /beta-content/
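
# 4. Example (a hedged sketch, left commented out): the Model-Training directives
#    in this file only take effect for crawlers that choose to recognize them.
#    A complementary approach is to name individual AI training crawlers and use
#    the standard User-Agent/Disallow rules.
#    Assumption: GPTBot and CCBot are the user-agent tokens published by their
#    operators; verify each against the operator's documentation before enabling.
#    Note: Disallow stops these agents from crawling at all, which is broader
#    than blocking training use only.
# User-Agent: GPTBot
# Disallow: /
#
# User-Agent: CCBot
# Disallow: /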