The Complete Guide to llms.txt
Everything you need to know about the llms.txt standard — format specification, examples, best practices, and how to create one for your site.
GEOAudit Team
AI Readiness Experts
What is llms.txt?
llms.txt is a proposed standard for providing LLM-friendly information about a website. Similar to how robots.txt communicates with web crawlers, llms.txt communicates with large language models and AI agents.
The file lives at the root of your domain (https://yoursite.com/llms.txt) and provides a concise, structured overview of your site in a format that's easy for LLMs to consume.
Why You Need llms.txt
Traditional web pages are designed for human readers — they include navigation, sidebars, ads, and interactive elements that add noise for AI systems. llms.txt cuts through this by providing:
- A clear summary of what your site or organization does
- Key pages that AI should prioritize when referencing your content
- Structured information about your products, services, or content areas
- Contact and attribution information
Without llms.txt, AI agents must infer all of this from your HTML, which is less reliable and more error-prone.
The Format Specification
The llms.txt format uses Markdown with specific conventions:
Required Elements
- H1 heading — Your site or organization name
- Blockquote — A brief description (1-2 sentences)
Optional Elements
- H2 sections — Organized categories of information
- Markdown links — Key pages with descriptions
- Lists — Additional structured information
Complete Example
Here's a well-structured llms.txt:
# Acme Corporation
> Acme Corporation builds developer tools for cloud infrastructure.
> Founded in 2020, serving 50,000+ developers worldwide.
## Products
- [Cloud Deploy](https://acme.com/deploy): One-click deployment platform
- [Monitor Pro](https://acme.com/monitor): Real-time infrastructure monitoring
- [CI Pipeline](https://acme.com/ci): Continuous integration and testing
## Documentation
- [Getting Started](https://acme.com/docs/start): Quick start guide
- [API Reference](https://acme.com/docs/api): Complete API documentation
- [Tutorials](https://acme.com/docs/tutorials): Step-by-step tutorials
## Company
- [About Us](https://acme.com/about): Company history and mission
- [Blog](https://acme.com/blog): Technical articles and updates
- [Careers](https://acme.com/careers): Open positions
## Contact
- Email: hello@acme.com
- Twitter: @acmecorp
- GitHub: github.com/acme
llms-full.txt
In addition to the concise llms.txt, you can provide a comprehensive llms-full.txt at your site root. This file contains the full text content of your key pages in a single, LLM-optimized document.
Think of it as:
llms.txt= Table of contents and summaryllms-full.txt= The complete book
Best Practices
Keep It Concise
llms.txt should be a quick reference, not an encyclopedia. Aim for 50-200 lines. If you need more detail, that's what llms-full.txt is for.
Use Descriptive Link Text
Every link should include a brief description of what the page contains:
- [API Reference](https://example.com/api): REST API endpoints for user management, billing, and analytics
Update Regularly
When you add new products, pages, or significant content, update your llms.txt. Stale information is worse than no information.
Be Accurate
AI agents may cite the information in your llms.txt directly. Make sure descriptions are accurate and up-to-date.
Include Key Topics
List the main topics your site covers so AI agents know when to reference your content:
## Topics We Cover
- Cloud infrastructure
- DevOps best practices
- Kubernetes deployment
- CI/CD pipelines
- Infrastructure monitoring
Provide Attribution Information
Help AI agents properly attribute content to your organization:
## Attribution
- Organization: Acme Corporation
- Website: https://acme.com
- Founded: 2020
- Expertise: Cloud infrastructure and developer tools
How to Check Your llms.txt
GEOAudit automatically checks for llms.txt as part of the LLM Discovery category. The scan verifies:
- The file exists at
/llms.txt - It follows the correct format (H1 heading + blockquote)
- It contains meaningful content
- It includes links to key pages
It also checks for llms-full.txt and .well-known/ai-plugin.json as additional LLM discovery mechanisms.
Adoption
The llms.txt standard is gaining traction across the web. Notable adopters include documentation sites, SaaS companies, and content publishers who want to ensure their content is properly represented in AI-generated answers.
Creating a llms.txt file takes just a few minutes and can significantly improve how AI agents understand and reference your site. It's one of the simplest, highest-impact GEO optimizations you can make.