Skip to content

Latest commit

 

History

History
171 lines (124 loc) · 7.44 KB

File metadata and controls

171 lines (124 loc) · 7.44 KB

Documentation Index

Welcome to the Multi-Site PDF Scraper documentation! This guide will help you navigate the documentation based on your role and needs.

Quick Navigation

🚀 Getting Started

👥 By Role

For End Users

For Developers

For Operators/DevOps

For Contributors

Documentation Structure

docs/
├── README.md                          # This file - documentation index
├── CHANGELOG.md                       # Version history and release notes
├── TODO.md                           # Roadmap and future plans
│
├── development/                      # Developer guides and references
│   ├── DEVELOPER_GUIDE.md            # Core development guide
│   ├── BACKEND_DEVELOPER_GUIDE.md    # Backend development
│   ├── EXAMPLE_SCRAPER_WALKTHROUGH.md # Step-by-step scraper creation
│   ├── CONFIG_AND_SERVICES.md        # Configuration architecture
│   ├── ERROR_HANDLING.md             # Error patterns and exceptions
│   └── LOGGING_AND_ERROR_STANDARDS.md # Logging best practices
│
├── operations/                       # Deployment and operations
│   ├── DEPLOYMENT_GUIDE.md           # Production deployment
│   ├── RUNBOOK_COMMON_OPERATIONS.md  # Day-to-day operations
│   ├── BACKEND_MIGRATION_GUIDE.md    # Backend switching guide
│   ├── MIGRATION_AND_STATE_REPAIR.md # State management
│   ├── SECRETS_ROTATION.md           # Security procedures
│   └── troubleshooting/              # Troubleshooting guides
│       └── ragflow_scraper_audit.md  # RAGFlow debugging
│
├── reference/                        # Technical specifications
│   └── METADATA_SCHEMA.md            # Document metadata format
│
├── archive/                          # Historical documentation
│   ├── README.md                     # Archive index
│   ├── plans/                        # Historical planning docs
│   └── jules/                        # Design explorations
│
└── screenshots/                      # Application screenshots
    └── current.png

Documentation by Topic

Architecture

Configuration

Scrapers

Operations

Backends

Testing

Security

Troubleshooting

Documentation Standards

For Contributors

When adding or updating documentation:

  1. Placement: Choose the appropriate directory (operations, development, reference)
  2. Linking: Update this index when adding new documents
  3. Format: Use Markdown with clear headings and examples
  4. Maintenance: Update last-modified dates when making significant changes
  5. Audience: Write for the intended audience (users, developers, operators)

Style Guidelines

  • Use clear, concise language
  • Include code examples where helpful
  • Provide both quick reference and detailed explanations
  • Link to related documentation
  • Keep documents focused on a single topic
  • Use consistent formatting and structure

Getting Help

Can't find what you're looking for?

  1. Search: Use GitHub's search to find keywords
  2. Issues: Check existing issues for discussions
  3. Create Issue: Open a new issue if documentation is missing/unclear
  4. Contribute: Submit a PR to improve documentation

Recent Updates

See CHANGELOG.md for recent documentation changes and project updates.

Archived Documentation

Historical planning documents and implementation notes are preserved in archive/. These may be outdated but are kept for historical context.


Need something specific? Use the navigation above or the search function to find what you need. If documentation is missing or unclear, please open an issue!