Skip to content

Labels

Labels

  • Async/await patterns
  • Unexpected behavior or incorrect evaluation results
  • Maintenance tasks, tooling, and dependencies
  • CLI commands
  • Fundamental architecture and base classes
  • Synthetic data generation and test set management
  • Technical documentation, examples, and docstrings
  • Evaluator implementation
  • New functionality or capability
  • CI/CD pipelines, Docker, and PyPI publishing
  • LLM judge implementation
  • RagaliQ pytest plugin and integration logic
  • Code improvement without changing behavior
  • Output formatting: HTML, JSON, and terminal visuals
  • Exploring new RAG evaluation papers or methods
  • Testing infrastructure