    Repositories list

    • DuckTrack

      Public
      Multimodal computer agent data collection program
      Python
      2515490 Updated Dec 5, 2025
    • The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores 68% on SWE-bench verified!
      Python
      306000 Updated Aug 21, 2025
    • SWE-agent

      Public
      SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
      Python
      1.9k000 Updated Aug 18, 2025
    • Releases from OpenAI Preparedness
      Python
      114000 Updated Aug 15, 2025
    • mle-bench

      Public
      MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
      Python
      196000 Updated Aug 14, 2025
    • deepthink

      Public
      Python
      0000 Updated Jul 27, 2025
    • 🚀 SWE-bench Goes Live!
      Python
      24000 Updated Jul 25, 2025
    • Open-source implementation of AlphaEvolve
      Python
      774702 Updated Jul 10, 2025
    • JavaScript
      14100 Updated Feb 19, 2025
    • prm

      Public
      Python
      312243 Updated Jan 17, 2025
    • site

      Public
      JavaScript
      0000 Updated Nov 22, 2023
    • arb

      Public
      Advanced Reasoning Benchmark Dataset for LLMs
      TypeScript
      34761 Updated Nov 19, 2023
    • chonk

      Public
      Python
      15200 Updated Oct 18, 2023
    • videorl

      Public
      Python
      15134 Updated Oct 6, 2023
    • 0000 Updated Aug 25, 2023
    • Community website
      JavaScript
      1010 Updated Aug 14, 2023
    • donut

      Public
      Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
      Python
      548200 Updated Nov 22, 2022