PlayBench is a platform that evaluates AI models by having them compete in various games and creative tasks. Unlike traditional benchmarks that focus on text generation quality or factual knowledge, PlayBench tests models on skills like strategic thinking, pattern recognition, and creative problem-solving.

playsaurus-inc/play-bench

PlayBench - AI Benchmark platform

This repository contains the source code for PlayBench.ai, a web platform that publishes benchmark results for AI models tested across a range of games. The site is available at https://playbench.ai.

The website is built in PHP using the Laravel framework. The project was developed with 💖 by Playsaurus.

The website uses the following technologies:

🚀 Contributing

If you want to run the project locally, you can follow the instructions in the CONTRIBUTING.md file.

The project is currently under heavy development, so we are not accepting PRs yet. If you want to contribute in the meantime, feel free to open an issue. We will be opening the project to PRs very soon.

📜 Code of Conduct

By opening a PR or an issue, or by posting a comment, you agree to the Code of Conduct.

🛡️ Security Vulnerabilities

If you discover a security vulnerability, please email Javier Mora at javier@playsaurus.com. DO NOT open a public issue. More information can be found in the SECURITY.md file.

🔒 License

The code in this repository is licensed under the MIT License. See the LICENSE file for more information.
