SoMi-ToM: Evaluating Multi-Perspective Theory of Mind in Embodied Social Interactions

NeurIPS 2025

⭐ If our project helps you, please give us a star on GitHub to support us!

SoMi Embodied Interaction Environment

SoMi is easily extensible and supports LVLM agents that control characters in the open-world game Minecraft and collaborate with other agents to achieve crafting goals. The interaction logs, game screenshots, and videos generated by the environment are used for the SoMi-ToM evaluation.

@article{fan2025somi,
  title={SoMi-ToM: Evaluating Multi-Perspective Theory of Mind in Embodied Social Interactions},
  author={Fan, Xianzhe and Zhou, Xuhui and Jin, Chuanyang and Nottingham, Kolby and Zhu, Hao and Sap, Maarten},
  journal={arXiv preprint arXiv:2506.23046},
  year={2025}
}

Requirements

Minecraft Java Edition (up to v1.21.1; v1.20.1 recommended)
Node.js (v14 or later)
OpenAI API Key
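
As a quick sanity check for the Node.js requirement above, the following is a minimal sketch that compares the running Node.js version against the stated minimum of v14. The helper name `meetsMinimumNode` and the constant `MIN_NODE_MAJOR` are illustrative and are not part of the SoMi codebase.

```javascript
// Minimal sketch: verify the "Node.js v14 or later" requirement.
// MIN_NODE_MAJOR and meetsMinimumNode are hypothetical names used for illustration.
const MIN_NODE_MAJOR = 14;

function meetsMinimumNode(versionString, minMajor = MIN_NODE_MAJOR) {
  // process.versions.node looks like "18.17.1"; a leading "v" is tolerated.
  const majorText = String(versionString).replace(/^v/, "").split(".")[0];
  const major = Number.parseInt(majorText, 10);
  // Unparseable input yields NaN, which is treated as "requirement not met".
  return Number.isInteger(major) && major >= minMajor;
}

const current = process.versions.node;
console.log(
  meetsMinimumNode(current)
    ? `Node.js ${current} satisfies the v${MIN_NODE_MAJOR}+ requirement.`
    : `Node.js ${current} is older than v${MIN_NODE_MAJOR}; please upgrade.`
);
```

Saving this to a file and running it with node before npm install should surface an outdated runtime early.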

Install and Run

Make sure you have the requirements above.
Clone or download this repository (big green button).
Rename keys.example.json to keys.json and fill in your API keys (you only need one).
In a terminal or command prompt, run npm install from the repository directory.
Clone or download the feature/minecraft-update branch of the Sotopia repository, then run the following commands from that checkout.

cd examples/experimental/minecraft_agents
uvicorn group_discussion_agents:app --reload --port 8080

# Open a new terminal
cd examples/experimental/minecraft_agents
export OPENAI_API_KEY=sk-  # Enter your OpenAI API key here
uv run aact run-dataflow group_discussion_agents.toml

Launch Minecraft Java Edition (version 1.20.1), open a Singleplayer world in Survival Mode, then choose Open to LAN and set the port to 55916.
Open a new terminal, then run node src/agent/index.js from this repository.
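
For the keys.json step above, a small check can catch an empty API key before the agents start. The sketch below assumes that keys.json is a flat JSON object mapping key names to strings and that the OpenAI entry is named `OPENAI_API_KEY`; both are assumptions, so check keys.example.json for the actual field names. The helper `findEmptyKeys` is hypothetical and not part of this repository.

```javascript
const fs = require("fs");
const path = require("path");

// Return the names from requiredNames whose values are missing or blank.
function findEmptyKeys(keys, requiredNames) {
  return requiredNames.filter((name) => {
    const value = keys[name];
    return typeof value !== "string" || value.trim() === "";
  });
}

// ASSUMPTION: the OpenAI entry is called OPENAI_API_KEY; adjust to match keys.example.json.
const REQUIRED = ["OPENAI_API_KEY"];
const keysPath = path.join(process.cwd(), "keys.json");

if (!fs.existsSync(keysPath)) {
  console.log("keys.json not found; rename keys.example.json to keys.json first.");
} else {
  try {
    const keys = JSON.parse(fs.readFileSync(keysPath, "utf8")) || {};
    const missing = findEmptyKeys(keys, REQUIRED);
    console.log(
      missing.length === 0
        ? "keys.json looks filled in."
        : `Missing or empty entries: ${missing.join(", ")}`
    );
  } catch (err) {
    console.log(`keys.json could not be parsed: ${err.message}`);
  }
}
```

Run it from the repository root so that the relative path to keys.json resolves.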

Bot Profiles

Bot profiles are toml files that define:

Crafting Goal

You and your friends need to craft 2 “boat”.

Knowledge - Specific Crafting Rule

The complete process for crafting a “boat” in Minecraft is as follows:

......

Patches

Some of the Node modules that we depend on have bugs. To add a patch, edit the affected file in your local node_modules directory, then run npx patch-package [package-name]

SoMi-ToM Benchmark

We propose the SoMi-ToM benchmark, designed to evaluate multi-perspective ToM in complex, embodied multi-agent social interactions. The benchmark is built on rich multimodal interaction data generated by the SoMi interaction environment, covering diverse crafting goals and social relationships. See dataset at SoMi-ToM.

🔥 Latest LVLM Benchmark Table

Performance of humans and leading closed-source and open-source LVLMs in the first-person evaluation (state inference). There are 350 questions for self-ToM reasoning and 700 questions for others’ ToM reasoning.

Performance of humans and leading closed-source and open-source LVLMs in the Third-Person Perspective ToM test (175 questions in total). Highest accuracy without CoT is shown in red bold, and with CoT in blue bold.

❤️ Acknowledgements

The SoMi-ToM benchmark references the following code repositories:

https://github.com/PrismarineJS/prismarine-viewer

https://github.com/kolbytn/mindcraft

https://github.com/ProKil/aact

https://sotopia.world/projects/sotopia

Thanks for their awesome work!

📺 Easter Egg: More AI in Minecraft!

For more fascinating videos on AI playing Minecraft, check out the Emergent Garden YouTube channel. The codebase for the AI in these videos comes from kolbytn/mindcraft.
