Releases: pftq/GrokBot
1.3
GrokBot1.2
Updated to use Grok 4 Fast, which makes Grok much more responsive and brings desktop interaction closer to real time.
GrokBot1.1
Added support for GPT-5, which takes slightly different JSON parameters than Grok 4 and GPT-4.1.
GPT-5 is finally able to click the start menu and even draw in Paint!
GrokBot1.0
GrokBot
Demo video: 2025-07-28.21-39-12_crop.mp4
Summary
This Windows script lets Grok 4 roam freely on your desktop, controlling your mouse and keyboard, even without your input. You can ask it to do pretty much anything and it will at least try; it can already one-shot tasks such as opening programs, typing into fields, and running system commands. Grok also retains its memories even if you close and reopen it!
The limitation right now is that Grok 4 is very slow (the video has the wait times cut out), but it is the only version of Grok on the API that is intelligent enough and accepts images. Hopefully this will improve over time on xAI's side.
Instructions
- Unzip the GrokBot1-0_EXE.zip archive
- Get an API key from https://x.ai/api
- Save the API key to apikey.txt
- Run GrokBot.exe - have fun!
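For anyone scripting around the tool or reading the source, the key-file step above can be sketched roughly as follows. This is only an illustration, not GrokBot's actual code: the helper name `load_api_key` is hypothetical, and the sketch assumes nothing beyond apikey.txt holding the key as plain text.

```python
from pathlib import Path


def load_api_key(path="apikey.txt"):
    """Return the API key stored in a plain-text file (hypothetical helper)."""
    key_file = Path(path)
    if not key_file.is_file():
        raise FileNotFoundError(f"{path} not found; save your API key there first")
    # "utf-8-sig" drops a byte-order mark if the editor added one;
    # strip() removes stray spaces and the trailing newline.
    key = key_file.read_text(encoding="utf-8-sig").strip()
    if not key:
        raise ValueError(f"{path} is empty")
    return key
```

Stripping whitespace matters in practice because a trailing newline in the file would otherwise end up inside the authorization header.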
Settings can be configured in settings.ini. For example, if Grok runs for too long without asking for your input, you can set maxNonResponse to a smaller number to force it to check in once in a while (although it may be better to simply ask it to check in more frequently, so that multi-step tasks are not interrupted). The source code is available for those who want to customize further or help improve things.
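To check what maxNonResponse is currently set to without opening the file by hand, something like the sketch below could work. It rests on assumptions: that settings.ini uses ordinary INI syntax, and that maxNonResponse is an integer. The section the key lives in is not documented here, so the hypothetical helper `read_max_non_response` searches every section and also tolerates a file with no section header.

```python
import configparser


def read_max_non_response(ini_text, default=None):
    """Find maxNonResponse in INI text, whichever section it lives in."""
    parser = configparser.ConfigParser()
    try:
        parser.read_string(ini_text)
    except configparser.MissingSectionHeaderError:
        # Assumption: the file may have no [section] line, so wrap it in a dummy one.
        parser = configparser.ConfigParser()
        parser.read_string("[settings]\n" + ini_text)
    for section in parser.sections():
        # configparser lowercases option names, so the lookup ignores case.
        if parser.has_option(section, "maxNonResponse"):
            return parser.getint(section, "maxNonResponse")
    return default
```

Typical use would be `read_max_non_response(open("settings.ini").read())`; the value 5 in a line such as `maxNonResponse = 5` is only an example, not a recommended setting.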
Bonus: You can also use a ChatGPT API key, in which case it will automatically use ChatGPT instead, but ChatGPT is nowhere near as capable and struggles even to click the Start menu.
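How the provider is chosen from the key is not described here. One plausible approach, shown purely as an assumption, is to look at the key prefix: xAI keys start with `xai-` and OpenAI keys start with `sk-`. The function name `pick_provider` is hypothetical; the two URLs are the public chat completions endpoints of each service.

```python
def pick_provider(api_key):
    """Guess the provider from the key prefix (an assumption, not GrokBot's confirmed logic)."""
    key = api_key.strip()
    if key.startswith("xai-"):
        return {"name": "xAI", "url": "https://api.x.ai/v1/chat/completions"}
    if key.startswith("sk-"):
        return {"name": "OpenAI", "url": "https://api.openai.com/v1/chat/completions"}
    # Fail loudly rather than send an unknown key to the wrong service.
    raise ValueError("unrecognized API key format")
```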
https://x.com/pftq/status/1950213965256020244