Currently, each session auto-generates a skill. However, in a use case I tried (getting a music list in Spotify), it failed often. The generated skills also seem unlikely to help the AI avoid the same mistakes in the future.
I don't have a solution yet, but I want to record the issue here.
Skill 2: "Get a music list in Spotify for music like Glass Animals"
Session (session-20260222-170804) — 42 steps
Model: gpt-5.3-codex
This was a much more complex session with multiple obstacles:
1. Steps 1-12: Attempt to install Spotify via Play Store. Opened Play Store, dismissed promos, searched "Spotify", found the app, but installation repeatedly failed with "Can't install Spotify" errors.
2. Step 13: request_user_decision. Agent asked the user how to proceed. User chose "Use YouTube Music to build a similar list now".
3. Steps 14-23: YouTube Music fallback. Dismissed the error, launched YouTube Music via explicit intent, bypassed premium trial popup and onboarding, searched "glass animals", and tapped "Mix" to open a Glass Animals artist mix.
4. Steps 24-37: Second attempt to install Spotify. Despite the YouTube Music success, the agent circled back to try Spotify installation again via Play Store deep links. Hit the same "Can't install" error repeatedly.
5. Step 35: Second request_user_decision. User provided custom input (29 chars, content not shown).
6. Steps 37-41: Spotify Web fallback. Agent opened open.spotify.com/search/glass%20animals in Chrome, searched "glass animals radio", tapped the result.
7. Step 42: finish. Successfully landed on the Glass Animals Radio playlist on Spotify Web, showing tracks like "The Other Side Of Paradise" and "Electric Love" with related artists (BORNS, alt-J, Cage The Elephant).
Key takeaway: The agent encountered persistent Spotify install failures on the emulator, pivoted through YouTube Music, then ultimately succeeded via Spotify Web in Chrome: a 42-step session with multiple recovery strategies and two user decision points.
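To make the repeated-failure pattern in this session concrete, here is a small illustrative Python sketch. It is not how skills are generated today and not a proposed fix. The `Phase` type, the strategy labels, and `repeated_failures` are all hypothetical names invented for this example. Only the step ranges and outcomes come from the summary above.

```python
from dataclasses import dataclass


@dataclass
class Phase:
    steps: str       # step range as written in the session summary
    strategy: str    # hypothetical label for what the agent tried
    succeeded: bool  # outcome as reported in the summary


# Phases transcribed from the summary; the strategy labels are made up.
PHASES = [
    Phase("1-12", "install_spotify_play_store", False),
    Phase("14-23", "youtube_music_mix", True),
    Phase("24-37", "install_spotify_play_store", False),
    Phase("37-41", "spotify_web_in_chrome", True),
]


def repeated_failures(phases):
    """Return strategies that were tried again after already failing once."""
    failed = set()
    repeats = []
    for phase in phases:
        if phase.strategy in failed and phase.strategy not in repeats:
            repeats.append(phase.strategy)
        if not phase.succeeded:
            failed.add(phase.strategy)
    return repeats


print(repeated_failures(PHASES))
```

For this session the sketch flags the Play Store install, which the agent retried in steps 24-37 after it had already failed in steps 1-12. A generated skill that recorded this kind of "already failed" information might help the agent skip the retry, though that is only an assumption.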