Skip to content

improve auto-generated skills for future reuse #32

@reneehuang1

Description

@reneehuang1

Currently, each session already auto-generates a skill. However, according to a use case I tried (getting a music list in Spotify), it failed often. I also find that the generated skills are probably not very useful for helping the AI prevent mistakes in the future.

I am not sure about the solution yet, but I want to note down the issue here.

Skill 2: "Get a music list in Spotify for music like Glass Animals"

  Session (session-20260222-170804) — 42 steps

  Model: gpt-5.3-codex

  This was a much more complex session with multiple obstacles:

  1. Steps 1-12: Attempt to install Spotify via Play Store — Opened Play Store, dismissed promos, searched "Spotify", found the
   app, but installation repeatedly failed with "Can't install Spotify" errors.
  2. Step 13: request_user_decision — Agent asked the user how to proceed. User chose "Use YouTube Music to build a similar
  list now".
  3. Steps 14-23: YouTube Music fallback — Dismissed the error, launched YouTube Music via explicit intent, bypassed premium
  trial popup and onboarding, searched "glass animals", and tapped "Mix" to open a Glass Animals artist mix.
  4. Steps 24-37: Second attempt to install Spotify — Despite the YouTube Music success, the agent circled back to try Spotify
  installation again via Play Store deep links. Hit the same "Can't install" error repeatedly.
  5. Step 35: Second request_user_decision — User provided custom input (29 chars, content not shown).
  6. Steps 37-41: Spotify Web fallback — Agent opened open.spotify.com/search/glass%20animals in Chrome, searched "glass
  animals radio", tapped the result.
  7. Step 42: finish — Successfully landed on the Glass Animals Radio playlist on Spotify Web, showing tracks like "The Other
  Side Of Paradise" and "Electric Love" with related artists (BORNS, alt-J, Cage The Elephant).

  Key takeaway: The agent encountered persistent Spotify install failures on the emulator, pivoted through YouTube Music, then
  ultimately succeeded via Spotify Web in Chrome — a 42-step session with multiple recovery strategies and two user decision
  points.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions