Would be epic to use Ollama Vision and push a screenshot of the current app. Use cases are immense:
- Logic Pro: how can I EQ guitar track
- Excel: what formula should i write in cell A1 to SUM rows 3-15
- etc...
Then this opens the door for onit to also take action and click inside the app to accomplish tasks like browser/computer use solutions