I think it going into Multimodal (image, video etc support) reasoning would be a great segue to the project you've built so far.