Open
Conversation
…as "Text Extracted from Image"
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Added file attachment support for images. (PDF and more to come later)
Vision Model Integration for AkashChat using Ollama's LLaVA
This feature allows users to upload images to the chat and have them analyzed. Images are processed by Ollama's LLaVA vision model. The analysis is then appended to the user's message as context and sent to the AI model.
How It Works
Components
ImageUploadButton.tsx: A reusable component for the file upload buttonChatInput.tsx: Modified to include file upload and processingpages/api/vision.ts: API endpoint that processes images using Ollama's LLaVA modelutils/app/vision.ts: handles file uploads and conversionsNew Dependencies
axios: For making HTTP requests to the Ollama APIConfiguration
Install and set up Ollama:
ollama pull llavaIf your Ollama server is running on a different machine or port, update the API endpoint URL in
pages/api/vision.ts.Limitations