Y-Detective Image & Language AI Assistance Program

-- Utilize AI to expand human limits.

Combining computer vision and natural language processing, the program provides the following functionalities:

Multiple object detection from an image and auto cropping
Image searching and selection based on descriptions in natural language
Image grneration
Ask questions about an image
Text to speech generation

For object cropping, please select the image from "static/imgorig/" directory; for image selection, select from "static/imggroups/{class name}/" directory. For both, the result will be saved to "static/result/".

GPT-4 Vision, DALL E image generation, and OpenAI text to speech are used. To use our AI tools, please obtain a Clarifai access key, create a .env file in the root directory, and put the key in the .env file as follows:

CLARIFAI_PAT="Your access key"

To launch our program, open "main.py" and click run. The webpage will be in local host.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
__pycache__		__pycache__
static		static
templates		templates
.gitignore		.gitignore
answer_parser.py		answer_parser.py
imgfactory.py		imgfactory.py
main.py		main.py
models.py		models.py
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Y-Detective Image & Language AI Assistance Program

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Y-Detective Image & Language AI Assistance Program

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages