Tejas911/PictoSymphony-Generator

PictoSymphony

PictoSymphony is an image processing application that chains two pretrained models. Salesforce's blip-image-captioning-large model writes a text description of your input image, and CompVis's stable-diffusion-v1-4 model generates new images from that description. The result is a set of generated images that reinterpret your input as new visual compositions.

Getting Started

Follow these instructions to get a copy of the project up and running on your local machine for development and testing purposes.

Prerequisites

  • Python 3.x
  • Jupyter Notebook
  • Google Colab account

Installation

Clone the repository to your local machine.

git clone https://github.com/Tejas911/PictoSymphony.git

Usage

Open the Interface_PictoSymphony.ipynb file in Google Colab.

Follow the instructions in the notebook to load your image, generate its description, and produce new images from that description.
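If you prefer to open the notebook directly from GitHub instead of uploading it, Colab accepts URLs of the form `colab.research.google.com/github/<owner>/<repo>/blob/<branch>/<path>`. The helper below is a small sketch that builds such a link. The function name `colab_url` is hypothetical, the repository name is taken from the clone command above, and the branch name `main` is an assumption, so adjust both if they do not match the repository.

```python
from urllib.parse import quote


def colab_url(owner: str, repo: str, notebook_path: str, branch: str = "main") -> str:
    """Build a Colab link that opens a notebook hosted on GitHub.

    The default branch "main" is an assumption; pass the real branch if it differs.
    """
    prefix = "/".join(quote(part) for part in (owner, repo, "blob", branch))
    # quote() keeps "/" intact, so notebooks in subfolders also work.
    return f"https://colab.research.google.com/github/{prefix}/{quote(notebook_path)}"


# Repository name assumed from the clone URL in the Installation section.
link = colab_url("Tejas911", "PictoSymphony", "Interface_PictoSymphony.ipynb")
print(link)
```

Paste the printed link into a browser while signed in to your Google account.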

How It Works

  • Image Description:

    PictoSymphony utilizes the Salesforce/blip-image-captioning-large model to generate textual descriptions of input images.

  • Image Generation:

    The CompVis/stable-diffusion-v1-4 model is employed to generate visually appealing images based on the provided descriptions.

  • Google Colab Integration:

    We recommend running the Interface_PictoSymphony.ipynb notebook in Google Colab, which provides an interactive environment for loading images and viewing the results.
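The caption-then-generate flow described above can be sketched roughly as follows. This is not the notebook's actual code: the helper names `load_captioner`, `load_generator`, and `picto_pipeline`, the `count` and `device` parameters, and the `max_new_tokens=50` setting are all assumptions made for illustration. It relies on the `transformers` and `diffusers` packages and assumes a CUDA GPU is available (as in a Colab GPU runtime). The model imports sit inside the loader functions so the heavy downloads only happen when you call them.

```python
from typing import Any, Callable, List, Tuple

CAPTION_MODEL = "Salesforce/blip-image-captioning-large"
DIFFUSION_MODEL = "CompVis/stable-diffusion-v1-4"


def load_captioner() -> Callable[[Any], str]:
    """Return a function that maps a PIL image to a text caption using BLIP."""
    from transformers import BlipForConditionalGeneration, BlipProcessor

    processor = BlipProcessor.from_pretrained(CAPTION_MODEL)
    model = BlipForConditionalGeneration.from_pretrained(CAPTION_MODEL)

    def caption(image: Any) -> str:
        inputs = processor(image, return_tensors="pt")
        output_ids = model.generate(**inputs, max_new_tokens=50)  # assumed length cap
        return processor.decode(output_ids[0], skip_special_tokens=True)

    return caption


def load_generator(device: str = "cuda") -> Callable[[str, int], List[Any]]:
    """Return a function that maps (prompt, count) to a list of generated PIL images."""
    from diffusers import StableDiffusionPipeline

    pipe = StableDiffusionPipeline.from_pretrained(DIFFUSION_MODEL).to(device)

    def generate(prompt: str, count: int) -> List[Any]:
        return pipe(prompt, num_images_per_prompt=count).images

    return generate


def picto_pipeline(
    image: Any,
    captioner: Callable[[Any], str],
    generator: Callable[[str, int], List[Any]],
    count: int = 3,
) -> Tuple[str, List[Any]]:
    """Describe the input image, then generate `count` images from that description."""
    description = captioner(image)
    return description, generator(description, count)


# Example use in a notebook cell (downloads both models on first run):
#   from PIL import Image
#   image = Image.open("input.jpg").convert("RGB")   # hypothetical file name
#   description, images = picto_pipeline(image, load_captioner(), load_generator())
```

Passing the captioner and generator in as plain functions keeps the two stages independent, so either model can be swapped out or stubbed without touching the pipeline itself.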

Demo

[Three project images]
