Voice-Based-Virtual-Trial-Room

The project takes in voice input from the user and retrieves the most relevant clothing items from its database and overlays it on the image of the user. Our target was to create an automated virtual try on operated through voice.

Our model aims at generating photo-realistic try-on result while preserving both the character of clothes and details of human identity (posture, body parts, bottom clothes) through speech input from the user. This has the potential to revolutionize user’s experience while shopping for clothes online.

Voice to Text

We make use of Wav2Vec model for generating transcript for the user’s voice input.

Cloth Selection

● We employ CLIP Encoders to generate embeddings for clothes and user’s cloth description.

● Most similar cloth is selected using cosine similarity between Image embeddings and text embedding.

Overlaying Cloth Image

We overlay the cloth image using 3 modules involving

○ Semantic Generation Module (SGM),

○ Clothes Warping Module (CWM)

○ Content Fusion Module (CFM).

Results

The following are some of the results we achieved

References

● Wav2Vec2 : Unsupervised pre-training for speech recognition

● CLIP: Learning Transferable Visual Models From Natural Language Supervision

● Towards Photo-Realistic Virtual Try-On by Adaptively Generating↔Preserving Image Content

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
ACGPN_Person		ACGPN_Person
models		models
README.md		README.md
VBVTR.ipynb		VBVTR.ipynb
blackdress (1).wav		blackdress (1).wav
results.png		results.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Voice-Based-Virtual-Trial-Room

Voice to Text

Cloth Selection

Overlaying Cloth Image

Results

References

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Voice-Based-Virtual-Trial-Room

Voice to Text

Cloth Selection

Overlaying Cloth Image

Results

References

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages