ImageLingo: Text-Image Pair Generation for Language Learners

Authors

Suwon Yoon, Undergraduate, POSTECH
Taehyeok Ha, Undergraduate, POSTECH

Introduction

This paper introduces ImageLingo, a platform designed for language learning through automated generation of contextually and visually accurate sentence-image pairs. Utilizing GPT-4 API and Stable Diffusion XL/DALL-E 3, it offers a transformative learning experience.

System Architecture

The ImageLingo platform is built with a modular architecture, comprising the following components:

User Input Module: This component is responsible for capturing user inputs, which could be specific vocabulary words, phrases, or expressions. The interface is designed to be intuitive and easy to use, encouraging users to interact and specify their learning focus.
Sentence Generation Module: Leveraging the capabilities of GPT-4, this module generates sentences that contextually fit the user-provided vocabulary or expressions. The AI-driven sentence generation ensures variety and relevance, providing users with realistic examples of how words or phrases can be used.
Image Generation Module: This part of the system employs Stable Diffusion XL and DALL-E 3 to create visual representations of the generated sentences. The images are not only visually appealing but also contextually tied to the sentences, enhancing the learning experience by providing a visual context.
Translation Module: Integrating the Papago API, this module offers multilingual translation of the generated sentences. This is particularly useful for users who are learning a language different from their native one, as it provides a reliable translation for better understanding.
Unified Display Interface: The final component is the user interface, which is designed to be clean, intuitive, and user-friendly. It displays the generated sentences and corresponding images side by side, along with translations, creating an immersive learning environment.

Conclusion

ImageLingo exemplifies the integration of advanced technologies in language learning. By combining AI with user-centered design, it provides a tailored, engaging, and effective educational tool.

Installation

To run Image Lingo on your local machine, you need to have Python installed. Follow these steps:

Clone the repository

git clone https://github.com/hataehyeok/ImageLingo.git
cd ImageLingo

Install the required Python packages
```
pip install -r requirements.txt
```
Run the Streamlit application
```
streamlit run app.py
```

Development

This application is built using the Streamlit framework, which allows for rapid development of data applications. For contributions or modifications, refer to the Streamlit documentation to understand the structure and capabilities of the framework.

License

MIT License

Contact

For any additional questions or comments, please contact the repository owner.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
.streamlit		.streamlit
docs		docs
image		image
src		src
.gitignore		.gitignore
README.md		README.md
app.py		app.py
image.png		image.png
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ImageLingo: Text-Image Pair Generation for Language Learners

Authors

Introduction

System Architecture

Conclusion

Installation

Development

License

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ImageLingo: Text-Image Pair Generation for Language Learners

Authors

Introduction

System Architecture

Conclusion

Installation

Development

License

Contact

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages