Skip to content

Latest commit

 

History

History
51 lines (31 loc) · 1.55 KB

File metadata and controls

51 lines (31 loc) · 1.55 KB

🏗️ This documentation is a work in progress! It's also mostly AI generated right now 🏗️

Read my project notes about this app at: https://nathanclonts.com/text-to-audiobook-app-with-kubernetes/


Audiobookify

PDF to Speech Converter

Table of Contents


Introduction

The PDF to Speech Converter is a web application that allows users to upload PDF documents and receive high-quality spoken audio as output. Users can choose from multiple voice options, ensuring a personalized listening experience. The application processes the PDF by extracting and cleaning the text before converting it to speech using advanced text-to-speech (TTS) models.


Features

  • PDF Upload: Upload PDF files directly through the web interface.
  • Voice Selection: Choose from a variety of high-quality voices.
  • Text Cleaning: Automatic removal of headers, footers, and hyphenations for smooth narration.
  • Audio Playback and Download: Listen to the generated audio online or download it for offline use.
  • Asynchronous Processing: Efficient handling of large files without blocking the user interface.
  • Modular Design: Separate backend and frontend components for scalability and maintainability.

Project Structure

TODO

Prerequisites

TODO

Local Development

TODO