Medley is a tiny library for parsing documents in a range of formats, such as PDF, PPTX, HTML, and WAV.
- Parse documents of different formats for RAG applications
- Transcribe audio files (supports around 100 languages)
- Identify and sanitize PII before it reaches upstream services
- Text [.pdf, .pptx, .html]
- Audio [.wav]
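
The supported formats above amount to a mapping from file extension to modality (text or audio). Here is a minimal sketch of that idea in plain Python. It is not Medley's actual API: `MODALITY_BY_EXTENSION` and `detect_modality` are hypothetical names used only for illustration, and the real library may route files differently.

```python
from pathlib import Path

# Hypothetical lookup table built from the formats listed above.
# This is an illustration, not part of Medley itself.
MODALITY_BY_EXTENSION = {
    ".pdf": "text",
    ".pptx": "text",
    ".html": "text",
    ".wav": "audio",
}


def detect_modality(path: str) -> str:
    """Return "text" or "audio" based on the file extension (case-insensitive)."""
    suffix = Path(path).suffix.lower()
    try:
        return MODALITY_BY_EXTENSION[suffix]
    except KeyError:
        # Fail loudly on anything outside the listed formats.
        raise ValueError(f"unsupported file type: {suffix or '(none)'}") from None
```

A caller could use `detect_modality("slides.pptx")` to decide whether a file goes to a text parser or an audio transcriber, and handle `ValueError` for anything outside the listed formats.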