This project explores how feasible it is to summarize audio by first transcribing it to text. It went through several iterations and models, from Wav2Vec2.0 to WhisperV3 and BART. Early transcription attempts with Wav2Vec2.0 were difficult because its output lacks the casing and punctuation found in the training data of summarization models such as BART and Pegasus. WhisperV3 was eventually chosen as the transcription model because its output resembles the text those summarization models were trained on. For the summarization step, I evaluated several Pegasus and BART models and chose a fine-tuned BART model.

The final pipeline uses WhisperV3 to transcribe audio and feeds the resulting text into a BART model, which produces the summary. The main drawback is that the pipeline relies heavily on transcript quality, since the BART model is highly sensitive to poorly formed input text. With a mean BERTScore of 0.85, the project demonstrates that transformer models can feasibly transcribe and summarize audio.
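As a rough illustration, here is a minimal sketch of the two-stage pipeline using Hugging Face `transformers` pipelines. The checkpoint names (`openai/whisper-large-v3`, `facebook/bart-large-cnn`), the audio filename, and the generation parameters are assumptions for illustration only; the repo's fine-tuned BART checkpoint would be substituted in place of the generic one.

```python
# Sketch: transcribe audio with Whisper, then summarize the transcript with BART.
# Checkpoints, filenames, and parameters here are illustrative, not the exact ones used in this repo.
from transformers import pipeline

# Whisper ASR pipeline; chunking lets it handle audio longer than 30 seconds
# and its output is cased, punctuated prose suitable for summarization models.
asr = pipeline(
    "automatic-speech-recognition",
    model="openai/whisper-large-v3",
    chunk_length_s=30,
)

# Generic BART summarizer; a fine-tuned BART checkpoint can be swapped in here.
summarizer = pipeline("summarization", model="facebook/bart-large-cnn")

# Transcribe, then summarize. Very long transcripts may exceed BART's input
# length and would need to be split into chunks before summarization.
transcript = asr("example_audio.wav")["text"]
summary = summarizer(
    transcript,
    max_length=150,
    min_length=40,
    do_sample=False,
)[0]["summary_text"]

print(summary)
```

Keeping the two stages as separate pipelines means either model can be swapped out (for example, a different Whisper size or BART fine-tune) without changing the rest of the script.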
About
This project is me experimenting with transformers, specifically WhisperV3 and BART, to summarize audio for a user.