-
Notifications
You must be signed in to change notification settings - Fork 8
Open
Description
How to deal with:
- numbers
- acronyms / symbols
- website / email spellings (e.g. use "dot", "at" )
I would go for trying as much as possible to have letter-based normalised representations of all the above such as:
- 100 -> one hundred (cento, cent)
- Hz -> (hertz), WHO -> double u aitch o (less sure about this one ...)
- www.rai.it -> vu vu vu punto rai punto it, pippo@pluto.com -> pippo at pluto dot com
of course this would be for the sake of comparison, no one would really like to have such transcripts as a final product ... we don't even need to output normalised text if not for a debug session.
Metadata
Metadata
Assignees
Labels
No labels