First impression of whisper
Tried out openai/whisper
Whisper is a general-purpose speech recognition model.
It also requires the command-line tool ffmpeg to be installed on your system,
# on MacOS brew install ffmpeg
OMG. Installing one package on brew takes so long
Had some errors trying to run whisper
Traceback ... ... [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:1125) ...
Fixed the error by running following command (change 3.8 to your version of python)
/Applications/Python\ 3.8/Install\ Certificates.command
Command I used for transcribing
whisper audio_en.mp3 --model base --language en whisper audio_ko.mp3 --model base --language ko
Using any other models bigger than 'base' was way too slow for my old macbook (2015).
Command line shows some sample results on the terminal. Generates transcribed files: *.vtt and *.txt
Awesome speech recognition for English. It can pick up my voice reading stuff in English with some accent flawlessly.
However, voice memo that contains a conversation between me and my then-4yo daughter in Korean wasn't as impressive. Much more room to improve on Korean and possibly other non-English languages.
Grateful that these things are available.