文字起こし　whisper - 文献斜め読み２

会議録の作成は必ずしも簡単でない
zoomの会議では、音声(.m4a)と動画(.mp4)の記録をのこすことができる
openAIによりwhisperが公開され、文字起こしに応用できるようになった。(2022/9/21)
zoomの音声記録の文字起こししてみた記録
参考サイト
ほぼ完璧な文字起こしができるAI音声認識Whisperのインストール | ジコログ
環境
- macbook air M1 (M1,2020) with 16G RAM
- macOS Montrey
- homebrew 3.6.3
- anaconda /jupyter notebook 6.3.0
ffmegのinstall
- brew install ffmpeg
whisperのinstall
- pip install git+https://github.com/openai/whisper.git
whisperの実行　on jupyter notebook
- import whisper
- model = whisper.load_model("large")

result = model.transcribe("audio??????????.m4a", verbose=True, language="ja")
print(result["text"])

40分のくらいzoom会議の音声をもとに、文字起こしが一晩でできていた
あまり熱を発しないmacbook airがほんのりと温かくなっていた
便利な世の中になったものである
追記　2022/10/26 いままでのanacondaは以前のintel macbookの環境を移行したのでintel版であった。したがって、rosetta2を使った運用だったので時間がかかったようだ。anaconda M1をいれて、実行してみたら２時間弱で文字起こしができたようだ