Creating an audiobook with help from OpenAI Whisper (Episode #T2)
Creating an audiobook with help from OpenAI Whisper (Episode #T2)
Ten Minute Projects for the Curious: https://www.youtube.com/@10minute1liners
In this video, I show how to build a m4b audiobook file from a set of audio files using very basic commands from OpenAI Whisper, ffmpeg and AtomicParsley. The code can be found at https://www.innovationincrisis.com/10minute1liners/episode-T2 and downloaded here.
The steps to create a m4b audiobook from audio files are as follows:
Concatenate audio files into one m4b file using FFMPEG
Run whisper on m4b file to transcribe and time stamp text from the audio
Edit the tsv file to create correct chapter/section/part delimiters and times
Create a chapters tag file and tag the m4b file
Add title, author and narrator to m4b file
Design and add cover art to m4b file
The steps can be done as follows (most as bash files with a one line command).