Search audio file for pattern?
In theory—you’ll have to write your own app to do this—you would need to do this:
Run the WAV file through a speech recognition engine Sphinx Speech Recognition is a free open source engine.
Have your speech recognition engine spit out text in real-time in memory and also keep track of the sample length location. Everytime the string “deux radio” is found, spit out the sample location to a
splitlog.txt
file. Since this is in realtime you get an approximation of the sample location.Write your own wav splitter app: Gather all the sample location points from
splitlog.txt
and split the WAV accordingly.
There is a program called audiogrep which transcribes text using speech-to-text conversion and then creates audiocuts based on text searches.
I haven't seen any option to output time stamps instead of audio clips, you probably need to modify the source slightly to that (or contact the author and ask him). I also don't know how well it will handle French (or whatever language your announcer uses).
You could try :
-
TAPESTREA: Experimental software which is described as:
TAPESTREA (or taps) is a unified framework for interactively analyzing, transforming and synthesizing complex sounds. Given one or more recordings, it provides well-defined means to:
- identify points of interest in the sound and extract them into reusable - templates
- transform sound components independently of the background and/or other events
- continually resynthesize the background texture in a perceptually convincing manner
- controllably place event templates over backgrounds, using a novel - graphical user interface and/or scripts written in the ChucK audio programming language
-
Voice Converter: Shareware ($29.95, trial version with no time limit) which is described as a tool that can be a:
Pitch Editor, Formant Shift, Vibrato, Raspiness, Breathiness, Pitch Range Scaling, Pitch Smoothing, Time Stretching…