Linux command to concatenate audio files and output them to ogg

Here's my suggestion: Use mplayer and oggenc connected with a named pipe.

  • Use mplayer to decode the audio. It can play back a wide variety of audio (and video) formats, and it also can play multiple files.

  • Use oggencto encode the audio to Ogg Vorbis.

  • To eliminate the need for a temporary file, use a named pipe to transfer the data between encoder and decoder.

Putting that into a script:

#!/bin/sh
# Usage: ./combine2ogg destination.ogg source1.wma source2.wma ...
# Requires: mplayer, oggenc
destination="$1"
shift
readpipe="$destination.tmpr"
writepipe="$destination.tmpw"
mkfifo "$readpipe"
mkfifo "$writepipe"
cat "$writepipe" > "$readpipe" &
mplayer -really-quiet -slave -nolirc -vc null -vo null -ao "pcm:fast:file=$writepipe" "$@" &
oggenc --quiet -o "$destination" "$readpipe"
rm -f "$readpipe"
rm -f "$writepipe"

Explained:

  1. Take the destination file name from the first command line parameter.
  2. Remove the first command line parameter, leaving only the source file names.
  3. Create a name for a pipe for oggenc to read from.
  4. Create a name for a pipe for mplayer to write to
  5. Create the pipes.
  6. Use cat to continuously dump the writepipe to the readpipe (this helps avoid issues where mplayer may terminate, but prevents oggenc from thinking it's done when this happens)
  7. Decode the audio from the source files using mplayer. Options -really-quiet -slave -nolirc are there to disable messages and to make it not read the keyboard or the remote. Options -vc null -vo null are there to disable video encoding and output. The -ao option directs it to output the audio in WAV format to the named write pipe.
  8. While the previous command is running, simultaneously encode from the named read pipe into Ogg using oggenc.
  9. Remove the named pipes.

Stuff to left to improve: Terminating the script early if one of the commands fails (use set -e), but still properly cleaning up the fifo (trap the necessary signals).


SoX can handle a large number of audio formats, depending on what libraries it's compiled against, and can concatenate files with a simple command line

sox file1.wav file2.mp3 file3.flac outfile.ogg

I believe this only works if the source audio files all have the same number of channels, but I've not tested this.

As far as I know, SoX has zero support for .wma files, so converting at least those files with something like ffmpeg first is probably unavoidable.

ffmpeg -i infile.wma outfile.wav

I would use ffmpeg. To convert wma to ogg vorbis try:

ffmpeg -i sample.wma -acodec vorbis -aq 100 sample.ogg

or mp3:

ffmpeg -i input.wma -acodec libmp3lame output.mp3

you'll need lame installed for the mp3 convert. sudo apt-get install lame libmp3lame0

Cat, then convert doen't seem to work well, although you can find lots of references on the web saying you can do something like cat *.wma | ffmpeg -i - -acodec ... - this doesn't work on my machine - only the first file gets processed. There is a 'copy' codec for ffmpeg, but doesn't make much difference.

Doing the convert with ffmpeg first, then cat *.ogg > output.ogg worked for me.


To concatenate and transcode audio files, this is the workflow you want to follow:

  1. Decode input audio files to WAV/PCM data,
  2. Concatenate WAV,
  3. Encode WAV to output audio codec file.

Processing in these steps performs the concatenation on the WAV data, which is the easiest to deal with and least lossy. And since you want to transcode anyway, you'll have to decode to WAV somewhere in the process; might as well take advantage of it.

Here are the steps I recommend. We'll use ffmpeg for decoding, sox for concatenation, and oggenc for encoding. You could substitute other tools for any step -- mencoder and others work fine as a decoder, and any tool that encodes Ogg can be used as the encoder -- although I think you'll be hard-pressed to find a better tool than sox for the concatenation.

  1. find *.wma -exec ffmpeg -i {} outfile.{}.wav \;
    (this runs ffmpeg -i infile.wma outfile.infile.wma on each WMA file in the current directory)

  2. sox outfile* all.wav
    (note outfile is the same prefix we gave the output files in step 1)

  3. oggenc all.wav all.ogg you probably want some quality settings here, but this will give decent quality defaults. I did not get results I liked with ffmpeg, but you may prefer it, so I've included an example with ffmpeg below.

Note that the sox command won't work properly unless all the WAV files are the same format -- if some are 22kHz and some are 44.1kHz, for example, you need to identify them to sox via commandline switches, and specify which format you want the output in. I don't know any commandline tools as capable as sox for the concatenation step. You could possibly output to a raw PCM format in step 1 and use cat for step 2, but that would only work if they're in the same format, and you'd need to be extremely specific to both your decoder and encoder as to what format they should expect.

Encoding with ffmpeg: Lots of posts around the 'net complain that ffmpeg's builtin Vorbis encoder isn't very good. If your version is built with libvorbis support, use -acodec libvorbis instead. Sample commandline:

ffmpeg -i all.wav -acodec libvorbis -ac 2 -ab 128k all.ogg