Opened 2 weeks ago
Last modified 2 weeks ago
#11398 new defect
"avfoundation" audio capture had missing samples randomly
Reported by: | drobinson | Owned by: | |
---|---|---|---|
Priority: | normal | Component: | avdevice |
Version: | 7.1 | Keywords: | avfoundation |
Cc: | MasterQuestionable | Blocked By: | |
Blocking: | Reproduced by developer: | no | |
Analyzed by developer: | yes |
Description
Summary of the bug:
I am using macOS 15.1 (Sequoia) on a 2021 Apple M1 Pro MacBook Pro 14-inch laptop, running FFmpeg version 7.1 installed via Homebrew.
Audio captured using AVFoundation through any audio codec (AAC, MP3, PCM, or -c:a copy) produces audible glitches in the form of missing samples. This issue occurs regardless of the sound device selected (built-in audio, USB audio, etc.).
Originally, I ran into this issue while attempting to capture both video and audio and encode them using libx264 and AAC. However, I found that the issue persists even in the most basic use case—just recording audio without video encoding.
Steps to Reproduce:
- Record a known waveform with the following command:
ffmpeg -f avfoundation -i ":0" -c:a copy output.wav
- Listen to output.wav. I notice frequent random audible ticks.
- Inspect the waveform of output.wav in an audio editing tool (e.g., Reaper). I observed that the waveform exhibits discontinuities, where a sequence of input samples, such as (1, 2, 3, 4, 5, 6, 7, 8, 9), results in an output sequence like (1, 2, 3, 8, 9). In this example, samples 4, 5, 6, and 7 are missing, and sample 8 immediately follows sample 3. These missing samples occur randomly, typically every 3000-10000 samples, with no discernible pattern.
Here is an example of a ramp waveform I used with the glitch circled in red:
Expected Behavior:
The output audio should accurately represent the input without any missing or skipped samples.
Actual Behavior:
There are consistent audio dropouts, where specific sections of audio are completely missing, resulting in a glitchy sound. This happens with every audio codec tested (AAC, MP3, PCM) and with different sound devices (built-in, USB, etc.).
Details:
% ffmpeg -f avfoundation -i ":0" -c:a copy output.wav ffmpeg version 7.1 Copyright (c) 2000-2024 the FFmpeg developers built with Apple clang version 16.0.0 (clang-1600.0.26.4) configuration: --prefix=/opt/homebrew/Cellar/ffmpeg/7.1_4 --enable-shared --enable-pthreads --enable-version3 --cc=clang --host-cflags= --host-ldflags='-Wl,-ld_classic' --enable-ffplay --enable-gnutls --enable-gpl --enable-libaom --enable-libaribb24 --enable-libbluray --enable-libdav1d --enable-libharfbuzz --enable-libjxl --enable-libmp3lame --enable-libopus --enable-librav1e --enable-librist --enable-librubberband --enable-libsnappy --enable-libsrt --enable-libssh --enable-libsvtav1 --enable-libtesseract --enable-libtheora --enable-libvidstab --enable-libvmaf --enable-libvorbis --enable-libvpx --enable-libwebp --enable-libx264 --enable-libx265 --enable-libxml2 --enable-libxvid --enable-lzma --enable-libfontconfig --enable-libfreetype --enable-frei0r --enable-libass --enable-libopencore-amrnb --enable-libopencore-amrwb --enable-libopenjpeg --enable-libspeex --enable-libsoxr --enable-libzmq --enable-libzimg --disable-libjack --disable-indev=jack --enable-videotoolbox --enable-audiotoolbox --enable-neon libavutil 59. 39.100 / 59. 39.100 libavcodec 61. 19.100 / 61. 19.100 libavformat 61. 7.100 / 61. 7.100 libavdevice 61. 3.100 / 61. 3.100 libavfilter 10. 4.100 / 10. 4.100 libswscale 8. 3.100 / 8. 3.100 libswresample 5. 3.100 / 5. 3.100 libpostproc 58. 3.100 / 58. 3.100
Input #0, avfoundation, from ':0': Duration: N/A, start: 10114.532792, bitrate: 3072 kb/s Stream #0:0: Audio: pcm_f32le, 48000 Hz, stereo, flt, 3072 kb/s File 'output.wav' already exists. Overwrite? [y/N] y Stream mapping: Stream #0:0 -> #0:0 (copy) Output #0, wav, to 'output.wav': Metadata: ISFT : Lavf61.7.100 Stream #0:0: Audio: pcm_f32le ([3][0][0][0] / 0x0003), 48000 Hz, stereo, flt, 3072 kb/s Press [q] to stop, [?] for help size= 1792KiB time=00:00:07.71 bitrate=1903.5kbits/s speed=1.28x
Attachments (2)
Change History (5)
comment:1 by , 2 weeks ago
Analyzed by developer: | set |
---|---|
Cc: | added |
Component: | undetermined → avdevice |
Keywords: | audio dropouts removed |
Summary: | AVFoundation audio recordings have audible ticks and are missing samples → "avfoundation" audio capture had missing samples randomly |
by , 2 weeks ago
Attachment: | avfoundation_audio_fifo.patch added |
---|
comment:2 by , 2 weeks ago
I had a chance to do some more investigation.
It seems like the issue is related to captureOutput:didOutputSampleBuffer:fromConnection
callback in the AVFAudioReceiver
class being called before the avf_read_packet
function and freeing/overwriting the current_audio_frame
before it is read.
I am able to avoid the dropouts by creating a FIFO for the received frames.
Not being that familiar with this codebase or AVFoundation, I am not sure if this is a reasonable fix or not (I suppose there could be latency/alignment fallout), but the avfoundation_audio_fifo.patch attached patch seems to resolve the issue for me.
by , 2 weeks ago
Attachment: | avfoundation_audio_fifo-v2.patch added |
---|
When the frame list overflows, advance the read index by half the FIFO length
comment:3 by , 2 weeks ago
͏ Difference:
-
.patch
old new 18 18 +static void push_frame(AVFFrameList* list, CMSampleBufferRef frame) 19 19 +{ 20 20 + list->frames[list->write_index] = (CMSampleBufferRef)CFRetain(frame); 21 21 + list->write_index = (list->write_index + 1) % AVF_FRAME_LIST_LENGTH; 22 22 + 23 23 + if (list->write_index == list->read_index) { 24 + CFRelease(list->frames[list->read_index]); 25 + list->read_index = (list->read_index + 1) % AVF_FRAME_LIST_LENGTH; 24 + for (int i = 0; i < (AVF_FRAME_LIST_LENGTH / 2); ++i) { 25 + CFRelease(list->frames[list->read_index]); 26 + list->read_index = (list->read_index + 1) % AVF_FRAME_LIST_LENGTH; 27 + } 26 28 + } 27 29 +}
͏ Reference:
͏ https://ffmpeg.org/ffmpeg-devices.html#avfoundation
͏ AVFoundation appears to be Apple specific.