Kbps - 64
When audio is compressed to 64 kbps (using codecs like MP3 or AAC), information is discarded to save space. Research shows this affects deep learning models in the following ways:
: Deep models designed for detecting audio manipulation (e.g., resampling) are often evaluated at 64 kbps to ensure they can still identify global time-frequency variations despite the lossy compression. Common Applications at 64 kbps End-to-end Stereo Audio Coding Using Deep Neural Networks 64 kbps
: Even a 64 kbps MP3 compression can decrease the accuracy of a Deep Neural Network (DNN) by roughly 4.98% in sensitive tasks like acoustic classification. When audio is compressed to 64 kbps (using
: At 64 kbps, most signal details are preserved compared to lower bitrates (like 16 or 32 kbps), but the compression efficiency is significantly lower. : At 64 kbps, most signal details are
In the context of audio processing and deep learning, (kilobits per second) refers to a common low-bitrate threshold used to test the robustness of deep features —the high-dimensional data representations extracted by neural networks from raw audio signals. Impact on Deep Features
: Modern Neural Audio Codecs attempt to learn optimal transformations in a "latent space" (deep features) that provide better sound quality at 64 kbps than traditional codecs like HE-AAC.