Audio Processing

Extend deep learning workflows with audio and speech processing applications

Apply deep learning to audio and speech processing applications by using Deep Learning Toolbox™ together with Audio Toolbox™. For signal processing applications, see Signal Processing. For applications in wireless communications, see Wireless Communications.

Apps

Signal Labeler: Label signal attributes, regions, and points of interest, and extract features

Functions

audioDatastore: Datastore for collection of audio files
audioDataAugmenter: Augment audio data (Since R2019b)
audioFeatureExtractor: Streamline audio feature extraction (Since R2019b)
openl3Embeddings: Extract OpenL3 feature embeddings (Since R2022a)
pitchnn: Estimate pitch with deep learning neural network (Since R2021a)
vggishEmbeddings: Extract VGGish feature embeddings (Since R2022a)
yamnet: (Not recommended) YAMNet neural network (Since R2020b)
classifySound: Classify sounds in audio signal (Since R2020b)
crepe: (Not recommended) CREPE neural network (Since R2021a)
vggish: (Not recommended) VGGish neural network (Since R2020b)
openl3: (Not recommended) OpenL3 neural network (Since R2021a)
vadnet: (Not recommended) Voice activity detection (VAD) neural network (Since R2023a)
detectspeechnn: Detect boundaries of speech in audio signal using AI (Since R2023a)
separateSpeakers: Separate signal by speakers (Since R2023b)

Blocks

VGGish: VGGish embeddings extraction network (Since R2022a)
VGGish Embeddings: Extract VGGish embeddings (Since R2022a)
YAMNet: YAMNet sound classification network (Since R2021b)
Sound Classifier: Classify sounds in audio signal (Since R2021b)
OpenL3: OpenL3 embeddings extraction network (Since R2022b)
OpenL3 Embeddings: Extract OpenL3 embeddings (Since R2022b)
CREPE: CREPE deep pitch estimation neural network (Since R2023a)
Deep Pitch Estimator: Estimate pitch with CREPE deep learning neural network (Since R2023a)

Topics