Extracting Mel-Frequency Cepstral Coefficients with Python

Extracting Mel-Frequency Cepstral Coefficients with Python

Valerio Velardo - The Sound of AI

3 года назад

56,177 Просмотров

Ссылки и html тэги не поддерживаются


Комментарии:

@devendrajec
@devendrajec - 04.08.2023 12:57

If signal is sampled at sr = 22050 Hz and while calculating MFCC it is mentioned as sr = 44100 Hz. Theoretically what will happen?

Ответить
@armghan2312
@armghan2312 - 26.05.2023 18:41

how can we extract tqwt features from audio??

Ответить
@peabrane8067
@peabrane8067 - 06.01.2023 03:28

What's the Debussy piece you played?? First time hearing it

Ответить
@shubhamkapoor5152
@shubhamkapoor5152 - 04.01.2023 01:01

How do we normalize the mfcc using cepstral mean and variance?

Ответить
@zoezoezoe458
@zoezoezoe458 - 21.12.2022 00:33

Thank you so much, Valerio!! Super helpful content and super clear steps to help me get my MFCCs.

Ответить
@Fa94Ar
@Fa94Ar - 07.12.2022 15:10

thanks alot ,, but how i can get mfcc feature as a matrix not a plot figure

Ответить
@desrucca
@desrucca - 06.11.2022 16:33

are we getting MFCC from STFT or FFT ?

Ответить
@Astrovic1
@Astrovic1 - 27.09.2022 15:35

only three videos left and I made it throught the entire course/playlist. At this point I want to thank you for all that. This was exactly what I was searching for and now I am ready to start my Bachelor thesis about "ai in music" what is definitely the niché in which I want to get really really good

Ответить
@simhan2895
@simhan2895 - 23.08.2022 22:03

Hi Thanks for the informative video. Based on the concatenated output of mfcc and delta, can you guide which value to be considered to find the uniqueness in a speakers voice. Or on the top of this model, should we build some more dataset and train the model to identify the uniqueness in a voice for biometric?

Ответить
@chetanverse
@chetanverse - 03.08.2022 10:34

Thank You, MAN

Ответить
@rabihiawaludin7128
@rabihiawaludin7128 - 26.07.2022 16:00

Hi, how to compare with 2 audio's? and result will show precentage.

Ответить
@shimas8916
@shimas8916 - 12.06.2022 09:42

how to extract spatial features of an audio from the mfcc values by using inception resnet v2 and how extract temporal features using lstm model

Ответить
@tentyluaysari3393
@tentyluaysari3393 - 23.02.2022 03:29

i want to ask is there any difference between using only librosa.feature.mfcc and extracting manually the mfcc by changing the audio to FT,FTT first? thank you

Ответить
@iioiggtrt9085
@iioiggtrt9085 - 08.01.2022 19:46

how to extract mfcc for group of files in folders and save plot

Ответить
@shloktadilkar5536
@shloktadilkar5536 - 03.12.2021 09:28

Thanks a lot sir very nice video

Ответить
@alfredoalarconyanez4896
@alfredoalarconyanez4896 - 21.11.2021 11:34

loved this videos, thank you !

Ответить
@SHADABALAM2002
@SHADABALAM2002 - 10.10.2021 20:30

MFCC or mel spectrogram? which one is preferable and where why? unable to find answer

Ответить
@ruslanruslan338
@ruslanruslan338 - 20.09.2021 10:00

Dear author, thank you very much, the content is very usefull!!! Could you please explain, why you don't set the number of mel-filters when you apply < librosa.feature.mfcc(...) > ?

Ответить
@kathyker3498
@kathyker3498 - 15.07.2021 06:50

thank you it does help a lot :)

Ответить
@ananyaboruah6457
@ananyaboruah6457 - 08.07.2021 13:27

this helped me so much. i tried every other ways but only this worked as it was shown. Thank you.

Ответить
@bihahazlan3393
@bihahazlan3393 - 22.06.2021 07:01

why i have error at part idp.Audio(audio_file) ? the audio cannot display

Ответить
@nidhichakravarty9483
@nidhichakravarty9483 - 11.06.2021 17:58

Thank you sir for these videos! Can you please make videos on CQCC feature extraction?

Ответить
@bernardmatt7067
@bernardmatt7067 - 15.04.2021 19:28

Thank you for the informative video. But I have some questions. Do I have to perform pre-emphasis or I can just use the MFCCs directly?

Ответить
@muralimanohar8273
@muralimanohar8273 - 02.04.2021 00:40

Nice Video. How do we interpret the MFCCs?

Ответить
@yuktatayi4801
@yuktatayi4801 - 24.03.2021 08:32

Has anyone implemented mfcc and hmm for speech emotion recognition using Matlab?

Ответить
@nabaaf.hameed786
@nabaaf.hameed786 - 14.03.2021 06:54

how I extract MFCC from multi-audio at once ?? can I??

Ответить
@seekingSeanCarlin
@seekingSeanCarlin - 09.02.2021 22:56

I'm trying to figure out what is the significance of a positive versus negative MFCC intensity value, especially over time. If the value for MFCC 8, for example, was to consistently remain negative, but become more positive (from -0.05 to -0.03) what is that indicating?

Ответить
@oybekeraliev
@oybekeraliev - 04.02.2021 15:54

First of all, Thank you very much Mr. Valerio Velardo. Your all videos are very helpful for learners. I have one please Could you explain FFT and MFCCs in numpy? Thanks a lot in advance.

Ответить
@Sam-jk5dw
@Sam-jk5dw - 24.01.2021 09:10

What's the point of concatenating the MFCC's with the derivative

Ответить
@Alice-jv9fj
@Alice-jv9fj - 04.01.2021 02:23

Thank you for your videos! So clear and helpful

Ответить
@resmimarangatta8626
@resmimarangatta8626 - 22.12.2020 13:54

can we calculate timbre from mfcc

Ответить
@nishantbangera5384
@nishantbangera5384 - 19.12.2020 10:57

I did this with an audio file of Dysarthric person and even got the output. Now how should I explain the visualisation to my project guide.

Ответить
@gihanchathuranga3802
@gihanchathuranga3802 - 24.11.2020 16:13

please tell me, I found an error here...in 4th cell..
# load audio files with librosa
signal, sr = librosa.load(audio_file)
attribute error...

Ответить
@Erosis
@Erosis - 11.11.2020 08:28

Is concatenating them better than putting each of them into a different channel? Also, how would you normalize each of these? Do you do it the MFCC / delta / delta2 individually before concatenating? Do you use minmax normalization or standardization? Do you apply this to each "image" individually or do you use the entire training dataset's statistics? Thanks!

Ответить
@leashr
@leashr - 06.11.2020 20:52

Hi, very useful video! To do some simple sound classification, can I just do it with the MFCC's or would you recommend to use the derivatives or concatination?

Ответить
@MrDari88
@MrDari88 - 02.11.2020 17:50

Another great video. This is amazing! Just one question if you don't mind, Valerio: Does the librosa.feature.mfcc method consider frame sizes of 512 by default? (30/1292*sr=512)

Ответить
@jeremynx
@jeremynx - 31.10.2020 12:19

Thank you for this videos! Really helpful

Ответить
@mohamankad970
@mohamankad970 - 25.10.2020 12:54

This is the what i was looking for! Thank you so much!

Ответить
@twrider3649
@twrider3649 - 16.10.2020 09:53

Your video is great!
excuse me ! whats the n_mfcc mean ?

Ответить
@alvynabranches1214
@alvynabranches1214 - 09.10.2020 07:29

Can we use MFCCs for music generation?

Ответить
@ahsanrossi4328
@ahsanrossi4328 - 08.10.2020 17:36

awesome way to explain.

Ответить
@cbrtdgh4210
@cbrtdgh4210 - 08.10.2020 16:10

I had absolutely no idea what I was doing when it came to MFCCs for my electronic stethoscope project, this is exactly the content I needed! Really helpful.

Ответить
@sagarkulkarni9114
@sagarkulkarni9114 - 08.10.2020 15:02

Thank you, Valerio. Great content

Ответить
@sandipandhar1668
@sandipandhar1668 - 08.10.2020 14:10

Thanks Valerio for such a great content

Ответить