Extracting Mel-Frequency Cepstral Coefficients with Python

3 years ago

56,177 views

Comments:

@devendrajec - 04.08.2023 12:57

If a signal is sampled at sr = 22050 Hz but sr = 44100 Hz is specified when calculating the MFCCs, what will happen theoretically?
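
One way to reason about this question is through FFT bin frequencies: bin k corresponds to k * sr / n_fft, so declaring twice the true rate doubles every frequency label, and a mel filterbank built from the declared rate would then cover the wrong bands. The sketch below uses NumPy only; the 1000 Hz test tone and n_fft = 2048 are assumptions chosen for illustration and do not come from the video.

```python
import numpy as np

true_sr = 22050      # rate the audio was actually sampled at
declared_sr = 44100  # rate wrongly passed to the feature extractor
n_fft = 2048         # assumed analysis window length

# Synthetic 1000 Hz tone, one analysis window long (illustrative input)
t = np.arange(n_fft) / true_sr
tone = np.sin(2 * np.pi * 1000.0 * t)

# Magnitude spectrum of the Hann-windowed tone and its strongest bin
spectrum = np.abs(np.fft.rfft(tone * np.hanning(n_fft)))
peak_bin = int(np.argmax(spectrum))

# The same bin is labelled with different frequencies depending on sr
freq_true = peak_bin * true_sr / n_fft
freq_declared = peak_bin * declared_sr / n_fft

print(f"peak bin {peak_bin}: {freq_true:.1f} Hz (true sr), "
      f"{freq_declared:.1f} Hz (declared sr)")
```

The samples themselves are unchanged; only the mapping from bins to hertz shifts, which is why the resulting coefficients would describe a different part of the spectrum than intended.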

@armghan2312 - 26.05.2023 18:41

How can we extract TQWT features from audio?

@peabrane8067 - 06.01.2023 03:28

What's the Debussy piece you played?? First time hearing it

@shubhamkapoor5152 - 04.01.2023 01:01

How do we normalize the MFCCs using cepstral mean and variance?
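
Cepstral mean and variance normalization is commonly done per coefficient across time: subtract each coefficient's mean over all frames and divide by its standard deviation. A minimal NumPy sketch follows, assuming the MFCCs are laid out as (n_mfcc, n_frames). The helper name `cmvn`, the `eps` guard, and the random stand-in matrix are all hypothetical and only there to make the example self-contained.

```python
import numpy as np

def cmvn(mfccs, eps=1e-8):
    """Per-coefficient mean/variance normalization over the time axis.

    mfccs: array of shape (n_mfcc, n_frames). eps avoids division by zero.
    """
    mean = mfccs.mean(axis=1, keepdims=True)
    std = mfccs.std(axis=1, keepdims=True)
    return (mfccs - mean) / (std + eps)

# Random stand-in for a real MFCC matrix: 13 coefficients, 200 frames
rng = np.random.default_rng(0)
fake_mfccs = rng.normal(loc=5.0, scale=3.0, size=(13, 200))

normalized = cmvn(fake_mfccs)
print(normalized.shape)
```

Here the statistics are computed per utterance; whether to use per-utterance or dataset-wide statistics is a design choice that depends on the task.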

@zoezoezoe458 - 21.12.2022 00:33

Thank you so much, Valerio!! Super helpful content and super clear steps to help me get my MFCCs.

@Fa94Ar - 07.12.2022 15:10

Thanks a lot, but how can I get the MFCC features as a matrix rather than a plot?

@desrucca - 06.11.2022 16:33

Are we getting the MFCCs from the STFT or the FFT?

@Astrovic1 - 27.09.2022 15:35

Only three videos left and I will have made it through the entire course/playlist. At this point I want to thank you for all of it. This was exactly what I was searching for, and now I am ready to start my Bachelor thesis about "AI in music", which is definitely the niche in which I want to get really, really good.

@simhan2895 - 23.08.2022 22:03

Hi, thanks for the informative video. Based on the concatenated output of MFCCs and deltas, can you advise which values to consider to find what is unique in a speaker's voice? Or, on top of this, should we build a larger dataset and train a model to identify the uniqueness of a voice for biometrics?

@chetanverse - 03.08.2022 10:34

Thank You, MAN

@rabihiawaludin7128 - 26.07.2022 16:00

Hi, how can I compare two audio files and show the result as a percentage?

@shimas8916 - 12.06.2022 09:42

How can I extract spatial features of audio from the MFCC values using Inception-ResNet-v2, and how can I extract temporal features using an LSTM model?

@tentyluaysari3393 - 23.02.2022 03:29

I want to ask: is there any difference between using only librosa.feature.mfcc and extracting the MFCCs manually by first transforming the audio with the FT/FFT? Thank you.

@iioiggtrt9085 - 08.01.2022 19:46

How can I extract MFCCs for a group of files in folders and save the plots?

@shloktadilkar5536 - 03.12.2021 09:28

Thanks a lot sir very nice video

@alfredoalarconyanez4896 - 21.11.2021 11:34

Loved these videos, thank you!

@SHADABALAM2002 - 10.10.2021 20:30

MFCC or mel spectrogram: which one is preferable, where, and why? I am unable to find an answer.

@ruslanruslan338 - 20.09.2021 10:00

Dear author, thank you very much, the content is very useful!!! Could you please explain why you don't set the number of mel filters when you apply < librosa.feature.mfcc(...) > ?

@kathyker3498 - 15.07.2021 06:50

thank you it does help a lot :)

@ananyaboruah6457 - 08.07.2021 13:27

This helped me so much. I tried every other way, but only this worked as shown. Thank you.

@bihahazlan3393 - 22.06.2021 07:01

Why do I get an error at idp.Audio(audio_file)? The audio does not display.

@nidhichakravarty9483 - 11.06.2021 17:58

Thank you sir for these videos! Can you please make videos on CQCC feature extraction?

@bernardmatt7067 - 15.04.2021 19:28

Thank you for the informative video. But I have a question: do I have to perform pre-emphasis, or can I just use the MFCCs directly?

@muralimanohar8273 - 02.04.2021 00:40

Nice Video. How do we interpret the MFCCs?

@yuktatayi4801 - 24.03.2021 08:32

Has anyone implemented mfcc and hmm for speech emotion recognition using Matlab?

@nabaaf.hameed786 - 14.03.2021 06:54

How can I extract MFCCs from multiple audio files at once? Can I?

@seekingSeanCarlin - 09.02.2021 22:56

I'm trying to figure out what is the significance of a positive versus negative MFCC intensity value, especially over time. If the value for MFCC 8, for example, was to consistently remain negative, but become more positive (from -0.05 to -0.03) what is that indicating?

@oybekeraliev - 04.02.2021 15:54

First of all, thank you very much, Mr. Valerio Velardo. All your videos are very helpful for learners. I have one request: could you explain FFT and MFCCs in NumPy? Thanks a lot in advance.

@Sam-jk5dw - 24.01.2021 09:10

What's the point of concatenating the MFCCs with the derivatives?

@Alice-jv9fj - 04.01.2021 02:23

Thank you for your videos! So clear and helpful

@resmimarangatta8626 - 22.12.2020 13:54

Can we calculate timbre from MFCCs?

@nishantbangera5384 - 19.12.2020 10:57

I did this with an audio file of a dysarthric person and got the output. Now how should I explain the visualisation to my project guide?

@gihanchathuranga3802 - 24.11.2020 16:13

Please tell me, I found an error here, in the 4th cell:
# load audio files with librosa
signal, sr = librosa.load(audio_file)
attribute error...

@Erosis - 11.11.2020 08:28

Is concatenating them better than putting each of them into a different channel? Also, how would you normalize each of these? Do you normalize the MFCC / delta / delta2 individually before concatenating? Do you use min-max normalization or standardization? Do you apply this to each "image" individually or do you use the entire training dataset's statistics? Thanks!

@leashr - 06.11.2020 20:52

Hi, very useful video! To do some simple sound classification, can I just use the MFCCs, or would you recommend using the derivatives or the concatenation?

@MrDari88 - 02.11.2020 17:50

Another great video. This is amazing! Just one question if you don't mind, Valerio: Does the librosa.feature.mfcc method consider frame sizes of 512 by default? (30/1292*sr=512)
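
The arithmetic in this comment can be checked directly. In librosa, the number of samples between consecutive frames is the hop length (default 512), while the frame size `n_fft` defaults to 2048, so the 512 derived here matches the hop rather than the frame size. The sketch below assumes a 30-second clip loaded at librosa's default 22050 Hz and centered framing, where the frame count is 1 + n_samples // hop_length; the 1292 figure is taken from the comment.

```python
# Values assumed from the comment: 30 s clip, 1292 frames,
# and librosa's default load rate of 22050 Hz.
duration_s = 30
sr = 22050
n_frames_reported = 1292

n_samples = duration_s * sr

# Samples advanced per frame, as implied by the comment's formula
implied_hop = n_samples / n_frames_reported

# Frame count predicted for hop_length = 512 with centered framing
hop_length = 512
n_frames = 1 + n_samples // hop_length

print(round(implied_hop, 2), n_frames)
```

Both directions agree: about 512 samples per step, and 1292 frames for a hop of 512.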

@jeremynx - 31.10.2020 12:19

Thank you for these videos! Really helpful

@mohamankad970 - 25.10.2020 12:54

This is what I was looking for! Thank you so much!

@twrider3649 - 16.10.2020 09:53

Your video is great!
Excuse me, what does n_mfcc mean?

@alvynabranches1214 - 09.10.2020 07:29

Can we use MFCCs for music generation?

@ahsanrossi4328 - 08.10.2020 17:36

awesome way to explain.

@cbrtdgh4210 - 08.10.2020 16:10

I had absolutely no idea what I was doing when it came to MFCCs for my electronic stethoscope project, this is exactly the content I needed! Really helpful.
