Discussing All The Types Of Feature Transformation In Machine Learning

Discussing All The Types Of Feature Transformation In Machine Learning

Krish Naik

3 года назад

75,306 Просмотров

Ссылки и html тэги не поддерживаются


Комментарии:

@krishnaik06
@krishnaik06 - 21.04.2021 16:39

Please take care everyone.

Ответить
@moonSTAR1893
@moonSTAR1893 - 17.08.2023 17:37

Hello. Important mistake in this tutorial, so I have to stop watching it.
Problem: you e.g. use MinMax Scaler on whole X_train with differently scaled variables inside. Let's assume "age" is distributed 18-65 while "fare" goes from 5-2000. Scaling age with the global min/max of the dataset, distorts your features. In this case for age 20 you would get z = X-Xmin/Xmax-Xmin = (20-5)/(2000-5) = 15/1995= 0.0075. Instead in the per-feature scaling with just age you would get z = (20-18)/(65-18) = 0.0426 corresponding to a 5-fold numerical difference. The maximal age of 65 would get z = (65-5)/(2000-5) = 0.03 !!!! Meaning age would have maximal value of 0.03 instead of 1!

Ответить
@captainmustard1
@captainmustard1 - 07.03.2023 12:16

thank you sir, it is just an amazing video!!

Ответить
@wahabali828
@wahabali828 - 04.10.2022 15:30

thank you very much sir

Ответить
@ishantyagi2701
@ishantyagi2701 - 25.04.2022 17:23

should standardization be applied to whole dataset or after we split into train test data?

Ответить
@ajaykushwaha-je6mw
@ajaykushwaha-je6mw - 31.03.2022 20:07

Hi Krish, while transformation why we are not dividing our data in Train and Test ?

Ответить
@write2ruby
@write2ruby - 27.03.2022 15:05

Very Informative

Ответить
@SomeoneElsesSomeoneElse
@SomeoneElsesSomeoneElse - 31.01.2022 23:09

With respect to StandardScaler() If you split the dataset prior to scaling the features then don't you risk having skewed features? Put differently, if you train your model to learn that values of 1 get a certain weight and in your test set the data isn't standardized around the same mean as the train set then the model will invariably have worse accuracy unless the train set and test set features have the same mean, right? Shouldn't the test set samples of the full dataset removed only to serve as an "out-of-sample" test? Not two separate datasets?

Ответить
@mostafakhazaeipanah1085
@mostafakhazaeipanah1085 - 08.01.2022 11:44

What A Useful and Informative Video.
Most of the ML Courses are based on Algorithms which they forget the importance of Data Preparation

Ответить
@sandipansarkar9211
@sandipansarkar9211 - 03.10.2021 23:59

finished watching

Ответить
@yashpandey5484
@yashpandey5484 - 21.09.2021 14:20

Sir weather scalling is required after performing log transformation ??

Ответить
@MdMahmudulHasanSuzan--
@MdMahmudulHasanSuzan-- - 05.08.2021 07:55

how can i perform scaling on a k-fold data?

Ответить
@poojapatil7128
@poojapatil7128 - 03.08.2021 05:09

I have completed my 1-year post-graduation program in data science from a leading institute, but the various techniques I learned from your videos in free, were not even mentioned in the curriculum.

Thank you for your easy and detailed explanation.

Ответить
@mayurgupta4004
@mayurgupta4004 - 26.06.2021 00:48

when we are using gaussian transformation that will convert our distribution to gaussian distribution where mean=median or standard gaussian distribution where mean=0 and variance=1

Ответить
@umaanil3344
@umaanil3344 - 12.06.2021 13:20

Sir what about that 'df_scaled' term?
I am getting error at that point that df_scaled is not defined... Can you please explain

Ответить
@mdadilhussain2967
@mdadilhussain2967 - 07.06.2021 14:52

I guess that you should first do fit_transform then train_test_split;
As if you have first splited then according to train data you have calculated mean.
Then applies same mean for test data, so test data won't have mean as zero.
Please clear this doubt.

Ответить
@venkatraaman4509
@venkatraaman4509 - 01.06.2021 14:05

hai, for eg I have a feature regarding age, height, weight
now I willing to make the gaussian transformation, here in my case
==>logarithm tx makes a good fit for age
==>reciprocal tx makes a good fit for height
the question is may I use both features(applied with age(log tx) & height(reciprocal tx)) for my train data, kindly reply to me, sir

Ответить
@ashiqhussainkumar1391
@ashiqhussainkumar1391 - 26.05.2021 12:40

Tbh I don't prefer any lecture series except nptel. But seeing your 20-25 I personally feel this channel is a better resource for practical implementation of ML...
Initially I didn't subscribe bcz I felt ur profile is looking young and u might not be knowing the way u taught 😁😁😁... Subscribed
Thanks to you and to Nptel

Ответить
@abhishek_dataman6348
@abhishek_dataman6348 - 04.05.2021 07:42

Do we require to check this transformation techniques in all binary classification problems?!

Ответить
@geianmarkdenorte9874
@geianmarkdenorte9874 - 23.04.2021 20:39

I am looking for these master krish! Take care too

Ответить
@pseudounknow5559
@pseudounknow5559 - 23.04.2021 13:17

Greetings from Poland <3 Stay strong India you will overcome this ;)

Ответить
@priyayadav3990
@priyayadav3990 - 22.04.2021 17:13

In transformation we transform distribution in Normal distribution.then after transformation we also need to perform Standardisation(Scale down).please tell me if I am wrong.

Ответить
@foreignworker-2163
@foreignworker-2163 - 22.04.2021 16:37

Pray for your team!

Ответить
@shubhamkondekar5382
@shubhamkondekar5382 - 22.04.2021 14:51

Krish Naik is best

Ответить
@sarthakphatate4595
@sarthakphatate4595 - 22.04.2021 14:51

good

Ответить
@vidulakamat6564
@vidulakamat6564 - 22.04.2021 12:24

While doing the transformation, do we need to transform both numerical and categorical (encoded) features or only numerical ones? If target is continuous, do we need to transform that as well?

Ответить
@tanujajoshi1901
@tanujajoshi1901 - 22.04.2021 12:03

Hey Krish, Can you explain Generative Adversarial Networks (GANs) especially the coding part for a dataset other than an image dataset?? It would be of great help.

Ответить
@shivaragiman
@shivaragiman - 22.04.2021 10:37

Get well soon, you people need more to us 👍👍👍👍👍

Ответить
@Sivaramakrishnanv7
@Sivaramakrishnanv7 - 22.04.2021 09:29

In the join button, i can see (6 months: ₹283.20) plan. you have not mentioned this plan in that join video.Can you pls explain here sir?

Ответить
@dheerendrasinghbhadauria9798
@dheerendrasinghbhadauria9798 - 21.04.2021 21:54

krish bhai....please upload a PDF of notes of video summary.... along with each video...

Ответить
@prakashkafle454
@prakashkafle454 - 21.04.2021 20:43

I pray for your team for speed recovery krish . We are also getting worst news day by day here in nepal ...

Ответить
@bhargavikoti4208
@bhargavikoti4208 - 21.04.2021 20:37

As usual neatly explained..👍👍thank you for uploading 🙏

Ответить
@nishanthviswajith1496
@nishanthviswajith1496 - 21.04.2021 20:05

I know python programming. And I'm learning data science by self-study .. My problem is I have 4 years gap in employment. Will I get job in data science field? Need your suggestions.. I'm 26 yrs old

Ответить
@shivu.sonwane4429
@shivu.sonwane4429 - 21.04.2021 19:45

for people in home isolation 👇🏻

I've almostj recovered from COVID in home isolation. I'm sharing what helped me recover in case it helps someone.

• Steam atleast 3 times a day
• Plenty of fluids: Water (preferably warm), lemonade, coconut water
• Salt water gargles


• Vitamin C supplement
• Plenty of rest
• Meditation for peace of mind
• Balanced diet
• Regain smell: Smell ajwain, kapoor and cloves
• Lie on your stomach periodically

Monitor oxygen every 2 hours. Seek medical assistance if it's 92 or below.
Pls add if I missed anything

Add ajwain and kapoor into the water while taking steam and drink malvani kadha (Tulsi, adrak, jaggery, lavng, Black paper, ajwain, gavti cha,dalchini)


Don't be panic take care use ajwain as much you can it works as natural sanitizer

Ответить
@mosart03
@mosart03 - 21.04.2021 19:28

Are we suppose to scale categorical features along with continuous features?

Ответить
@alihaiderabdi9939
@alihaiderabdi9939 - 21.04.2021 19:12

praying for employees of ineuron, inshallah everyone will get well soon.

Ответить
@nagrajwellness8622
@nagrajwellness8622 - 21.04.2021 18:02

Sir sudhanshu sir tested positive my god please I hope he get well soon

Ответить
@ayushsingh-qn8sb
@ayushsingh-qn8sb - 21.04.2021 18:00

If I have applied some encoding technique , do I have to scale them ?

Ответить
@sandipansarkar9211
@sandipansarkar9211 - 21.04.2021 17:45

great explanation

Ответить
@satviksaxena3868
@satviksaxena3868 - 21.04.2021 17:31

Hope the team will recover soon, Take Care !!

Ответить
@kiyotube222
@kiyotube222 - 21.04.2021 17:17

Get we soon Sudh!!

Ответить
@ashutoshtiwari5222
@ashutoshtiwari5222 - 21.04.2021 17:00

Sir app apna dyan rakhiye . 🥺😢

Ответить
@teegnas
@teegnas - 21.04.2021 16:54

a very important video to review all feature important techniques at one go ... thanks for uploading!

Ответить
@SALESENGLISH2020
@SALESENGLISH2020 - 21.04.2021 16:43

Pray your team members recover quickly. India needs good teachers.

Ответить
@pankajkumarbarman765
@pankajkumarbarman765 - 21.04.2021 16:38

1st view 💞💞❤️

Ответить