Solving Real-World Data Science Interview Questions! (with Python Pandas)

Solving Real-World Data Science Interview Questions! (with Python Pandas)

Keith Galli

1 год назад

112,006 Просмотров

Ссылки и html тэги не поддерживаются


Комментарии:

Ahab
Ahab - 17.10.2023 22:35

this took me a week to finish all coding questions, 10000% helped me alot to practice everything i learned in your previous pandas crash crourse. thanks

Ответить
Jovan Janjic
Jovan Janjic - 24.07.2023 21:13

Your solution for the Probability of Having a Sister question is not correct. We know for sure that the random girl must be from the [1, 2, 3, 4] part of the dataset, which amounts to 0.7. We should divide the probabilities for 1, 2, 3, 4 with 0.7, to get the probabilities that the girl is from each of these families. She theoretically can't be from families with 0 and 5 children. Essentially, you are counting in the possibilities of she being in families 0 and 5, even tough it's impossible. (In practical terms, you are needlessly being blind about the info you already have.) So the correct solution is: 0.25/0.7 x 0 + 0.2/0.7 x 0.5 + 0.15/0.7 x 0.75 + 0.1/0.7 x 0.875 = 0.42857, which is 0.43 when we round it up.

Ответить
Jovan Janjic
Jovan Janjic - 22.07.2023 16:46

In question #3 Counting Instances in Text you should add filters=re.I to account for capital letters: len(re.findall(r'\bbull\b', text, flags=re.I)))

Ответить
FantasyXpress
FantasyXpress - 19.07.2023 05:39

Is dsa important for data scientists too keith

Ответить
Niamh Maher
Niamh Maher - 29.06.2023 00:07

Brilliant video! very helpfil

Ответить
daniele farotti
daniele farotti - 16.06.2023 14:34

I really like your approach in explaining things. I am currently transitioning from pure maths into data science, and I find these videos very helpful!

Ответить
expat2010
expat2010 - 08.06.2023 11:27

I really enjoy the real world feel of your videos. Probably now ChatGPT would be a lot faster than searching Stackoverflow or the Pandas docs for those things that one doesn't know by heart.

Ответить
swerving onthatW
swerving onthatW - 30.05.2023 18:25

I solved the Bathrooms/Bedrooms problem with:

cols_of_interest = airbnb_search_details[['city', 'property_type', 'bathrooms', 'bedrooms']]
property_results = cols_of_interest.groupby(['city','property_type']).agg(
avg_bathrooms = ('bathrooms', 'mean'),
avg_bedrooms = ('bedrooms', 'mean')).reset_index()

Ответить
Tony T
Tony T - 18.05.2023 04:01

Hi Keith , Thank you so much for these videos, could you make more videos about power PI or Tableau, really really appreciate it .

Ответить
Kenneth Stephani
Kenneth Stephani - 07.05.2023 23:17

Great video, Keith!

Ответить
MikeResurrected
MikeResurrected - 25.03.2023 21:45

Could you actually google for help during a DS coding interview nowadays?

Ответить
Balakumar .N
Balakumar .N - 06.12.2022 16:34

super

Ответить
BAHAGEEL OSMAN
BAHAGEEL OSMAN - 29.11.2022 22:00

Replace yes with 1 and no with zero and sum them

Ответить
BAHAGEEL OSMAN
BAHAGEEL OSMAN - 29.11.2022 21:49

The problem lays in your use of round function you supposed to wrap the equation with round and then select the decimals 2

Ответить
Finn Nelson
Finn Nelson - 26.11.2022 21:08

TY :)

Ответить
Doula Isham Rashik Hasan
Doula Isham Rashik Hasan - 18.11.2022 07:58

you disappeared again 😢

Ответить
Man Phu
Man Phu - 22.10.2022 13:43

Thanks so much for the video, learn a lot from you. And you are super cute 😍

Ответить
Fábio Coatis
Fábio Coatis - 21.10.2022 17:27

Great video Keith. I just got curious how you comment a block of code?

Ответить
udaya bhaskar
udaya bhaskar - 13.09.2022 06:52

Thank you for this video!👍

Ответить
Data Point
Data Point - 10.09.2022 13:01

You're literally the best tutor I have seen, I myself am a Data Scientist but the amount of data science approaches I learn from you is incredible, I started from your channel and always wait for you to post new video, Hat's off. Love from Pakistan.

Ответить
Anseka O
Anseka O - 27.08.2022 19:08

Great video, please do more like that. Watching you for a long time

Ответить
Phoenix College
Phoenix College - 20.08.2022 08:44

makes it easy to understand
watching your vid on a friday night and these are the best years of my young life

Ответить
meu Jie
meu Jie - 19.08.2022 21:29

Hi, I'm Jiemeu and I love your channel. I hope to discuss business cooperation with you.....

Ответить
Rahman Alam Guidance
Rahman Alam Guidance - 10.08.2022 19:53

Hey ,Keith ..Can we access library during the solving at real time exam?

Ответить
Vansh Malik
Vansh Malik - 10.08.2022 13:07

Hey!
Does anyone knows more of the data analysis pay after placement programs accepting applications all over the globe?

Ответить
Yunus Fidan
Yunus Fidan - 08.08.2022 23:07

Noice!

Ответить
Zane Rios
Zane Rios - 08.08.2022 06:38

really love the style and format of vid, just subbed

Ответить
Nitesh Prajapat
Nitesh Prajapat - 04.08.2022 18:47

You are gem ❤️ the way you explain concepts are at next level 🔥🔥

Ответить
Dendrocnide Moroides
Dendrocnide Moroides - 04.08.2022 17:38

yes please make more videos like this

Ответить
Netanel M
Netanel M - 04.08.2022 11:21

Thanks for the video! Would love to see your approach to more non-coding questions specifically :)

Ответить
Wiz
Wiz - 04.08.2022 07:54

Great work man!! you're always doing the best.🔥🔥🔥

Ответить
Troy
Troy - 03.08.2022 22:51

Thanks for the video. It is great to see your thinking process even though you are not an expert in pandas.

Ответить
Pratik Pawar
Pratik Pawar - 03.08.2022 11:54

great video, please make more video like this

Ответить
Troy23
Troy23 - 03.08.2022 00:32

Here's a one liner chained version I've come up with for coding #6
df = ms_user_dimension.merge(ms_acc_dimension, on = 'acc_id').merge(ms_download_facts
,on ='user_id').pivot_table(index = 'date',columns = 'paying_customer',values = 'downloads',aggfunc ='sum').reset_index().query('no > yes')

Ответить
prof _ Albert
prof _ Albert - 02.08.2022 20:01

That was great. Bravo and all of your videos are awesome 🌺👌💞🤩💪

Ответить
Deepak Sai Kumar
Deepak Sai Kumar - 02.08.2022 16:12

Hi Keith,

You have been a great resource to learn Python and Data science-related skills.

Thank you!

Ответить
Earth Eyes
Earth Eyes - 01.08.2022 22:35

great video! thank you!

Ответить
Edward J. Warden
Edward J. Warden - 01.08.2022 15:00

Very helpful. Thank you Keith.

Ответить
Nicholas Grandizio
Nicholas Grandizio - 31.07.2022 03:40

Thank you for all the hard work you put into teaching Data Science. Your videos and others like you, provide more to the community such as myself trying to build a career in data than what University Programs provide. Your playing an important role in the future of Data Science by leading current students along the path to future industry leaders.

Ответить
Ranjith Raghunathan
Ranjith Raghunathan - 30.07.2022 08:46

how can i download or copy the raw dataset for each part ?

Ответить
Laurent Reynaud
Laurent Reynaud - 28.07.2022 18:06

Thank you so much for these data science courses!

Ответить
Ranjith Raghunathan
Ranjith Raghunathan - 28.07.2022 17:07

Thanks Keith

Ответить
IamFavoured
IamFavoured - 28.07.2022 15:00

Welcome back Keith 💃🏻💃🏻

Ответить
mehdi smaeili
mehdi smaeili - 28.07.2022 10:40

excellent, thanks.

Ответить
Ade Afni
Ade Afni - 28.07.2022 05:23

Thank you Keith, you're amazingg, keep it up!!!

Ответить
Kumari Pritika
Kumari Pritika - 28.07.2022 00:55

Really helpful video!

Ответить
phsopher
phsopher - 27.07.2022 11:49

For the fifth problem, pandas has an in-built percentage difference method (pct_change). The solution could be as follows for example:

sf_transactions['year_and_month'] = sf_transactions.created_at.dt.strftime("%Y-%m")
monthly_revenue = sf_transactions.groupby(["year_and_month"]).sum().reset_index()
monthly_revenue['pct_change'] =(monthly_revenue.value.pct_change()*100).round(2)
monthly_revenue[['year_and_month','pct_change']]

Ответить