End to End Data Analytics Project (Python + SQL)

End to End Data Analytics Project (Python + SQL)

Ankit Bansal

10 месяцев назад

219,916 Просмотров

Ссылки и html тэги не поддерживаются


Комментарии:

@adityavamsi12
@adityavamsi12 - 12.08.2024 17:55

Thank You so much for your content

Ответить
@subhajitdutta9983
@subhajitdutta9983 - 17.08.2024 14:18

Thanks for this awesome project.

Ответить
@sagarjadhav3564
@sagarjadhav3564 - 18.08.2024 10:09

Hi Ankit , What should I do If I don't see ODBC driver for SQL server in ODBC data source administrator ?

Ответить
@tarekmustafa2525
@tarekmustafa2525 - 25.08.2024 12:42

Excellent tutorial, thank you!

Ответить
@rizwan1396
@rizwan1396 - 28.08.2024 11:31

Great work sir , sir i have one problem im using mysql and for third questionsy output showing me only 2 rows for same query why is that

Ответить
@ajaykumar8725
@ajaykumar8725 - 03.09.2024 06:17

Thank you for the valuable content that i found so helpful for my profile .
I have been going through a problem where after transforming the data in Jupyter and loading it to SSMS , the code runs in the notebook with no error but I can't find that data file in SSMS, can you clarify Please.
Thanks.

Ответить
@AnushaGanipisetty-t5u
@AnushaGanipisetty-t5u - 06.09.2024 20:22

Hi, In my C drive i dont have .Kaggle folder so do we need to create new folder for .Kagglejson file to save

Ответить
@PratikshaGirase-m2w
@PratikshaGirase-m2w - 26.09.2024 17:33

Hello sir i got an error in this line can u help me out df['order_date']=pd.to_datetime(df['order_date'],format="%Y-%m-%d")

Ответить
@АлександрРусаков-в4с
@АлександрРусаков-в4с - 27.09.2024 13:21

Lopez Charles Allen Joseph Hernandez Kenneth

Ответить
@anushar3361
@anushar3361 - 02.10.2024 02:57

Great start for learning. Provided complete knowledge how the ETL works and improved the self confidence. Thank you so much for wonderful lecture about the project

Ответить
@-es2bf
@-es2bf - 02.10.2024 22:32

Hi Ankit,

Thanks for your video. I work at a company where we load 70-80 files into our database each month. These files can be in formats like CSV, XLSX, or TXT etc..., and they all contain data. While each file is similar, there are always some differences. Currently, we use bulk insert in SQL Server to handle this, but things can go wrong, leading to some manual work. For example, one of the files might look different from the previous month.

Do you think it would be better to use a mix of Python and SQL Server for this task, or is SQL Server sufficient? I feel that SQL Server's error messages can be poor, making it hard to troubleshoot why a table sometimes fails to load.

Thanks!

Ответить
@kadapalarakeshreddy910
@kadapalarakeshreddy910 - 06.10.2024 20:39

The best video for beginners

Ответить
@NetTakeAways
@NetTakeAways - 11.10.2024 13:24

Hi,my table is not reflecting the changes in mysql tables which I have done in jupyter notebook. Even after saving and refreshing . Please help me out in this.

Ответить
@Rohitchauhan-kg7sp
@Rohitchauhan-kg7sp - 13.10.2024 19:02

great video

Ответить
@sshibam
@sshibam - 20.10.2024 12:04

extremely insightful thank you sir

Ответить
@waqarbinjamil3577
@waqarbinjamil3577 - 21.10.2024 15:31

I found doing analysis more easy by Python rather than SQL... Its because may be my conviction is more dominant over Python..

Ответить
@rallabandivijayvijay5456
@rallabandivijayvijay5456 - 28.10.2024 20:28

hi sir i just want to ask one question that is why u are loading the modified | cleaned data in to the sql ? we can do the same thing in pandas also....
and i have another doubt that is in data engineering field we have to use sql or python pandas to write most complex queries.Ii find sql is more easy than pandas...can u please give me an reply..

Ответить
@emadeconsultinginc
@emadeconsultinginc - 01.11.2024 00:24

Great work and thanks for sharing this

Ответить
@HeshamTayyara
@HeshamTayyara - 04.11.2024 11:23

You can run SQL queries directly from the ipynb notebook using the pandasql library like this:

import pandasql as ps

query = """
SELECT product_id, SUM(sale_price) AS sales FROM df
GROUP BY product_id
ORDER BY sales DESC
"""
res = ps.sqldf(query, locals())
print(res)

Ответить
@ngocphung7234
@ngocphung7234 - 05.11.2024 17:54

I love the content, but unfortunately, I'm using a MacBook.

Ответить
@IndusGamersHub07
@IndusGamersHub07 - 13.11.2024 09:14

-- find month over month growth comparisen for 2022 and 2023
it s small and effective


SELECT month(order_date) AS order_month,
SUM(CASE WHEN year(order_date) = 2022 THEN sale_price ELSE 0 END) AS sales_2022,
SUM(CASE WHEN year(order_date) = 2023 THEN sale_price ELSE 0 END) AS sales_2023
FROM df_orders
GROUP BY month(order_date)
ORDER BY order_month;

Ответить
@JayeshTatipamul
@JayeshTatipamul - 19.11.2024 17:41

Ankit Sir, it is a really very helpful video !! Thank you so much !!

Ответить
@KrantikumarCashCorner
@KrantikumarCashCorner - 21.11.2024 11:54

can i put this project in resume

Ответить
@vishnucruz4529
@vishnucruz4529 - 30.11.2024 17:31

How did you get the .kaggle folder ? Did you create one ?

Ответить
@nilanjanpaul2723
@nilanjanpaul2723 - 30.11.2024 23:38

Hi Ankit, while loading the data into sql server using to_sql method, there is an error pop-up(mentioned below). Please help on this.

ProgrammingError: (pyodbc.ProgrammingError) ('The SQL contains 28832 parameter markers, but 159904 parameters were supplied', 'HY000')

Ответить
@abdallahfahmy-h6k
@abdallahfahmy-h6k - 11.12.2024 21:43

in last query : which sub category had highest growth by profit in 2023 compare to 2022 i think it is by price not profit?

Ответить
@abdallahfahmy-h6k
@abdallahfahmy-h6k - 11.12.2024 22:00

why did not you remove nulls ?

Ответить
@danishsajan4114
@danishsajan4114 - 12.12.2024 22:32

Great help for someone just stepping into the data world! Thanks

Ответить
@CharlesSmith2-v2z
@CharlesSmith2-v2z - 13.12.2024 02:41

Currently receiving this error when downloading dataset from kaggle. 'Your Kaggle API key is readable by other users on this system!'

Ответить
@NalyticsEnterprises
@NalyticsEnterprises - 16.12.2024 14:07

Well done! The sales price column should be multiplied by the quantity since there is a discount on each product

Ответить
@dingusbrain101
@dingusbrain101 - 17.12.2024 14:11

I have been looking for projects to get experience from since leaving college. This was great and reminded me of my previous course work

Ответить
@Arpita-y1v
@Arpita-y1v - 18.12.2024 20:13

where can i get the dataset link

Ответить
@countryroads11
@countryroads11 - 19.12.2024 06:50

Great knowledge man

Ответить
@arvindgupta270
@arvindgupta270 - 22.12.2024 14:15

sir i didnt get link of kaggle dataset

Ответить
@ijazma54
@ijazma54 - 24.12.2024 22:13

Ankit this is brilliant.

Ответить
@tenzinchoepheldev
@tenzinchoepheldev - 04.01.2025 14:28

Very informative and helpful - one thing what I like about ur video is that u r very detailed oriented but at the same time u explains all to the point without bla bla . Thanks for the big tutorial 🎉🎉🎉

Ответить
@xyz1234544
@xyz1234544 - 13.01.2025 22:01

Thanks a lot for this fantastic project tutorial! It was super informative and well-explained. I’d love to see more content like this—keep inspiring learners like me!

Ответить
@jacqueto
@jacqueto - 31.01.2025 22:56

Thank you

Ответить
@samiksha137
@samiksha137 - 03.02.2025 10:08

Hey! Ankit.. I tried connecting to Mysql but everytime it shows me the error "kernel died".... 100 se v jada try kar lia.. sab kuch sai hai locahost, root password.. fr q baar baar error aa ra.. pareshan ho gai ye problem solve hi ni ho raha

Ответить
@IamMrRaaj
@IamMrRaaj - 03.02.2025 13:09

5 star🌟🌟🌟🌟🌟

Ответить
@mehmoodurrehman9322
@mehmoodurrehman9322 - 04.02.2025 16:12

Subscribed and liked it too ❤

Ответить
@shilpanidhi9016
@shilpanidhi9016 - 04.02.2025 17:51

thank you for this video. it helps a lot!

Ответить
@VinaySharma-hi5fs
@VinaySharma-hi5fs - 16.02.2025 20:03

can't find " .kaggle " in my user folder , please help

Ответить
@mohamedsaad052
@mohamedsaad052 - 26.02.2025 00:40

pigeon sound

Ответить
@UmerAziz-c4c
@UmerAziz-c4c - 03.03.2025 14:51

Not provided the DDL

Ответить
@codemode-r4u
@codemode-r4u - 03.03.2025 22:20

what a good content

Ответить