Комментарии:
Thank You so much for your content
ОтветитьThanks for this awesome project.
ОтветитьHi Ankit , What should I do If I don't see ODBC driver for SQL server in ODBC data source administrator ?
ОтветитьExcellent tutorial, thank you!
ОтветитьGreat work sir , sir i have one problem im using mysql and for third questionsy output showing me only 2 rows for same query why is that
ОтветитьThank you for the valuable content that i found so helpful for my profile .
I have been going through a problem where after transforming the data in Jupyter and loading it to SSMS , the code runs in the notebook with no error but I can't find that data file in SSMS, can you clarify Please.
Thanks.
Hi, In my C drive i dont have .Kaggle folder so do we need to create new folder for .Kagglejson file to save
ОтветитьHello sir i got an error in this line can u help me out df['order_date']=pd.to_datetime(df['order_date'],format="%Y-%m-%d")
ОтветитьLopez Charles Allen Joseph Hernandez Kenneth
ОтветитьGreat start for learning. Provided complete knowledge how the ETL works and improved the self confidence. Thank you so much for wonderful lecture about the project
ОтветитьHi Ankit,
Thanks for your video. I work at a company where we load 70-80 files into our database each month. These files can be in formats like CSV, XLSX, or TXT etc..., and they all contain data. While each file is similar, there are always some differences. Currently, we use bulk insert in SQL Server to handle this, but things can go wrong, leading to some manual work. For example, one of the files might look different from the previous month.
Do you think it would be better to use a mix of Python and SQL Server for this task, or is SQL Server sufficient? I feel that SQL Server's error messages can be poor, making it hard to troubleshoot why a table sometimes fails to load.
Thanks!
The best video for beginners
ОтветитьHi,my table is not reflecting the changes in mysql tables which I have done in jupyter notebook. Even after saving and refreshing . Please help me out in this.
Ответитьgreat video
Ответитьextremely insightful thank you sir
ОтветитьI found doing analysis more easy by Python rather than SQL... Its because may be my conviction is more dominant over Python..
Ответитьhi sir i just want to ask one question that is why u are loading the modified | cleaned data in to the sql ? we can do the same thing in pandas also....
and i have another doubt that is in data engineering field we have to use sql or python pandas to write most complex queries.Ii find sql is more easy than pandas...can u please give me an reply..
Great work and thanks for sharing this
ОтветитьYou can run SQL queries directly from the ipynb notebook using the pandasql library like this:
import pandasql as ps
query = """
SELECT product_id, SUM(sale_price) AS sales FROM df
GROUP BY product_id
ORDER BY sales DESC
"""
res = ps.sqldf(query, locals())
print(res)
I love the content, but unfortunately, I'm using a MacBook.
Ответить-- find month over month growth comparisen for 2022 and 2023
it s small and effective
SELECT month(order_date) AS order_month,
SUM(CASE WHEN year(order_date) = 2022 THEN sale_price ELSE 0 END) AS sales_2022,
SUM(CASE WHEN year(order_date) = 2023 THEN sale_price ELSE 0 END) AS sales_2023
FROM df_orders
GROUP BY month(order_date)
ORDER BY order_month;
Ankit Sir, it is a really very helpful video !! Thank you so much !!
Ответитьcan i put this project in resume
ОтветитьHow did you get the .kaggle folder ? Did you create one ?
ОтветитьHi Ankit, while loading the data into sql server using to_sql method, there is an error pop-up(mentioned below). Please help on this.
ProgrammingError: (pyodbc.ProgrammingError) ('The SQL contains 28832 parameter markers, but 159904 parameters were supplied', 'HY000')
in last query : which sub category had highest growth by profit in 2023 compare to 2022 i think it is by price not profit?
Ответитьwhy did not you remove nulls ?
ОтветитьGreat help for someone just stepping into the data world! Thanks
ОтветитьCurrently receiving this error when downloading dataset from kaggle. 'Your Kaggle API key is readable by other users on this system!'
ОтветитьWell done! The sales price column should be multiplied by the quantity since there is a discount on each product
ОтветитьI have been looking for projects to get experience from since leaving college. This was great and reminded me of my previous course work
Ответитьwhere can i get the dataset link
ОтветитьGreat knowledge man
Ответитьsir i didnt get link of kaggle dataset
ОтветитьAnkit this is brilliant.
ОтветитьVery informative and helpful - one thing what I like about ur video is that u r very detailed oriented but at the same time u explains all to the point without bla bla . Thanks for the big tutorial 🎉🎉🎉
ОтветитьThanks a lot for this fantastic project tutorial! It was super informative and well-explained. I’d love to see more content like this—keep inspiring learners like me!
ОтветитьThank you
ОтветитьHey! Ankit.. I tried connecting to Mysql but everytime it shows me the error "kernel died".... 100 se v jada try kar lia.. sab kuch sai hai locahost, root password.. fr q baar baar error aa ra.. pareshan ho gai ye problem solve hi ni ho raha
Ответить5 star🌟🌟🌟🌟🌟
ОтветитьSubscribed and liked it too ❤
Ответитьthank you for this video. it helps a lot!
Ответитьcan't find " .kaggle " in my user folder , please help
ОтветитьNot provided the DDL
Ответитьwhat a good content
Ответить