How to build and automate a python ETL pipeline with airflow on AWS EC2 | Data Engineering Project

How to build and automate a python ETL pipeline with airflow on AWS EC2 | Data Engineering Project

tuplespectra

1 год назад

125,757 Просмотров

Ссылки и html тэги не поддерживаются


Комментарии:

@donatus.enebuse
@donatus.enebuse - 07.04.2025 13:42

I'm so much blessed to be learning from a fellow Nigerian who is not interested in unnecessary promotions and advertorials. Just great lessons and for free. Thanks Dr Yemi.

Ответить
@glowlog
@glowlog - 27.03.2025 16:42

Legend

Ответить
@sriharsha4742
@sriharsha4742 - 12.03.2025 20:17

Awesome Hands on Tutorial!! Thank you🎉

Ответить
@dayanandab.n3814
@dayanandab.n3814 - 06.03.2025 11:58

Perfect one video to get started with AIRFLOW. Thank you for sharing. your valuable knowledge
your teaching skills are highly impressive. Subscribed :)

Ответить
@Interesting_Facts_Of_Life
@Interesting_Facts_Of_Life - 13.02.2025 18:41

Great video, super easy to understand and well explained!

Ответить
@waqarbinjamil3577
@waqarbinjamil3577 - 11.02.2025 13:12

I really liked the way you teach. Even a layman will understand everything.. Thank you Sir

Ответить
@TheKnowledgeArcades
@TheKnowledgeArcades - 06.01.2025 02:19

@tuplespectra can we use shell scripting for orchestrating this instead of Apache Airflow..?

Ответить
@NOUHAILAELFID
@NOUHAILAELFID - 05.01.2025 21:45

are all what you used included in the free tier of aws services or not, please?

Ответить
@miracleatimangovictoria7529
@miracleatimangovictoria7529 - 29.11.2024 12:29

Wow. I love how you explain.

Ответить
@venkat497
@venkat497 - 20.11.2024 08:17

So I'm doing a similar project but using DBT for transformation. Is it possible to put DBT into same DAG? Is that how people do it as a good practice?

Ответить
@marciofranco6085
@marciofranco6085 - 15.11.2024 14:41

Thank's from Angola

Ответить
@jamespowers5510
@jamespowers5510 - 11.11.2024 06:37

I have used airflow as a user but never spun up an instance on my own. This is easily the best video that exists today for AWS + Airflow, this has taken so much guess work out of the process. Thanks so much!

Ответить
@chikwadovalentine2598
@chikwadovalentine2598 - 01.11.2024 19:40

Thank you for this awesome project, it took me 2weeks to complete. Had to fix lots of errors but I was determined to get past it all. I honestly learnt a whole lot from all the errors really. Thank you once again🙏

Ответить
@podunkman2709
@podunkman2709 - 21.10.2024 21:14

Penato is way simpler, have nice gui, performs fantastic. What is the sense to write code for each data transformation in 2024...

Ответить
@DANNYEL20122
@DANNYEL20122 - 16.10.2024 04:21

Na Nigerian guy be this

Ответить
@purnapachhai3021
@purnapachhai3021 - 26.09.2024 11:57

Very clear explanation

Ответить
@nikitanikumbh9162
@nikitanikumbh9162 - 23.09.2024 09:48

I am able to install airflow on free-tier EC2 instance but not able to connect to : Public IPv4 DNS . can anyone suggest on this ?

Ответить
@kingnguyen6109
@kingnguyen6109 - 18.09.2024 05:32

What if I use CSV file instead of API so how can I load the file into airflow file

Ответить
@ericiyen96
@ericiyen96 - 17.09.2024 22:40

Excellent job

Ответить
@MalvinSiew
@MalvinSiew - 13.09.2024 11:48

Hello thank you for the great video. I am running into an issue where the EC2 instance keeps freezing and disconnecting every time I manually trigger a dag. I believe it's because the t2.small instance does not have enough memory but I see this is not an issue for you or others. Just wondering if anyone else is going through the same thing or whether it is something I did wrong? Any help is appreciated, thank you.

Ответить
@bobbytito6301
@bobbytito6301 - 11.09.2024 15:34

wonderful video

Ответить
@lehast
@lehast - 04.09.2024 17:23

So far to me, Airflow is cron with a half baked UI and functionality, for starters, you can't create a DAG using the UI

Ответить
@toml2951
@toml2951 - 29.08.2024 17:39

Thank you, for this! Love the enthusiasm, exuberance and energy brother. Usually, most tutorials are dull, monotonous and riddled with confusing, and abstract terms and acronyms. You really take the time to distill things in plain speak!

Ответить
@emmanuelharel
@emmanuelharel - 28.08.2024 18:02

Just a remark: you forgot to tell that here you are using the default vpc. I don't think it is a best practise. I understand the video would be longer but it would worth mentioning it.
As well depending on how you created your VPC, AWS now can create for you automatically a connection between for ec2 instance and S3.

Ответить
@rashedulkabir7479
@rashedulkabir7479 - 22.08.2024 00:07

when I ssh using vs code it keeps disconnecting. anyone found a solution?

Ответить
@Dlearnn
@Dlearnn - 19.08.2024 15:34

Since you are using session token, isnt it a manual effort and hence DAG will not be automated?

Ответить
@Dlearnn
@Dlearnn - 19.08.2024 14:55

Where does the airflow folder in vscode came from?

Ответить
@mamadoulo1737
@mamadoulo1737 - 18.08.2024 23:48

Great work ! Thank for this project. I have already completed and i learn a lot.

Ответить
@andresparra5820
@andresparra5820 - 16.08.2024 04:59

Thank you so much for this tutorial, was very helpful for an interview !!!

Ответить
@cascadeanalog320
@cascadeanalog320 - 15.08.2024 01:29

Thank you very very very Much !

Ответить
@salmanshikalgar4482
@salmanshikalgar4482 - 27.07.2024 18:53

Command not working sudo pip install pandas in airflow _venv , what to do?

Ответить
@collinspo
@collinspo - 23.07.2024 00:30

This is a masterpiece. Thank you again and again !!!

Ответить
@rohan_mehra
@rohan_mehra - 05.07.2024 06:46

It's like T'Challa is teaching AirFlow! 🤣

Ответить
@Nawar-t56
@Nawar-t56 - 28.06.2024 01:42

Tutorial is really great. However, I am having some issues with loading the airflow UI. When I use ''airflow standalone'', and go to the 8080 port on the url from s3 instance it takes a long time to load the UI. Most times it doesn't load at all. I can't seem to understand what the reason is. Could you please help me?

Ответить
@xavierromerocarrion1369
@xavierromerocarrion1369 - 27.06.2024 15:12

Hi bro. Congrats for the video!!
I have experiencing a major issue, the airflow service and the SSH connection collapses after a few minutes of the instance initiation... What can I do?

Ответить
@jrohit9664
@jrohit9664 - 19.06.2024 10:01

The scheduler does not appear to be running. Last heartbeat was received 32 minutes ago.

The DAGs list may not update, and new tasks will not be scheduled.I have followed all commands you mentioned but i was getting like this in my airflow user interface and the command prompt which i connected to ec2 was running airflow standalone where to run new commands so that i can run airflow scheduler ? pls reply

Ответить
@josecardons6221
@josecardons6221 - 16.06.2024 05:54

amazing bro thanhs

Ответить
@mdobaidullahal-faruk3457
@mdobaidullahal-faruk3457 - 31.05.2024 00:13

Thank you very much for making the concepts so easy to understand👌

Ответить
@JDTheClasher
@JDTheClasher - 30.05.2024 22:46

I am getting following error while creating a virtual environment.
E: Unable to locate package python3.11.9-venv
E: Couldn't find any package by glob 'python3.11.9-venv'

If anyone can help then it would be great!!

Ответить
@RaghulS-nl6wx
@RaghulS-nl6wx - 25.05.2024 17:49

i get my load_data task failed i configured everything right but still get failed for the last task i couldnt figure it out anyone with the same scenario got any soln?

Ответить
@narasa12
@narasa12 - 19.05.2024 08:14

Excellent information, thank you so much for posting this video here

Ответить
@kristiandaclan9236
@kristiandaclan9236 - 17.05.2024 15:05

If you have problems with installing dependencies it is because instead of sudo apt install3.10-venv, replate it to sudo apt install3-venv to get the latest version. Currently, it's at 3.12

Ответить
@Vilayat_Khan
@Vilayat_Khan - 05.05.2024 23:15

yay, i managed to finish it! and i have the csv file in s3. thx, u deserve the like lol.

Ответить
@Vilayat_Khan
@Vilayat_Khan - 04.05.2024 18:10

dont ask for likes, i will only like if i can finish and add this to my resume

Ответить
@pavanparvathanenii4471
@pavanparvathanenii4471 - 24.04.2024 05:01

Amazing video❤

Ответить
@mihirgharat7585
@mihirgharat7585 - 12.04.2024 23:40

my ec2 instance is not loading using a t2 micro instance. could that be the only reason?

Ответить
@Vikasptl07
@Vikasptl07 - 07.04.2024 08:17

Good beginners video. I am personally not a fan of running your processing code and orchestration on same instance. How do you ensure package dependency,manage virtual environments,resource allocation for different workloads.

Ответить
@AladinAiWisdom
@AladinAiWisdom - 02.04.2024 11:29

Thanks for the tuto, just as remarque no need for hard coding credentials, since you gave iam role full access and assumed by the EC2. you can write directly in S3 from EC2

Ответить