Beautifulsoup vs Selenium vs Scrapy - Which Tool for Web Scraping?

John Watson Rooney

3 years ago

75,524 views


Comments:

Vítor Rocha - 26.09.2023 15:46

Hello John, Thanks for the video!

I have two questions:
What is the best tool for scraping a website with a login or authentication?

And when the website uses an API with authentication, what can I use?

Robert Tuttle - 06.08.2023 04:55

I am new to scraping. If you have a dynamic website that requires you to input dates or numbers and click on buttons, what else besides Selenium works? Does Beautiful Soup work? Very interested.

ProjectH - 19.07.2023 14:23

Greatly informative, thank you.

Ahmed Mando - 14.05.2023 19:47

thanks bro

Cleyton virino - 08.05.2023 16:36

I liked it, a very didactic explanation.

Shivam Sharma - 25.01.2023 09:51

I want to scrape data from an infinite-scroll website. Which library should I choose?

moto_HYUTS - 08.01.2023 12:09

Great video thank you. Thoughts on AutoHotkey?

Kraln0r - 01.01.2023 20:07

Thanks for a great rundown of the options available for web scraping in Python. There were a few that I was not familiar with.

Mohit Sharma - 17.12.2022 19:23

What would you recommend for creating a scraping tool?

M CC - 12.09.2022 18:32

Very useful for beginners

Affan Ahmed - 15.08.2022 00:58

Most of the time, I use Selenium to get to where I want, then make soup of the page with BeautifulSoup and extract the specific tag info. Afterwards I use pandas to save the list data in a DataFrame and export it as a CSV or Excel file.
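
The workflow described in this comment (Selenium to reach the page, BeautifulSoup to pull out the tags, pandas to export) might look roughly like the sketch below. It is only an illustration under assumptions: the URL, the `div.item`, `h2` and `span.price` selectors, the column names and the function names are all hypothetical and would need to match the real site.

```python
from bs4 import BeautifulSoup
import pandas as pd


def fetch_rendered_html(url: str) -> str:
    """Open the page in a real browser and return the rendered HTML."""
    # Imported here so the parsing step below can be tried without a browser.
    from selenium import webdriver

    driver = webdriver.Chrome()
    try:
        driver.get(url)
        # Any clicks, form input, or waits would go here before reading the page.
        return driver.page_source
    finally:
        driver.quit()


def parse_items(html: str) -> list[dict]:
    """Extract one record per item. The selectors are hypothetical."""
    soup = BeautifulSoup(html, "html.parser")
    records = []
    for item in soup.select("div.item"):
        name = item.select_one("h2")
        price = item.select_one("span.price")
        records.append({
            "name": name.get_text(strip=True) if name else None,
            "price": price.get_text(strip=True) if price else None,
        })
    return records


if __name__ == "__main__":
    html = fetch_rendered_html("https://example.com/products")  # placeholder URL
    df = pd.DataFrame(parse_items(html), columns=["name", "price"])
    df.to_csv("items.csv", index=False)
```

Keeping the browser step separate from the parsing step means `parse_items` can be run on a saved HTML file while the selectors are being worked out.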

Adam Churchwell - 07.06.2022 23:53

do you have any videos on scrape masking?

Запольская - 06.06.2022 13:51

What tool can I use to get past blocking by websites that detect that I'm using automation tools?

HudsonA - 29.05.2022 16:32

You mentioned that selenium sends information about itself to websites being scraped, so that websites could detect that selenium is being used. I'm curious if you know more about this and any workarounds?

Gels - 26.04.2022 11:40

I have been using requests along with bs4. I had heard about Scrapy, and I agree it isn't good for beginners: I was a beginner at the time and it really was daunting. But now I think it's Scrapy time.

Travis G - 29.03.2022 03:01

Wish I had watched this before choosing Selenium for a scraping project. Really feel you hit the nail on the head. Great video!

Yvonne - 17.03.2022 14:36

Thanks for the whole content!
I have a question, and it would be very helpful if you could help:
I have to scrape a dynamic website. If I scroll down, more objects are loaded onto the page (always 50 new ones). When I look in my browser's developer tools, I find the data I need under "XHR", and with every scroll for 50 new objects there is a new file called "730" containing those 50 objects in JSON format. I need all the 730 files.
Do you know how to scrape them?
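
One common approach for a page like the one described here is to skip the browser and request the JSON batches directly, stepping through them 50 at a time until nothing more comes back. The sketch below is a guess at how that could look, not a known solution for this site: the endpoint URL, the `offset` and `limit` parameter names, and the assumption that each response is a JSON list are all hypothetical and should be replaced with whatever the XHR request in the developer tools actually shows.

```python
import json
from urllib.request import Request, urlopen

PAGE_SIZE = 50  # the comment says each scroll loads 50 new objects


def fetch_page(offset: int, limit: int = PAGE_SIZE) -> list:
    """Fetch one batch of objects from a hypothetical JSON endpoint."""
    # Placeholder URL and parameter names: copy the real ones from the XHR request.
    url = f"https://example.com/api/objects/730?offset={offset}&limit={limit}"
    request = Request(url, headers={"Accept": "application/json"})
    with urlopen(request, timeout=30) as response:
        return json.load(response)  # assumes the body is a JSON list


def collect_all(fetch=fetch_page, limit: int = PAGE_SIZE, max_pages: int = 1000) -> list:
    """Request batch after batch until an empty or short batch arrives."""
    items = []
    for page in range(max_pages):
        batch = fetch(page * limit, limit)
        if not batch:
            break
        items.extend(batch)
        if len(batch) < limit:
            # A short batch means the server has run out of objects.
            break
    return items


if __name__ == "__main__":
    objects = collect_all()
    print(f"Collected {len(objects)} objects")
```

Passing the fetch function in as a parameter makes it possible to try the paging loop with a fake fetcher before pointing it at a real site.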

Tim - 02.03.2022 03:52

requests + bs4 VS scrapy?

Abid Hossain - 27.02.2022 08:17

Can you make a video about the recent scrapy-playwright bug in implementing the scrapy-playwright settings, and about some books or resources for learning Scrapy?
