Python Web-scraping with Selenium vs Scrapy vs BeautifulSoup | Witcher project ep. #1

Python Web-scraping with Selenium vs Scrapy vs BeautifulSoup | Witcher project ep. #1

Thu Vu data analytics

2 года назад

45,544 Просмотров

Ссылки и html тэги не поддерживаются


Комментарии:

Vilas Mawal
Vilas Mawal - 03.08.2023 11:13

The website I am scraping from gives me captcha to check whether I am human or not. Is there any way to avoid this??

Ответить
Jerry Cheng
Jerry Cheng - 10.07.2023 02:53

Why it returns an empty list in book_categories when I typed "book_categories = driver.find_elements(By.CLASS_NAME, 'category-page_member-link')"

Ответить
zain iqbal
zain iqbal - 12.05.2023 08:25

Hi Thu, how can i support you besides making purchases on your affiliate links. Let us know

Ответить
D-Rey
D-Rey - 25.03.2023 15:47

any advice with crawling through a website with a lot of "a href" elements, especially when they are child elements. using selenium it seems to struggle. is selenium even the right tool.

Ответить
LVHQ - Làm việc hiệu quả trong thời đại số
LVHQ - Làm việc hiệu quả trong thời đại số - 12.03.2023 21:13

Thanks!

Ответить
imaveri
imaveri - 12.03.2023 06:45

Such a good video . I loved it 🙌🙌🙌🙌🙌

Ответить
Ifeanyi Nwobodo
Ifeanyi Nwobodo - 09.03.2023 13:26

Thanks for the video
I'm having issues in this project, it keeps giving me


AttributeError: module 'selenium' has no attribute 'Chrome'

Please what can I do? Your input will be highly appreciated

Ответить
Frenzymove
Frenzymove - 18.02.2023 18:53

You're so rush

Ответить
Felipe Akira Fukue
Felipe Akira Fukue - 18.01.2023 19:31

Very good content, congratulations for the videos and for the didactic. It's a lot of fun to study and follow your content, it made me enjoy using Python again

Ответить
James C
James C - 17.01.2023 22:37

Awesome :)
part 2 please :D

Ответить
Fokrul Amin Rasel
Fokrul Amin Rasel - 15.12.2022 13:11

driver.get(pageURL) is showing error. can you help me.

Ответить
Creativity Unleashed
Creativity Unleashed - 14.12.2022 07:48

How to write code to scrap information from websites on first 3 pages of google search engine. I mostly see solution for single URL but to extract information. I need something comprehensive. Can I give website as argument?

Ответить
d
d - 14.11.2022 17:23

BTW find_element_by_class_name is deprecated. Should be find_element(By.CLASS_NAME, "class name"). Like you used for the xpath

Ответить
Dana Daskalova
Dana Daskalova - 11.10.2022 13:44

Hi Thu, I have one question - the Wiki Terms and Conditions page forbids any kind of scrape of its content. Isn't this a little, let's say not legal? Thanks <3

Ответить
Jordan Rubio
Jordan Rubio - 07.10.2022 23:57

Ah yeah i love the Witcher books! And showing how solve real problems in your process is great since i learn better with visuals this is a great series!

Ответить
Al
Al - 09.09.2022 23:25

This confiemd some doubts I had about what types of framework/libraries I'm using. Thanks!

Ответить
Ekaterina Erikhova
Ekaterina Erikhova - 27.08.2022 20:58

Thank you so much for this video! I've struggled with data scraping for my project, but with your tutorial, I managed to get the data I needed!

Ответить
Sisnando Junior
Sisnando Junior - 09.08.2022 23:23

Thanks for sharing you knowleage Thu! Besides training my English, I learn more about data science. :)

Ответить
NormanWasHere
NormanWasHere - 21.07.2022 01:12

I can not for the life of me get past this error: "WebDriverException: Message: chrome not reachable" whenever I try and run driver.get(page_url)

Ответить
Julien Soyer
Julien Soyer - 28.06.2022 02:47

Thanks for that ! thanks to you I managed to use it for the lore of a game I love !

Ответить