Scrape Data from Multiple Web Pages with Power Query

MyOnlineTrainingHub

3 years ago

130,409 views

Comments:

Jay Li - 13.10.2023 21:28

Great resource! I am curious about replacing 1 with "&PageStart&". Can you explain why we use the double quotes coupled with the double ampersand? Which language/grammar are we following here, M or HTML or something else? I just wanted to learn more coding rules so I can crack the query more freely. I would appreciate any help you could provide.
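
On the quoting question above, here is a minimal sketch in Power Query M, assuming the page number sits in the middle of a URL. The address and the `sort` parameter are made up for illustration; only the `PageStart` name comes from the comment. In M, text literals are wrapped in double quotes and `&` joins pieces of text, so `"&PageStart&"` closes the first literal, splices in the parameter, and opens the next literal.

```powerquery
// Hypothetical URL; only the PageStart name comes from the comment above.
(PageStart as text) as text =>
let
    // Before: "https://example.com/books?start=1&sort=title"
    // The hard-coded 1 is replaced by: " & PageStart & "
    // The & inside the quotes ("&sort=title") is URL syntax;
    // the & outside the quotes is M's text concatenation operator.
    Url = "https://example.com/books?start=" & PageStart & "&sort=title"
in
    Url
```

Calling this function with "2" returns https://example.com/books?start=2&sort=title. If the parameter were a number rather than text, it would need converting first, for example with `Number.ToText(PageStart)`.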

Stephen Cross - 27.09.2023 21:30

Wow, this is clever and exactly what I needed. My mind is blown !!

shrikant badge - 13.09.2023 17:32

I still need to watch this video a few times. Our entire organization doesn't know this, I bet.

shrikant badge - 13.09.2023 16:51

Exclusive...

Naveed Khowaja - 11.09.2023 11:31

Excellent tutorial, super easy to follow. That’s brilliant 👍

Leandro Coelho - 21.08.2023 19:48

Thanks!

gzfraud - 13.08.2023 16:13

I can't get PQ or BI to extract an embedded URL in a webpage table, e.g. an email address embedded in a person's name. Any ideas?

V V Nair - 10.08.2023 15:21

brilliant idea

Pritam Gaba - 23.07.2023 08:24

This is AWESOME!!

gestor - 12.07.2023 11:08

I don't know you, but I love you. Thanks!

Barry K - 02.06.2023 09:53

Is it possible to scrape the URL of each individual book? If yes, how can it be done?

仁です。 - 23.05.2023 11:10

It's useful. Thank you. I am looking for similar data scraper software. Would you mind showing me how to work with Power BI on different websites, please?

Nadeem Shafique Butt - 18.05.2023 00:54

As always, an excellent tutorial

Michael Brown - 15.05.2023 07:31

This is simply awesome, now I have to practice this technique.

Rami Daoud - 21.04.2023 12:19

Can you provide me with an example of using a single Power Query connection to scrape multiple HTML elements from a webpage and load them into corresponding cells?

malanie banney - 19.04.2023 06:53

I slightly adjusted this to scrape data from a folder full of PDF files. Excellent thanks!

Abraham J - 23.03.2023 17:00

How did you come up with 21610 for the list of numbers?

Capri - 17.03.2023 00:29

What if I want to pull all the data available but I don't know how many pages there are?

Duy Vu Ngo - 11.03.2023 15:37

Great example

Timotei Satmarean - 04.03.2023 23:20

Hi. When I get data from web it only extracts a small fraction of the data on the web page. Why is that? I also do not have the option to add table using examples. Excel 2021

Taleb Bagazi - 01.03.2023 11:16

Wow, this is brilliant :D
I have a question: this method can work on any website, am I right? So by learning this, I don't have to figure out a way in Python or another web scraper, am I right?

John G - 03.02.2023 21:11

Thanks a million.

Stephan Onisick - 25.12.2022 19:30

The problem I have is that I want to capture the URLs and I keep getting "No CSS"

Stephan Onisick - 22.12.2022 19:19

How do you get URLs from a web page?

Stephan Onisick - 22.12.2022 18:34

Awesome use of M for us tiptoeing into the M Script!

Ultrascoop - 05.12.2022 19:36

I want to scrape a list of multiple links 😶

Bryan Dadiz - 05.12.2022 16:05

The website is no longer updated.

maria del mar - 12.11.2022 03:18

Hi! Your tutorial is very clear. However, what if the web page you are trying to access needs your credentials first? Do you know how I can go around that? Thank you!

Steve Wilson - 04.11.2022 16:39

Good afternoon, I followed your instructions; however, instead of producing the results from the subsequent URL pages, it just mirrored the results from the first page. Any ideas? Thanks, Steve

Excel Tutorials - 27.10.2022 17:47

Hi Mynda, I'm trying to scrape data from Amazon, but the website does not move past Accept Cookies. Any help?

Nur Ezzati - 21.10.2022 05:07

Hi Mynda. Thank you for sharing it. Very useful. However, is there any way to get the actual URL, since the position keeps changing whenever I refresh the data in Power BI?

Tamás Somogyi - 10.10.2022 15:12

Beautiful. It's solved my actual problem. Thx. :)

Miguel - 22.09.2022 21:32

Maybe someone can help me here. I need to do something similar to this, but simpler in form, since the table I need is already formatted as a table on the web page. The problem is that the URL does not change when I switch pages to see other parts of the table. I hope someone knows how to proceed in this situation. Thanks in advance.

siddharth shil - 22.09.2022 20:13

hey, I don't have a start parameter in my url

Marcel Felipe Machado Lopes - 19.09.2022 04:39

Amazing how easy it is to scrape web pages. Thanks for this excellent tutorial.

Geoff M - 18.08.2022 16:24

To think I was doing this manually 🤦🏽‍♂️. Thank you, this is a huge time saver!

David Stevens - 09.08.2022 23:39

Wow...Easily used this tutorial to query printer settings from every Zebra printer on my LAN. Very helpful!

peter mcallister - 05.08.2022 00:15

Great tutorial! Helped me a lot. But do you have any idea, why "Add Table Using Examples" won't work and throws this message: "This Stencil app is disabled for this browser"?

Clyde Oporto - 02.08.2022 10:15

Hi, is it possible to scrape data from a list of URLs using Power Query?

Máté Pataki - 24.07.2022 19:48

Is it possible to have more than one parameter?

POWER B - 06.07.2022 10:47

Great video, thanks. This makes web scraping a lot easier. Thank you.

SAJAD Abdul Cader - 22.06.2022 20:25

How can we pull data from a web page that publishes daily? The page does not have a date field.
We need the Excel file to be updated daily with that day's data as a table, so we have a history and a date table.

Pritam Mishra - 08.06.2022 21:02

Ma'am, can you please tell me how to scrape data at once from multiple websites that have different URL formats?

Samuel Suárez - 07.06.2022 14:38

Hi! Nice video. I just want to know if there is a way to extract the paragraph that is inside some products using Power Query as well. I was trying, but Power Query only extracts the visible information on the outside. Is there a code or a formula to do this? Thanks!

Maroš Brezovský - 06.06.2022 10:38

Assuming the bookstore adds more pages over time, how can I set the query up so that it checks all available pages? I can't do that now, because when it reaches URLs that do not exist yet, the scraping procedure stops. Is it possible to fix this somehow?

ritvik bolugudde - 27.05.2022 14:04

Thanks a lot!! I was wondering: if the web page is updated, would the loaded data in Power BI update too (so basically, is it real time or not)?

Bali Rakhra - 22.05.2022 16:31

Thank you soooo much! You changed my life this weekend. Been struggling with Excel's limitations for years, and lost countless hours of my life sometimes without even accomplishing my goal. I only discovered the existence of Power Query last night with your video, and you blew my mind. A brilliantly well presented and comprehensive video on it too! It got me partway through my current problem, but now I'm stuck again if you can help?

I've created Query1 to get multiple tables from each webpage, with 10 records each, and it includes a record ID. But each record has a link to a details page with more info for that record. The record ID is used within the URL string to get those details. Can I create a single query that collects the list of records and uses the ID to also collect the details for each record, all in one go?

Also, with 30,000 records in total, it takes hours to refresh. However, as the historic records don't change and have a historic date of filing, is there any way for future updates to only get the latest records (with a filing date after the last date of the previous dataset) and append them to the list, whilst removing any duplicates?

Finally, it would be great if a timestamp could be added in an additional column to denote the date when that query was run, so that I can easily see which data has been added and when. Is any of this possible with PowerQuery?

yassine ouahi - 19.05.2022 21:20

What if my URL is a search page? How can I scrape the data from it, given that the page only has text and no tables?
