Комментарии:
If the pdf contains 50 pages and I want to parse/extract a particular value in each page. Please help in this context.
ОтветитьHello Aman,
Can u please let me know why u have used != -1
in second for loop in if condition
what is the purpose of not equal to -1, sir ?
ОтветитьPlease or make a video on extracting the checkboxes from the word document or pdf
ОтветитьHi..thanks its great learning..do u also do freelancing?
ОтветитьIt's very useful.Thank you Aman.
ОтветитьI am really enjoying your NLP series. Thank you for making it look as simple as this.
ОтветитьThank you so much! This is great. I have a question though....¿How would you save this information in JSON format? : D
ОтветитьHOW CAN I SCRAPE A KANNADA PDF TO UNICODE IN PYTHON
ОтветитьDear Sir,
Thanks for This Video , Is there any way that I can enter a word and search in thousands of pdf and the pdf which contains the word will open.
Great video! Thank you for sharing!
ОтветитьHow do I extract specific data from invoice having different formats, please help sir.
Ответитьfinished watching
ОтветитьPlease do it with pdfminer
ОтветитьGreat video sir, how do I save those values in a CSV file? And my second question is how do I split on next line rather than : ?
ОтветитьSir, My Folder Has Various Files Like
txt,docs,excel,pdf etc then what is the solution? Can you make a separate video for them?
Could you please suggest if in case all the Invoices format are different each other.
ОтветитьGIving an error at this line ---> invoice_no = file_contents[i].split(': ')[1]
ERROR: IndexError: list index out of range
I tried & replicated same format of bills in word and saved them in PDF format, used random values in invoice, date and amount.
Please suggest!
for match in self._lang_vars.period_context_re().finditer(text):
TypeError: expected string or bytes-like object
while performing tokenization
please help
Still learning Python and your simple teaching style is really helpful.
You got yourself a subscriber sir. Thanks!
Hi, I am getting this error 'PdfReadWarning: Xref table not zero-indexed. ID numbers for objects will be corrected. [pdf.py:1736]'. Any idea why that's happening?
ОтветитьBro plz make video on a how to extract data from docs and pdfs and how to add that entities to data frame plz bro
Ответитьvery good thank you.
ОтветитьBur this is not working in my google colab
import os
dir_Path = 'C://Users//server//Desktop'
os.chdir(dir_Path)
print(dir_Path)
The eror which i am getting is
FileNotFoundError Traceback (most recent call last)
<ipython-input-13-13a426d276e1> in <module>()
1 import os
2 dir_Path = 'C://Users//server//Desktop'
----> 3 os.chdir(dir_Path)
4 print(dir_Path)
FileNotFoundError: [Errno 2] No such file or directory: 'C://Users//server//Desktop'
Please guide me
How can i parse doc file its very challenging one in Windows 10 Python?
Thanks in advance
The content is simple yet very useful to start with.
ОтветитьAWESOME GR8
ОтветитьIs there a way to get through the pages of the file? I don't want just the informations on page 0.
ОтветитьI want to learn web scraping from basic to advance. If u are providing the online classes plz let me know sir 🙏
ОтветитьThis is such a great simple playlist. Thank you.
ОтветитьSo useful! This helped me automate a huge amount of work for my company. Thank you very much
ОтветитьThanks for the information sir
ОтветитьGood presentation...
ОтветитьNice work
ОтветитьNice video on Doc parsing.
Ответить