How to extract data from PDFs with RegEx in UiPath - Full Tutorial

This video tutorial shows how to extract data from one or more PDFs using RegEx in UiPath. It first extracts the data we want from a single invoice PDF, then from all invoices in a folder.

0:17 Inspecting our data
We have PDF invoices in a folder that we want to extract data from automatically with UiPath.

1:24 Install PDF package
In Manage Packages, we find the PDF activities package and install it.

2:22 Read PDF Text
Read the PDF into a string we can work with in UiPath. Copy the extracted text so we can test patterns against it in an online RegEx editor.

3:28 RegExr - Online RegEx editor
Use an online RegEx editor to test your RegEx pattern easily and check which data will be extracted.

5:17 How to use the Matches activity in UiPath
Specify the input, the pattern, and the output. The output is an IEnumerable, which you can think of as a list that holds the data matched in the invoice.
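For readers who want to try the same idea outside UiPath, here is a minimal Python sketch of the input / pattern / output flow. It is not the Matches activity itself: the invoice text and the pattern are made-up assumptions, and Python's `re.finditer` stands in for the activity.

```python
import re

# Hypothetical text, standing in for the string that Read PDF Text returns.
invoice_text = """Invoice Number: INV-1001
Date: 2020-03-14
Total: 1,250.00
"""

# Hypothetical pattern: capture whatever follows the "Invoice Number:" label.
pattern = r"Invoice Number:\s*(\S+)"

# finditer yields match objects one by one; list() collects them so we can
# index into the result, much like the list-style output described above.
matches = list(re.finditer(pattern, invoice_text))

# Take the first match, guarding against an empty result.
invoice_number = matches[0].group(1) if matches else None
print(invoice_number)
```

Checking for an empty result before indexing matters in practice, because an invoice with a different layout simply produces no matches.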

6:47 Creating Data Tables
We create one Data Table for our RegEx patterns and one for the outputs.
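The two-table idea can be sketched in Python as a list of (field, pattern) pairs plus one output row per invoice. The field names, patterns, sample text, and the `extract_row` helper are all illustrative assumptions, not taken from the video.

```python
import re

# Hypothetical field names and patterns; adjust them to the real invoice layout.
patterns = [
    ("InvoiceNumber", r"Invoice Number:\s*(\S+)"),
    ("Date", r"Date:\s*(\d{4}-\d{2}-\d{2})"),
    ("Total", r"Total:\s*([\d.,]+)"),
]

def extract_row(text):
    """Apply every pattern to the text and return one output row as a dict."""
    row = {}
    for field, pattern in patterns:
        match = re.search(pattern, text)
        # An empty string marks a field that was not found in this invoice.
        row[field] = match.group(1) if match else ""
    return row

sample_text = "Invoice Number: INV-1001\nDate: 2020-03-14\nTotal: 1,250.00\n"
row = extract_row(sample_text)
print(row)
```

Keeping the patterns in a table rather than hard-coding them means a new field only needs a new row, not a change to the loop.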

11:29 Writing our extracted data to a Data Table

13:42 Write Range
Use a Write Range to write the Data Table to Excel.
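As a rough Python counterpart, the sketch below writes the output rows to CSV, which Excel can open. This swaps in the standard `csv` module for Write Range and is not how UiPath does it; the rows and the `write_rows` helper are hypothetical.

```python
import csv
import io

# Hypothetical rows, one per invoice, as produced by an extraction step.
rows = [
    {"InvoiceNumber": "INV-1001", "Date": "2020-03-14", "Total": "1,250.00"},
    {"InvoiceNumber": "INV-1002", "Date": "2020-03-15", "Total": "980.50"},
]

def write_rows(rows, handle):
    """Write a header line followed by one line per row."""
    writer = csv.DictWriter(handle, fieldnames=list(rows[0].keys()))
    writer.writeheader()
    writer.writerows(rows)

# An in-memory buffer keeps the sketch self-contained; pass a file opened with
# open("invoices.csv", "w", newline="") to write to disk instead.
buffer = io.StringIO()
write_rows(rows, buffer)
csv_text = buffer.getvalue()
print(csv_text)
```

Values that contain a comma, such as the first total, are quoted automatically by the writer so the columns stay aligned.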

15:21 Loop through our invoices in the folder
Use a For Each to loop through our invoice files in the folder.
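The folder loop might look like this in Python. The `process_folder` helper and its `read_text` argument are assumptions made for illustration: `read_text` stands in for the Read PDF Text step, and the demo uses placeholder text files rather than real PDFs.

```python
import tempfile
from pathlib import Path

def process_folder(folder, read_text):
    """Call read_text on every PDF in the folder and collect the results.

    read_text is a hypothetical callable (path -> str) standing in for the
    Read PDF Text activity; no real PDF parsing happens in this sketch.
    """
    results = []
    # sorted() gives a stable order; the glob only matches the .pdf extension.
    for pdf_path in sorted(Path(folder).glob("*.pdf")):
        results.append((pdf_path.name, read_text(pdf_path)))
    return results

# Demonstration with placeholder files in a temporary folder.
with tempfile.TemporaryDirectory() as tmp:
    for name in ("invoice1.pdf", "invoice2.pdf", "notes.txt"):
        (Path(tmp) / name).write_text("Invoice Number: " + name)
    processed = process_folder(tmp, lambda p: p.read_text())

print(processed)
```

Returning the file name next to the text makes it easy to record which invoice each extracted row came from.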

#uipath #rpa #automation
Comments

that was exactly what I was searching for. Thank you for the great work!

pnard

I wouldn't get anywhere without these guides. Thanks

pattok

Thank you, Anders, for sharing the knowledge with us! You are great! I work in HR and you gave me the idea of how to extract the salaries, ranks and positions from employees' offer letters. I look forward to building my own robot. Thank you so much for the lesson and good luck!

mihaeladraghici

Anders thank you very much for this video ... it was very good the way you did, it was exactly what I was looking for, please continue with your channel I hope it grows much more

Kind Regards

adrianosouza

Thanks, thanks a lot for this amazingly valuable tutorial! Not only for the content, but also for your clear explanation.

martaduino

Thank you, Anders, for this awesome video!

gxutstw

Great job! Very helpful video, I am waiting for the next ones.

LUKSUSq

Hi Anders, I have watched almost all your videos. They are great and make all the concepts clear. Thanks a lot. If possible, could you please demonstrate all the request methods of the HTTP Request activity? If a video on this already exists, could you please share it?

sayalichincholikar

Hi Anders, I love your tutorial videos, they are simple and easy to understand. Can I check with you whether you know how to read a downloaded PDF and rename it with text extracted from within the PDF?

jeremylee

Hello Anders, perfect video all works very well, I am only struggling a little bit with regex, could you please help me? I need to match the line below the finding.

jnblnr

Hi Anders, thank you very much for all of your videos. Have you ever used RPA on an application that runs on Silverlight?
Silverlight basically prevented RPA from detecting the UI elements, didn't it? I was thinking that a PDF has the same properties as an application that runs on Silverlight.
The reason I mention PDF is that Silverlight is now deprecated, so I couldn't install Silverlight for my app to test it out. Does RPA work on PDF files the same way it works on Silverlight? Please advise.

VICKY

Hi Anders,
Thank you again for the very useful video. It is just what I was looking for.
The difficulty I see is telling real (text-based) PDFs apart from PDFs that contain only an image. Is there a way in UiPath to identify which is which, so I can choose between Read PDF Text and OCR?

RobertWalthery

Thank you for the wonderful video Anders. I have a question that comes to mind: do you have to do the RegEx procedures again for every type of invoice or is there a way to classify the invoices beforehand, so you can assign RegEx values to them based on a pre-screen? Best, Max

AmorFatiYT

Hi Jenson, could you please provide the sample input which you have used for document understanding?

Nagendra

Thank you for this awesome video, just one question though.
Is there a way to also add the name of the file that the text is being extracted from?

If yes, please let me know how. It would be helpful.

saalemrafiq

Hi Anders,
Thank you for this video.
I tried it too. It worked great and very fast.
How can I get the invoice items (multiple lines) into Excel?

TevfikBagcivan

Hi Anders, I have a question for you about the data table output. Is it possible to assign variables to columns in the data table? For example, I would like to have invoice number as a variable. Great video by the way. I learned a lot! Thank you.

andrewandrew

Thanks for helping me to learn UiPath, I have learned a lot from your videos.
This is my humble request: can you please enlarge the font when you enter the code in UiPath? Sometimes it's hard to read because of the small font.

rajayushgupta

Thanks for your support, Anders, but one question: I have a bunch of documents in one PDF, and I need to extract some important fields from it. Can I follow this video as it is, or do I need to make any changes?

ashokdondapati

Hey Andersen, this is great learning!
Could you please help me with "How can we extract data from multiple images to Excel using OCR"? Let's say it's an expense receipt, and each receipt will have a different format, so how do I extract Name, Date, Description & Amount from that image?
Any help, please!

arunarumugam