Actually, I need to to load manually information from aprox 1000 pdf files/day (invoices, purchase orders, etc.). Those files have approximately 60 different templates and resides in a Windows Server with a folder tree like: "d:\Folder\Folder\ Folder\001\pdffile00[1...n].pdf"
I need to do:
1. Connect to the Win server and pull the files to a linux server preserving directory structure
2. Once the files are in the linux server extract the info I need (between 20 and 60 fields per file) and load this info into a table in MariaDB.
1. To pull the files could be a shell script
2. To read the files a java application is preferred because, later we will need to integrated into
OpenKM ([login to view URL]) as an extension
Please send your offer in hour/man a $/hour.
9 freelancers are bidding on average $8/hour for this job
Hi, We are a team of java and python developers who ensure on time task completion with complete customer [login to view URL] find our portfolio below [login to view URL]
Hi, I am a java developer for almost 5 years and I know shell script as well. Extracting records from pdf is easy if PDFs aren't locked. So tell me if you want this to be done. Have a nice time