Data extraction from a pdf

Closed Posted Aug 5, 2007 Paid on delivery
Closed Paid on delivery

A skilled pdf programmer needed to disassemble a pdf and extract text. I need to take a pdfed catalogue, extract the data and use it to populate a database. I could do it manually, but in my time trials it would take about 16 years! I am hoping that you can automate the process and make it happen a bit faster Time is not an issue and neither is language skills ??" being a smart, clever programmer is. I have full time programming/web staff but projects like this, one offs, do come up from time to time and I would love to connect with someone who can help. Future work is definitely a possibility! I have 2 files. A pdf file with the source information and an excel file with a sample of the expected results of parsing the pdf file. The files, for your review are to be found at ??" [url removed, login to view] Bet you never even heard of GRIQUALAND WEST!! Please make me an offer on the time and cost to give me a well documented program, with source code, written in an open source programming language, to extract the information in the pdf file to the excel file (or a fixed field length text file) I have a number of other similar files to run the program on and will need to be able to run the program each year with updated data. I will need to own the resulting program and source code. The first six fields in the excel spreadsheet are the most important. The remaining four fields are the information above each group of data in the pdf and remain in effect until explicitly changed. There is other information in the file, namely more detailed descriptions of the individual items, but for the life of me, I cannot imagine how anyone could automate their extraction! Please help! Jerry

## Deliverables

1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.

2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):

a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment--Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment.

b) For all others including desktop software or software the buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.

3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).

## Platform

window98 SE

Engineering Microsoft MySQL PHP Software Architecture Software Testing Translation Windows Desktop

Project ID: #3182612

About the project

6 proposals Remote project Active Aug 26, 2007

6 freelancers are bidding on average $80 for this job

jawadh

See private message.

$85 USD in 14 days
(83 Reviews)
6.0
redduke

See private message.

$85 USD in 14 days
(11 Reviews)
5.0
raceaseworks

See private message.

$85 USD in 14 days
(12 Reviews)
2.3
edgarcrossvw

See private message.

$85 USD in 14 days
(1 Review)
2.2
alx870

See private message.

$55.25 USD in 14 days
(1 Review)
0.0
decal

See private message.

$85 USD in 14 days
(0 Reviews)
0.0