
Implement Inverted Index with Lucene on Windows XP platform

$30-5000 USD

Closed
Posted about 15 years ago

Paid on delivery
Hello,

I need to implement a custom inverted index in Lucene. I have files like the ones attached here. Each file contains words and a score assigned to each word. There will be hundreds of such files, and each file will have at least 50,000 such name-value pairs. Note: the attached sample files show only tens of name-value pairs, but my real production data will have 50,000 or more pairs per file. I am not indexing web pages; for my application I am indexing the data in the files attached to the project.

A sample query will be "Actress Actuary Administrator Anaesthetist Animator Animal trainer Archaeologist Architect Artist Astronaut Astronomer Athlete trainer author Baker barber Babysitter Bassoonist Beauty queen".

Currently I index the data using Lucene's inverted index. The query executed against the index contains tens of words, unlike the usual one- or two-word queries that people run. When the query is executed against the index, the result is returned in about 100 milliseconds.

Problem: once I have the results of the query, I have to go through each matching file (for example, attached file one) and, for each word in the user's input query, look up its score and compute the total score. Doing this against hundreds of files and hundreds of keywords makes the score computation slow, about 3-5 seconds. I need help resolving this so that score computation takes less than 200 milliseconds or so.

One resolution I was considering is to modify the Lucene source code that creates the inverted index so that the score is stored in the index itself. When the results of the query are returned, we would get the scores along with the file names, thereby eliminating the need to search each file for the keyword and its corresponding score. I need to compute the total of all scores that belong to each single file. I am also open to any other ideas that you may have.

I need the complete source code and changes required to get this to work, such that the execution time of the program is less than 300 milliseconds. Note: payment will only be released once I have determined that execution takes 300 ms or less. If it takes anything above that, the code is useless to me and payment will not be released. I am using Java-based Lucene for the inverted index.

Thanks,
Akash
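
For illustration, the idea of keeping the score inside the index can be sketched in plain Java, without touching Lucene's source. This is only a minimal sketch under stated assumptions: the class ScoredInvertedIndex, its methods, the file names, and the sample scores are all hypothetical, terms are assumed to be matched case-insensitively, and no Lucene API is used. Each term maps to a list of (file, score) postings, so the per-file totals can be accumulated in one pass over the query terms instead of re-reading every file.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical sketch: an in-memory inverted index whose postings carry the score.
class ScoredInvertedIndex {

    // One posting: a file that contains the term, and the score assigned there.
    private static final class Posting {
        final String fileName;
        final double score;

        Posting(String fileName, double score) {
            this.fileName = fileName;
            this.score = score;
        }
    }

    // term -> postings for every file that lists the term
    private final Map<String, List<Posting>> postings = new HashMap<>();

    // Assumption: terms are normalized to lower case before lookup.
    private static String normalize(String term) {
        return term.trim().toLowerCase(Locale.ROOT);
    }

    // Index one "word, score" pair read from a file.
    public void add(String fileName, String term, double score) {
        postings.computeIfAbsent(normalize(term), k -> new ArrayList<>())
                .add(new Posting(fileName, score));
    }

    // Sum, per file, the scores of every query term that the file contains.
    public Map<String, Double> totalScores(List<String> queryTerms) {
        Map<String, Double> totals = new HashMap<>();
        for (String term : queryTerms) {
            List<Posting> list = postings.get(normalize(term));
            if (list == null) {
                continue; // term does not occur in any indexed file
            }
            for (Posting p : list) {
                totals.merge(p.fileName, p.score, Double::sum);
            }
        }
        return totals;
    }

    public static void main(String[] args) {
        ScoredInvertedIndex index = new ScoredInvertedIndex();
        // Made-up sample data standing in for the attached files.
        index.add("file1.txt", "Actress", 0.5);
        index.add("file1.txt", "Baker", 1.5);
        index.add("file2.txt", "Baker", 2.0);

        Map<String, Double> totals =
                index.totalScores(Arrays.asList("Actress", "Baker", "Astronaut"));
        for (Map.Entry<String, Double> e : totals.entrySet()) {
            System.out.println(e.getKey() + " -> " + e.getValue());
        }
    }
}
```

With this layout the work per query is proportional to the number of postings for the query terms, not to the size of the files. Whether hundreds of files with 50,000 pairs each fit comfortably in memory, and whether this meets the 300 ms target, would have to be measured on the real data.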
Project ID: 3889438

About the project

1 proposal
Remote project
Active 14 yrs ago

1 freelancer is bidding on average $170 USD for this job
$170 USD in 14 days
5.0 (4 reviews)
2.4

About the client

5.0
21
Member since May 8, 2009
