Apache Solr for Drupal 6 Integration - Phase I - Research
$250-750 USD
Completed
Posted over 9 years ago
$250-750 USD
Paid on delivery
Background
Our company hosts several websites that presently have large and ever-growing databases. Queries have become more cumbersome and Drupal Core search has gotten slow. We feel that scalability is a two part endeavor. 1) Create and archival system so as to limit database growth and 2) implement Solr as a mechanism to aid in efficient data processing for the lifetime of the applications.
The scope of this project is solely concerned with fleshing the second part - 2) Solr integration. In general we’re looking for someone to take a highly consultative approach to our ideas. We are Drupal developers, but we are not experts in Solr, and we are very (very) busy. We are happy to hear many differing ideas around how to best make this integration happen. Please see the subsequent sections for further information on what our desired outcomes are.
Our websites are presently running Drupal 5, but we are beginning the upgrade process to get them ready for Drupal 6.
Phase II of this project will be assigned separately, but will ultimately implement the findings in this research project, part I.
Attached is the Word version of this description. It is far more legible.
Timeline
The research should be completed within two weeks of assignment.
A) Desired Architecture
Please comment upon each point herein and make recommendations for how to proceed with the work involved. Your recommendations need not agree with the ideas below.
Solr as a standalone server
Almost every webpage will be hitting Solr for data. We estimate that a dedicated server will be best.
Solr running on Tomcat or Jetty. Why?
Solr version?
Tuning Solr to allow for it to utilize the majority of server resources
Solr acting as a service to the afore-mentioned websites
Inter-application security - no website may access anything other than their Solr core/index
Secured to the outside world - only our servers should be able to reach the Solr server. All our servers are presently in the same data center and can be privately networked.
Ideally we’d be able to whitelist by IP so that our developers could call the Solr server directly (optional)
B) Desired Solr Features, Questions, & Considerations
Please comment upon each point herein and make recommendations for how to proceed with the work involved. Your recommendations need not agree with the ideas below.
What is the query language?
Boolean?
NOT-logic? NOT-logic with facets?
Advanced boolean such as proximity, wildcard, stemming?
Provide documentation
Field-specific (as compared to taxonomy-specific) searching (date ranges, authors, nodetype-specific fields)
Faceted search
Can this include NOT logic?
Autocomplete as you type (spell check? Suggestions?)
File indexing
Solr-provided contextual extract
Really only need to search nodes at present. No need to support user or other searches.
Suggestions for rich features that we may be missing
C) Drupal Considerations & Constraints
Drupal modules MUST be Drupal 6
MUST work with Drupal’s access rights scheme
We do not run views or CCK. Most content types are very simple, but can vary from website to website
What are the appropriate modules required to give us the features listed above
We’d like to get by with the fewest modules possible
No multi-lingual requirements
Deliverables
A detailed Word document explaining and expanding on ALL the points above in parts A, B, and C. It should include:
A comprehensive look at each bullet point
Your ideas and opinions
Issues and constraints
As many screenshots and visuals as is possible
Pls dont award before discussing
For solution for the query on the ever-growing data in the databases your choice of solr as search platform what we recommend to our clients for huge data retrieving needs.
Only query optimization for RDMS can not provide the desired results. And we need to introduce the search indexing
We have implemented solr search for different projects. We did implement solr for a Drupa 5 php website where we had to deal with a databse size of 110 GB inone of our of our porjects ( [login to view URL]). Solr plays a pivotal role for the success of one of our multi-vendor-drupal 6 project - [login to view URL]
For the D5 implementation Views & Cck were not used but for D6 we used slor with views and we made custom module for faceted search.
As you will be upgrading the websites to D6 from D5, we understand we need to implement solr for D6.
We used apachesolr for D6 implementation. We will need to understand the relationship exists between different content type and we need to make solr schema very carefully for getting optimum performance.
As you have multiple websites we need to decide on the schema for each website or a group of website.
We will provide the detailed analysis you sought and we might make 'POC' in support of our argument.
What we experience only using contributed module might not offer desired result. We might have to make custom module. Now on question on using tomcat or jetty we would recommend jetty.
$789 USD in 14 days
5.0 (65 reviews)
7.5
7.5
8 freelancers are bidding on average $1,495 USD for this job
Warm Regards, I am pleased to inform you that your PR matches our expertise who own all the right credentials to execute this job in a very proficient I request you to please initiate the discussion so that I can reach you efficiently, I can schedule an in-person to have better understanding on every minute requirement, final quote and deadline. Ready to share Portfolio on demand.(Committed instant reply) | Regards :Mike Blane
As a Solr expert of many years, I have answers to all your questions.
But to give proper advice, I have a few questions as well.
Do you have access to servers with SSD-disks?
How many queries per second does Solr have to handle?
Do you have experience with either Tomcat or Jetty?
Why is the NOT logic so important?
Do you have an example of a website that Solr will be used for?
Feel free to contact me for any questions you might have.
Thank you for your time.
Kind regards,
Okke Klein