The Search Guy

Minion: An open source search engine from Sun Labs

Saturday Apr 19, 2008

I just created a java.net project for Minion. Minion is the name that we (which is to say, Jeff) came up with for the open sourcing of the Sun Labs search engine. The engine we're open sourcing is a substantial revision of the engine that ships with the Portal Server and Web Server.

In the simplest terms, Minion provides an API for indexing and searching documents. Minion has a pretty liberal interpretation of what a document is: a document is a map from field names to field values. If you want to index data, you just have to figure out what the fields are that you want indexed and how they should be treated by the engine. The indexer takes a document as a java.util.Map and adds it to the index. This simple model turns out to be fairly useful for a pretty wide range of things.
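To make the "document is a map" idea concrete, here is a minimal sketch in plain Java. It is not Minion's API: the `ToyIndex` class, its `index` and `search` methods, the document keys, and the sample field values are all hypothetical stand-ins invented for illustration. The only thing taken from the description above is that a document arrives as a `java.util.Map` from field names to field values.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Toy stand-in for an indexer. NOT Minion's API; every name here is hypothetical.
class ToyIndex {
    // Maps "field:term" to the keys of the documents containing that term in that field.
    private final Map<String, Set<String>> postings = new LinkedHashMap<>();

    // Accepts a document as a map from field names to field values.
    void index(String key, Map<String, Object> document) {
        for (Map.Entry<String, Object> field : document.entrySet()) {
            // Treat every value as text: lowercase it and split on non-word characters.
            String text = String.valueOf(field.getValue()).toLowerCase(Locale.ROOT);
            for (String term : text.split("\\W+")) {
                if (term.isEmpty()) {
                    continue;
                }
                postings.computeIfAbsent(field.getKey() + ":" + term,
                        k -> new LinkedHashSet<>()).add(key);
            }
        }
    }

    // Returns the keys of documents whose given field contains the term,
    // in the order the documents were indexed.
    List<String> search(String field, String term) {
        Set<String> hits = postings.get(field + ":" + term.toLowerCase(Locale.ROOT));
        return hits == null ? new ArrayList<>() : new ArrayList<>(hits);
    }
}

class DocumentMapSketch {
    // Builds a small index from two made-up documents.
    static ToyIndex buildSample() {
        ToyIndex index = new ToyIndex();

        Map<String, Object> first = new LinkedHashMap<>();
        first.put("title", "Minion: An open source search engine from Sun Labs");
        first.put("author", "Stephen Green");
        first.put("year", 2008);
        index.index("post-1", first);

        Map<String, Object> second = new LinkedHashMap<>();
        second.put("title", "Comments on the search engine");
        second.put("author", "Jeff");
        index.index("post-2", second);

        return index;
    }

    public static void main(String[] args) {
        ToyIndex index = buildSample();
        System.out.println(index.search("title", "search"));
        System.out.println(index.search("year", "2008"));
    }
}
```

The sketch treats every field the same way, as lowercased text. A real engine would let you say how each field should be treated (tokenized text, a date, a number, and so on), which is the "how they should be treated by the engine" part of the description above.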

As far as querying goes, Minion provides ranked boolean, proximity, and parametric query operators. In addition to the query operations, Minion provides document similarity operations as well as automatic document classification and document clustering capabilities.

Once the project's officially approved (and we clean things up a wee bit), we'll be putting the source code into the java.net repository.

For the next little while, I'll be blogging about the engine, both in general terms and in extremely specific detail.




Comments:

Wow, big news! This is fantastic. I can't wait until the source code is available.

I look forward to reading your posts about the newest OS search library!

Posted by Jeff on April 20, 2008 at 01:11 AM EDT #

Have you considered donating this to ASF (Apache Lucene in particular)?

Posted by Otis Gospodnetic on April 20, 2008 at 09:20 AM EDT #

Jeff, thanks for the kind words. I hope to be blogging a lot about this in the near future.

Otis, I haven't considered that, although I would be perfectly happy to share what's in Minion with the Lucene community. One of the things that I'm hoping to blog about shortly is the way that Minion and Lucene differ (although they have similar underpinnings!).

In an alternate universe where Sun went open source in a big way a little earlier, I would have started working on Lucene in 2001 instead of going my own way...

I'll talk to my management and see what the deal is with donating things to ASF to assess the degree of difficulty of such a donation. So, stay tuned, I guess.

Posted by Stephen Green on April 21, 2008 at 01:23 PM EDT #

This sounds great. Where can I download it and have a play?

Posted by Rob Young on May 06, 2008 at 07:28 AM EDT #
