linkedIn OpenSource

Zoie

Real-time search indexing

http://sna-projects.com/zoie/

Zoie is a real-time search and indexing system built on Apache Lucene

 

 

 

Bobo

Fast faceted search with Lucene

http://sna-projects.com/bobo/

Bobo is a Faceted Search implementation written purely in Java, an extension of Apache Lucene

 

 

Decomposer
Massive matrix decompositions.
http://sna-projects.com/decomposer/


Norbert
Partitioned routing and cluster management
http://sna-projects.com/norbert/
Norbert is a library that provides easy cluster management and workload distribution. With Norbert, you can quickly distribute a simple client/server architecture to create a highly scalable architecture capable of handling heavy traffic.

  Voldemort
A distributed database
http://project-voldemort.com/

Voldemort is a distributed key-value storage system

 

 

Kafka
A high-throughput distributed messaging system.
http://sna-projects.com/kafka/

Kafka is a distributed publish/subscribe messaging system

 

 

Kamikaze
Doc set compression
http://sna-projects.com/kamikaze/
Kamikaze is a utility package for performing operations on compressed arrays of sorted integers. Search indexes, graph algorithms, and certain sparse matrix representations make heavy use of integer arrays, and special compression techniques are needed to get good compression performance on this data


Krati
A persistent high-performance data store.
http://sna-projects.com/krati/
Krati is a simple persistent data store with very low latency and high throughput


Sensei
A distributed, elastic, realtime, searchable database
http://sna-projects.com/sensei/
Sensei is a distributed database that is designed to handle the following type of query:
SELECT f1,f2...fn FROM members
WHERE c1 AND c2 AND c3..
MATCH (fulltext query, e.g. "java engineer")
GROUP BY fx,fy,fz...
ORDER BY fa,fb...
LIMIT offset,count

 

 

Azkaban
Simple hadoop workflow
http://sna-projects.com/azkaban/
Azkaban is simple batch scheduler for constructing and running Hadoop jobs or other offline processes.



这里是官方 blog  http://sna-projects.com/blog/

你可能感兴趣的:(hadoop,workflow,Lucene,performance,OpenSource)