http://dblab.cs.toronto.edu/courses/443/2013/tutorials/leveldb.html
LevelDB is a high quality B+-tree implementation. We want to use it as a B+tree index to a data file to speed up query processing.
The main functionalities of LevelDB we want are:
For the less patient readers, please refer to this well written user's guide on LevelDB API from the authors of LevelDB.
LevelDB uses a directory to store its data. You should have already experimented with the basics of running LevelDB applications in "Setup" practical learning unit of this course.
We assume that you are using g++ compiler.
Download leveldb-x.x.x.tar.gz from Google Code repository, and uncompress it to some directory.
Run make in .../leveldb-x.x.x/. This compiles the static leveldb library, namedlibleveldb.a
You need to include the libleveldb.a and the pthread library when compiling C/C++ source code that uses leveldb API.
The following command compiles the my_program.cc:
export LEVELDB=~/leveldb-1.9.0 g++ -o my_program my_program.cc $LEVELDB/libleveldb.a -lpthread -I $LEVELDB/include
where we assume that ~/level-1.9.0/ is the path where leveldb was stored.
- #include "leveldb/db.h"
- ...
- /**
- * opens a database connection to "./leveldb_dir"
- */
- leveldb::DB *db;
- leveldb::Options options;
- options.create_if_missing = true;
- leveldb::Status status = leveldb::DB::Open(options, "./leveldb_dir", &db);
LevelDB is a B+tree implementation, so it stores key/value pairs. It assumes that all keys are unique.
In the C++ namespace leveldb, we find that DB class has three important methods:
We will be using Put and Get to write key/value pairs to the B+ tree index, and get them back.
The default ReadOptions and WriteOptions can be constructed by:
- leveldb::WriteOptions()
- leveldb::ReadOptions()
Put(...) models both the key and value as leveldb::Slice. Internally, it's just a byte array, but equipped with many very useful conversion mehods.
We can construct a slice (key or value) from a string literal:
leveldb::Slice key = "CSC443/WINTER";
We can also construct a slice from a C++ string:
leveldb::Slice value = std::string("This is a course about database implementation.")
We can even construct a slice from a pointer.
char *ptr = (char *)malloc(100); leveldb::Slice key(ptr, 100);
Get assumes std::string
It's important to note that when reading key/value pairs from the B+ tree, leveldb returns the value as a C++ string.
LevelDB assumes that all keys must be unique. Sometimes, we may wish to index records on attributes which are not keys of the table. A common trick suggested by the author of LevelDB is to append a running counter to the key.
- long unique_counter = 0;
- char key[10];
- char unique_key[10 + sizeof(long)];
- while(...) {
- ...
- unique_counter ++;
- get_key(record, &key);
- memcpy(unique_key, key, 10);
- memcpy(unique_key + 10, &unique_counter, sizeof(long));
- db->Put(Slice(unique_key, sizeof(unique_key)), value);
- }
We define acomposite key consisting of the key and a longcounter.
LevelDB supports efficient iteration in ascending order of keys.
leveldb::Iterator is a class that supports very useful navigation in the B+ tree.
The following code constructs (and frees) a new iterator:
- leveldb::Iterator* it = db->NewIterator(leveldb::ReadOptions());
- ...
- delete it;
The following code prints out all the key/value pairs in the B+ tree:
- leveldb::Iterator* it = db->NewIterator(leveldb::ReadOptions());
- for (it->SeekToFirst(); it->Valid(); it->Next()) {
- leveldb::Slice key = it->key();
- leveldb::Slice value = it->value();
- std::string key_str = key.ToString();
- std::string val_str = value.ToString();
- cout << key_str << ": " << val_str << endl;
- }
- assert(it->status().ok()); // Check for any errors found during the scan
- delete it;
To iterate over a finite range we can do this:
- leveldb::Slice start = ...;
- leveldb::Slice end = ...;
- leveldb::Options options;
- leveldb::Iterator* it = db->NewIterator(leveldb::ReadOptions());
- for (it->Seek(start); it->Valid(); it->Next()) {
- leveldb::Slice key = it->key();
- leveldb::Slice value = it->value();
- if(options.comparator->compare(key, end) > 0) {
- // key > end
- break;
- } else {
- /**
- * process (key, value)
- */
- }
- }
- delete it;