Command Line Tools
Roy Guo edited this page Apr 18, 2017
We provide several command line tools that let you try our algorithms without writing any code.
Terark Wiki Full Documentation
You can download the package from Terark Downloads:
- terark-fsa_all-Linux-x86_64-g++-VERSION-bmi2-0.tgz
- works on older machines
- terark-fsa_all-Linux-x86_64-g++-VERSION-bmi2-1.tgz
- only works on Intel Haswell or newer CPUs
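To decide between the two packages on a Linux machine, you can check whether the CPU reports the `bmi2` flag. The sketch below assumes that the `bmi2-1` package is the one that needs BMI2 instructions (inferred from the file name and the Haswell requirement, not stated explicitly here); `pick_variant` is a hypothetical helper name.

```shell
# Print the package variant suffix for the CPU flags listed in a
# cpuinfo-style file. Assumption: "bmi2-1" is the BMI2-enabled build.
pick_variant() {
  if grep -qw bmi2 "$1" 2>/dev/null; then
    echo "bmi2-1"
  else
    echo "bmi2-0"
  fi
}

# /proc/cpuinfo exists on Linux only; skip quietly elsewhere.
if [ -r /proc/cpuinfo ]; then
  echo "Suggested package: terark-fsa_all-Linux-x86_64-g++-VERSION-$(pick_variant /proc/cpuinfo).tgz"
fi
```

If in doubt, the `bmi2-0` package is the safer choice, since it also works on older machines.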
After you download the package and unpack it, you will find these directories:
- root/bin
- Command line tools
- root/lib
- Libraries
- root/samples/bin
- Examples and benchmark tools
- root/samples/src
- Code examples
NOTE: All the tools are named with a .exe suffix, but they are executable on Linux and MacOS.
Please add all the libraries from the lib directory to your library load path.
# For Linux
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:LIB_DIR
# For MacOS
export DYLD_LIBRARY_PATH=$DYLD_LIBRARY_PATH:LIB_DIR
If you have multiple versions of gcc installed, please make sure the libraries of the correct version are also in the library path.
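The library path setup can be wrapped in a small script that picks the right variable for the current OS. This is a minimal sketch: `append_path` is a hypothetical helper, and `LIB_DIR` is a placeholder that should point at the `lib` directory of the unpacked package (the default below is only a guess).

```shell
# Append a directory to a colon-separated path value, avoiding an empty
# leading entry when the current value is unset or empty.
append_path() {
  if [ -n "$1" ]; then
    printf '%s:%s' "$1" "$2"
  else
    printf '%s' "$2"
  fi
}

# Placeholder: set LIB_DIR to the unpacked package's lib directory.
LIB_DIR="${LIB_DIR:-$PWD/lib}"

case "$(uname -s)" in
  Linux)
    LD_LIBRARY_PATH="$(append_path "${LD_LIBRARY_PATH:-}" "$LIB_DIR")"
    export LD_LIBRARY_PATH
    ;;
  Darwin)
    DYLD_LIBRARY_PATH="$(append_path "${DYLD_LIBRARY_PATH:-}" "$LIB_DIR")"
    export DYLD_LIBRARY_PATH
    ;;
esac
```

Source the script (for example `. ./setenv.sh`, a hypothetical file name) rather than executing it, so the exported variable stays in your current shell.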
- nlt_build.exe
- Build a Terark Nested Succinct Trie; TerarkDB's index is compressed by the same data structure.
- Example:
- ./nlt_build.exe -o outputfile inputfile.txt
- Each line in the input file is a single key
- zbs_build.exe
- Terark global compression; this algorithm is used for value compression.
- Example:
- ./zbs_build.exe -o outputfile inputfile.txt
- Each line in the input file is a single value
- zbs_unzip.exe
- Decompress all data, or retrieve a single record, from a file built by zbs_build.exe; it can also be used for benchmarking.
- fplcat.exe
- Pack multiple files together, then use zbs_build.exe to compress them. When passing the packed file to zbs_build.exe, you should add a -B parameter.
- adfa_build.exe
- Create an ADFA (Acyclic DFA) from the input file; each line of the file is a KEY. The generated result can be used for key matching (http://nark.cc/p/?p=172).
- ac_build.exe
- Create an AC automaton from the input patterns; the created automaton can be loaded through Terark's core API
- regex_build.exe
- Create multiple regex matching automata from the input regex collection
- pinyin_build.exe
- Create a DFA for PinYin correction.
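As an illustration of the input formats for nlt_build.exe (one key per line) and zbs_build.exe (one value per line), the sketch below splits a tab-separated record file into a key file and a value file, then runs both tools with the `-o` option shown above. The sample data, the output file names, and the `BIN_DIR` default are hypothetical; the build step is skipped if the tools are not found.

```shell
# Work in a scratch directory so the sketch does not clutter the current one.
workdir="$(mktemp -d)"

# Hypothetical sample data: one record per line, key and value separated by a tab.
printf 'apple\tred fruit\nbanana\tyellow fruit\ncherry\tsmall red fruit\n' \
  > "$workdir/records.tsv"

# One key per line for nlt_build.exe, one value per line for zbs_build.exe.
cut -f1 "$workdir/records.tsv" > "$workdir/keys.txt"
cut -f2- "$workdir/records.tsv" > "$workdir/values.txt"

# Placeholder: set BIN_DIR to the unpacked package's bin directory.
BIN_DIR="${BIN_DIR:-./root/bin}"
if [ -x "$BIN_DIR/nlt_build.exe" ] && [ -x "$BIN_DIR/zbs_build.exe" ]; then
  "$BIN_DIR/nlt_build.exe" -o "$workdir/keys.nlt" "$workdir/keys.txt"
  "$BIN_DIR/zbs_build.exe" -o "$workdir/values.zbs" "$workdir/values.txt"
else
  echo "Terark tools not found in $BIN_DIR; skipping the build step." >&2
fi
```

Because both tools treat each line as one record, keys or values that contain embedded newlines would need to be escaped or encoded before being written to the input files.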