• John Zhang's avatar
    luindex: new enwiki workload · 644553fa
    John Zhang authored
    - add `--linedoc` and `--dirwalk` option to specify doc-per-line
      format and directory walk;
    - add uncompressed Wikipedia archive used for Lucene benchmark;
    - modified benchmark harness;
    - currently referencing the Wikipedia data relative to scratch,
      this needs to be changed later with external data feature.
    - download and prepare data at build-time
enwiki.txt.MD5 33 Bytes