Skip to content

Commit

Permalink
Merge branch 'master' into snowflake-indexing
Browse files Browse the repository at this point in the history
  • Loading branch information
lintool committed Dec 7, 2024
2 parents 710968d + f9f73c2 commit 4884f5d
Show file tree
Hide file tree
Showing 777 changed files with 20,063 additions and 1,290 deletions.
113 changes: 79 additions & 34 deletions README.md

Large diffs are not rendered by default.

3 changes: 3 additions & 0 deletions docs/experiments-msmarco-passage.md
Original file line number Diff line number Diff line change
Expand Up @@ -538,3 +538,6 @@ The BM25 run with default parameters `k1=0.9`, `b=0.4` roughly corresponds to th
+ Results reproduced by [@b8zhong](https://github.com/b8zhong) on 2024-11-23 (commit [`c619dc8`](https://github.com/castorini/anserini/commit/c619dc8d9ab28298251964053a927906e9957f51))
+ Results reproduced by [@ShreyasP20](https://github.com/ShreyasP20) on 2024-11-24 (commit [`c619dc8`](https://github.com/castorini/anserini/commit/c619dc8d9ab28298251964053a927906e9957f51))
+ Results reproduced by [@Divyajyoti02](https://github.com/Divyajyoti02) on 2024-11-24 (commit [`a1bcf88`](https://github.com/castorini/anserini/commit/a1bcf8853062da9f73915c873968a4e998d4e904))
+ Results reproduced by [@nihalmenon](https://github.com/nihalmenon) on 2024-11-29 (commit [`aa91e2db`](https://github.com/castorini/anserini/commit/aa91e2db224c752cc6a947bdf66bdbf9ce90af25))
+ Results reproduced by [@sherloc512](https://github.com/sherloc512) on 2024-12-04 (commit [`9e55b1c`](https://github.com/castorini/anserini/commit/9e55b1c97fced46530dac1f78975d19635ffaf7a))
+ Results reproduced by [@zdann15](https://github.com/zdann15) on 2024-12-04 (commit [`9d311b4`](https://github.com/castorini/anserini/commit/9d311b4409a9ff3d79b01910178eaec3931f0abe))
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-arguana.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-arguana.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-arguana.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-arguana.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-arguana.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-arguana.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-arguana.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-bioasq.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-bioasq.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-bioasq.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-bioasq.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-bioasq.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-bioasq.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-bioasq.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-bioasq.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-climate-fever.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-climate-fever.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-climate-fever.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-climate-fever.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-climate-fever.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-climate-fever.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-climate-fever.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-climate-fever.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-cqadupstack-android.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-cqadupstack-android.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-cqadupstack-android.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-cqadupstack-android.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-cqadupstack-android.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-cqadupstack-android.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-cqadupstack-android.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-cqadupstack-android.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-cqadupstack-english.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-cqadupstack-english.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-cqadupstack-english.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-cqadupstack-english.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-cqadupstack-english.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-cqadupstack-english.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-cqadupstack-english.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-cqadupstack-english.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-cqadupstack-gaming.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-cqadupstack-gaming.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-cqadupstack-gaming.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,8 +43,6 @@ bin/run.sh io.anserini.index.IndexHnswDenseVectors \
```

The path `/path/to/beir-v1.0.0-cqadupstack-gaming.bge-base-en-v1.5/` should point to the corpus downloaded above.
Note that here we are explicitly using Lucene's `NoMergePolicy` merge policy, which suppresses any merging of index segments.
This is because merging index segments is a costly operation and not worthwhile given our query set.

## Retrieval

Expand Down
Loading

0 comments on commit 4884f5d

Please sign in to comment.