Results of the SISAP 2025 Indexing Challenge

Participating Teams

TeamMembersTaskPaper
BrownCICESEFoster, Magdaleno-Gatica, Kimia1, 2Refinement-Based Graph Construction for Search in Low-Memory Systems
cm-lllLou, Ma, Luo, Ruan, Wu, Lu, Mao1Memory-Constrained DiskANN
Crusty CodersDearle, Connor, Claydon, McKeogh1, 2Fast, Compact NN-Table Build Using Equi-Voronoi Polytopes
DCC-UChileBustos, Chen2Solution for Task 2 Using Root Join
hforestImamura1, 2
JLapeyraLapeyra1, 2
TeamDoubleFilteringHiguchi, Imamura, Shinohara, Hiratta, Kuboyama1Double Filtering Using Short and Long Quantized Projections

Four teams participated in both tasks, two teams focused solely on Task 1, and one team addressed only Task 2. Two baselines were provided: BL-SearchGraph (a graph-based index using SimilaritySearch.jl) and BL-Bruteforce (parallel exhaustive scan), both using PCA projection and 8-bit scalar quantization.

Task 1: Resource-limited Indexing

Task 1 required memory-efficient approximate nearest neighbor search on the PUBMED23 dataset (23.9 million 384-dimensional vectors, 35 GB on disk). Solutions operated under 16 GB RAM, 8 virtual CPUs, and a 12-hour wall-clock limit. The ranking criterion was the highest search throughput (queries/second) among configurations achieving at least 0.7 average recall for k=30k=30 nearest neighbors using out-of-distribution queries.

Results for Task 1

TeamRankRecallBuild time (s)Query time (s)Throughput (q/s)Container time (s)Median rmem (GB)Max rmem (GB)
BL-SearchGraph10.73224,3200.6016,7694,6679.613.2
BrownCICESE20.78849,5631.446,9289,64614.214.3
TeamDoubleFiltering30.72129.251,0813249.69.6
hforest40.70532,24315.706372,59412.216.0
cm-lll50.83476,41934.612896,45711.611.6
Crusty Coders60.80482,980178.00563,16114.514.5
JLapeyra71.0000870.32118737.915.3
BL-Bruteforce80.855901,265.1581,5885.25.3

The ranking was led by the BL-SearchGraph baseline. The top-performing participant was BrownCICESE (rank 2) with a query time of 1.44 seconds. TeamDoubleFiltering stands out for overall efficiency: its total pipeline time of 324 seconds is nearly five times faster than the brute-force baseline, and its query phase is over 100 times faster. All participating teams surpassed the BL-Bruteforce baseline.

The following figure shows the speed–recall trade-offs for teams with multiple hyperparameter configurations.

Performance for Task 1

Task 2: kk-NN Graph Construction

Task 2 required memory-efficient approximation of the kk-nearest neighbor graph for k=15k=15 on the GOOAQ dataset (3 million 384-dimensional vectors, 7.4 GB on disk). The same resource limits applied (16 GB RAM, 8 vCPUs, 12 hours). Solutions were ranked by total container time for configurations achieving at least 0.8 average recall.

Results for Task 2

TeamRankRecallAll-knn time (s)Container time (s)Median rmem (GB)Max rmem (GB)
hforest10.8049991057.17.4
BL-SearchGraph20.82571121651.92.3
BrownCICESE30.81984464506.26.2
Crusty Coders40.80125425483.13.1
BL-Bruteforce (160d)0.5210*9,3789,4101.21.6
JLapeyra0.994461,43061,433*5.414.8
DCC-UChile0.5432*113,203113,213*8.216.0

* Did not meet all constraints (time limit and/or minimum recall).

The hforest team achieved the first position, completing the task in just 105 seconds. BL-SearchGraph was second at 165 seconds, followed by BrownCICESE (450 s) and Crusty Coders (548 s). JLapeyra achieved near-perfect recall but exceeded the 12-hour time limit. DCC-UChile did not meet either the time or recall constraints.

Performance for Task 2

Winner Ceremony

The diverse set of approaches were presented during a special session at SISAP 2025.

Honorable Mentions

The challenge committee awarded two honorable mentions for outstanding contributions:

BrownCICESE (Tasks 1 & 2), represented at the conference by Cole Foster, received an honorable mention for their competitive performance across both tasks. Their scalar-quantization approach with HSP-based graph indexing achieved the highest throughput among all participants in Task 1 and placed third in Task 2, demonstrating a versatile and effective solution.

Cole Foster representing team BrownCICESE at the award ceremony

Crusty Coders (Task 2), represented at the conference by Alan Dearle, Richard Connor, and Ben Claydon, received an honorable mention for their novel ultra-quantization technique. Their 2-bit compression scheme with a modified NN-Descent algorithm achieved a compact and efficient kk-NN graph construction, placing fourth in Task 2 with a total time of 548 seconds while using only 3.1 GB of memory.

Alan Dearle, Richard Connor, and Ben Claydon representing team Crusty Coders at the award ceremony

Reference

For the full technical details, see the Overview of the SISAP 2025 Indexing Challenge by E. S. Tellez, E. Chavez, M. Aumüller, and V. Mic, published in the SISAP 2025 proceedings.

CC BY-SA 4.0 sisap challenge committee. Last modified: March 03, 2026. Website built with Franklin.jl and the Julia programming language.