We will find several kinds of files. For instance, LIAON parts come in two formats: .npz for embeddings[1] and .parquet files for metadata[2]. We use HDF5 files[3] (.h5) extensively for datasets, projections, queries, and gold standards, and we also require result files to be HDF5 files. HDF5 files can contain tree-like organization and may include several kinds of data, working well among platforms.
| [2] | The pyarrow package provide support for parquet files in Python. Julia users can use the Parquet2.jl package. |