GitLab 100k compressed graph
A compact and highly-efficient representation of the graph dataset, suited for scale-up analysis on high-end machines with large amounts of memory. The graph is compressed in Boldi-Vigna representation, designed to be loaded by the WebGraph framework, specifically using our swh-graph library.
- Comments
-
A teaser dataset containing the 100k most popular GitLab.com repositories
- Dataset size
- Unknown
- Export date
- Teaser of
- Compressed graph [2020-12-15]
- SWH Annex URL
- https://annex.softwareheritage.org/public/dataset/graph/2020-12-15-gitlab-100k/compressed/
- Deprecated
- False
Download the dataset
The HTTP links point to directories listing all available files.
wget --recursive --no-parent --reject "index.html*" https://annex.softwareheritage.org/public/dataset/graph/2020-12-15-gitlab-100k/compressed/