Compressed graph
A compact and highly-efficient representation of the graph dataset, suited for scale-up analysis on high-end machines with large amounts of memory. The graph is compressed in Boldi-Vigna representation, designed to be loaded by the WebGraph framework, specifically using our swh-graph library.
- Dataset size
- Unknown
- Export date
- Teaser dataset
- GitLab 100k compressed graph
- GitLab all compressed graph
- S3 URL
- s3://softwareheritage/graph/2020-12-15/compressed/
- SWH Annex URL
- https://annex.softwareheritage.org/public/dataset/graph/2020-12-15/compressed/
- Deprecated
- False
Download the dataset
The HTTP links point to directories listing all available files. For Amazon S3 links, you'll need to install either awscli or swh.datasets.
aws s3 cp --recursive --no-sign-request s3://softwareheritage/graph/2020-12-15/compressed/ 2020-12-15-compressed
# ORswh datasets download-graph 2020-12-15
wget --recursive --no-parent --reject "index.html*" https://annex.softwareheritage.org/public/dataset/graph/2020-12-15/compressed/