Popular 500 python compressed graph
A compact and highly-efficient representation of the graph dataset, suited for scale-up analysis on high-end machines with large amounts of memory. The graph is compressed in Boldi-Vigna representation, designed to be loaded by the WebGraph framework, specifically using our swh-graph library.
- Comments
-
This teaser contains a subset of the 443 repositories archived by Software Heritage as of 2024-08-23, among the 700 GitHub repositories tagged as being written in Python with the most stars.
- Dataset size
- 15 GB
- Export date
- Teaser of
- Compressed graph [2024-08-23]
- S3 URL
- s3://softwareheritage/graph/2024-08-23-popular-500-python/compressed/
- Deprecated
- False
Download the dataset
For Amazon S3 links, you'll need to install either awscli or swh.datasets.
aws s3 cp --recursive --no-sign-request s3://softwareheritage/graph/2024-08-23-popular-500-python/compressed/ 2024-08-23-popular-500-python-compressed
# ORswh datasets download-graph 2024-08-23-popular-500-python