Dataset of the complete texts of free/open source software (FOSS) license variants.
Referencing Software Heritage
If you use any of the datasets indexed on this website for research purposes, please acknowledge Software Heritage as recommended on the publications page, which means doing the following two things:
Add a footnote on the title page of your paper, formatted as: “This work was made possible by Software Heritage, the universal source code archive: https://www.softwareheritage.org”
If you use this dataset for research purposes, please cite one of the following papers:
Stefano Zacchiroli. A Large-scale Dataset of (Open Source) License Text Variants. In Proceedings of MSR 2022, co-located with ICSE 2022. (Paper, BibTeX);
or Jesús M. González-Barahona, Sergio Raúl Montes León, Gregorio Robles, Stefano Zacchiroli. The Software Heritage License Dataset (2022 edition). In Empirical Software Engineering 28(6): 147 (2023). (Paper, BibTeX).
Download
The HTTP links point to directories listing all available files.