debian-graph
debian-graph copied to clipboard
Representing Debian UDD in a graph database
Debian UDD into Graph Database
The scripts in this repository pull (some) data from the Ultimate Debian Database and convert them to a graph database, in particular Neo4j.
A detailed description of the process is available either on my blog (Part 1, Part 2, Part 3), or debian-package-neo4j.md.
The scripts here are:
-
pull-udd.pl
queries the UDD and downloads the two tables for packages and sources. Needs DBI::PG Perl module. This scripts needs quite some time, as the server is not fast. Please be patient. -
generate-graph
is a Perl script that reads the two csv files generated frompull-udd.pl
and generates csv files ready to be imported into Neo4j -
sort-uniq.sh
ensures that duplicate lines are removed from the csvs -
build-db
assembles the proper command line forneo4j-import
-
build-all
glues everything together
After the scripts has run, the files generated are in debian-neo4j
and can be used as Neo4j database files.
Comments and improvements are always welcome.
Copyright
Copyright 2017-2018 Norbert Preining
License: GPL3+