Merge YAML files describing software products harvested from various open data sources. Generate an open database of software (in YAML format).
Each set of YAML file is stored in a git repository and updated by a dedicated script. For instance the nuit-debout-yaml YAML files are generated by the nuit-debout-to-yaml script. The semantic of the YAML is specific to each source and no attempt is made to unify them.
Step 1 and step 2 must be applied both before commiting, in order to preserve history.
git clone https://git.framasoft.org/codegouv/merge-open-software-base-yaml.git
cd merge-open-software-base-yaml
pip install -r requirements.txt
Use
--user
option if relevant, or work in a virtual env.
./merge.py all ../ merged-yaml/
./generate_canonical.py merged-yaml/ ../open-software-base-yaml/
./canonical_yaml_to_csv.py ../open-software-base-yaml/ ./
The generated database is the Open Sofware Base (in YAML format).