This is a repository of code and data for exploring University of Wisconsin GitHub and GitLab repositories.

UW-Madison-DSI/UW-Open-Source-Exploration

UW Open Source Exploration

This project is a means of exploring open source activity associated with the University of Wisconsin-Madison.

The project contains scripts that download information about open source projects and people from GitHub and GitLab and store it in a database for further analysis. It also includes a REST API for retrieving this information.
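As a rough illustration of the download step, the sketch below queries GitHub's public repository search endpoint for a search term. It is not the project's own script: the function names `build_search_url` and `fetch_page` are hypothetical, and the page size and the optional token argument are assumptions made for the example.

```python
import json
import urllib.parse
import urllib.request

GITHUB_SEARCH_URL = "https://api.github.com/search/repositories"


def build_search_url(term, page=1, per_page=100):
    """Return the repository search URL for one page of results."""
    query = urllib.parse.urlencode({"q": term, "per_page": per_page, "page": page})
    return f"{GITHUB_SEARCH_URL}?{query}"


def fetch_page(term, page=1, token=None):
    """Fetch one page of search results and return the list of repositories."""
    request = urllib.request.Request(build_search_url(term, page))
    request.add_header("Accept", "application/vnd.github+json")
    if token:  # unauthenticated requests work but have a much lower rate limit
        request.add_header("Authorization", f"Bearer {token}")
    with urllib.request.urlopen(request) as response:
        return json.load(response)["items"]
```

Calling `fetch_page("Wisconsin")` would return up to 100 repository records as dictionaries, with fields such as `full_name`, `description`, `homepage`, and `license`, which could then be written to the database.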

Data

Results data in a variety of formats are contained in the following directories:

data/
│
├── github/
│   ├── csv/
│   ├── json/
│   └── sql/
│ 
└── gitlab/
    ├── csv/
    ├── json/
    └── sql/
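To see what each directory holds, a short script can walk the layout shown above. This is a minimal sketch that assumes it is run from the repository root; the function name `list_result_files` is hypothetical, and no assumption is made about the names of the files inside each directory.

```python
from pathlib import Path

SOURCES = ("github", "gitlab")
FORMATS = ("csv", "json", "sql")


def list_result_files(data_root="data"):
    """Map (source, format) to the sorted list of files found in that directory."""
    root = Path(data_root)
    results = {}
    for source in SOURCES:
        for fmt in FORMATS:
            directory = root / source / fmt
            # Missing directories yield an empty list rather than an error.
            if directory.is_dir():
                files = sorted(p for p in directory.iterdir() if p.is_file())
            else:
                files = []
            results[(source, fmt)] = files
    return results
```

For example, `list_result_files()[("github", "csv")]` gives the CSV result files collected from GitHub.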

Below are some sample findings from GitHub about repositories related to "Wisconsin":

| Description | Count | Percent |
| --- | --- | --- |
| All repositories | 3028 | 100% |
| Repositories that are not part of the Wisconsin Breast Cancer dataset or CS classes | 1748 | 58% |

Repositories containing essential components

| Component | Count | Percent |
| --- | --- | --- |
| Description | 2433 | 80% |
| README | 2185 | 72% |
| README Images | 256 | 8% |
| Homepage | 151 | 5% |
| License | 436 | 14% |
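The percentages in this table appear to be each count divided by the total of 3028 repositories, rounded to the nearest whole number. The sketch below reproduces them under that assumption; the function name `percent_of_total` is illustrative and is not taken from the project's code.

```python
TOTAL_REPOSITORIES = 3028

COMPONENT_COUNTS = {
    "Description": 2433,
    "README": 2185,
    "README Images": 256,
    "Homepage": 151,
    "License": 436,
}


def percent_of_total(count, total=TOTAL_REPOSITORIES):
    """Return count as a whole-number percentage of total."""
    return round(100 * count / total)


# Print one aligned row per component: name, count, percentage.
for component, count in COMPONENT_COUNTS.items():
    print(f"{component:<14} {count:>5} {percent_of_total(count):>3}%")
```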

Repositories containing multiple essential components

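| Description | README | README Images | License | Homepage | Count | Percent |
| --- | --- | --- | --- | --- | --- | --- |
|  |  |  |  |  | 1111 | 37% |
|  |  |  |  |  | 149 | 5% |
|  |  |  |  |  | 84 | 3% |
|  |  |  |  |  | 59 | 1.5% |
|  |  |  |  |  | 31 | 1% |
|  |  |  |  |  | 17 | 0.5% |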

Requirements

To run the code in this project, you will need the following:

  • A SQL database (MySQL, MariaDB, etc.)
  • Python 3 or PHP

Installation

Before running the scripts in this project, you will need to create a database to store the data as described here.

Configuration

Before running the scripts in this project, you will need to configure your code to use GitHub / GitLab access tokens as described here.
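The project's own configuration steps are in the linked instructions. As a general illustration of how such tokens are typically used, the sketch below reads them from environment variables and builds the request headers each service accepts. The variable names `GITHUB_TOKEN` and `GITLAB_TOKEN` and the function `build_auth_headers` are assumptions for this example, not names taken from the project.

```python
import os


def build_auth_headers(env=os.environ):
    """Return per-service HTTP headers for whichever tokens are set."""
    headers = {}
    github_token = env.get("GITHUB_TOKEN")  # hypothetical variable name
    gitlab_token = env.get("GITLAB_TOKEN")  # hypothetical variable name
    if github_token:
        # GitHub's REST API accepts a personal access token as a bearer token.
        headers["github"] = {"Authorization": f"Bearer {github_token}"}
    if gitlab_token:
        # GitLab's REST API accepts a personal access token in this header.
        headers["gitlab"] = {"PRIVATE-TOKEN": gitlab_token}
    return headers
```

Keeping tokens in the environment rather than in source files reduces the chance of committing them to a public repository by accident.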

Running

Once you have created a database and have configured the code with your access tokens, you are ready to run the data collection scripts as described here.

License

Distributed under the permissive MIT license. See the license for more information.

Acknowledgements

This software was created by the Data Science Institute at the University of Wisconsin-Madison.
