Giskard CI/CD runner (WIP)

Overview

The idea is to have a common CI/CD core that can interface with different input sources (loaders) and output destinations (reporters).

The core is responsible for running the tests and generating a report.

The loaders are responsible for loading the model and dataset, wrapped as Giskard objects, from a given source (for example the HuggingFace hub, a Github repository, etc.).

The reporters are responsible for sending the report to the appropriate destination (e.g. a comment to a Github PR, a HuggingFace discussion, etc.).

Tasks

Task could be data objects containing all the information needed to run a CI/CD pipeline. For example:

{
    "loader_id": "huggingface",
    "model": "distilbert-base-uncased",
    "dataset": "sst2",
    "loader_args": {
        "dataset_split": "validation",
    },
    "reporter_id": "huggingface_discussion",
    "reporter_args": {
        "discussion_id": 1234,
    }
}

or

{
    "loader_id": "github",
    "model": "my.package::load_model",
    "dataset": "my.package::load_test_dataset",
    "loader_args": {
        "repository": "My-Organization/my_project",
        "branch": "dev-test2",
    },
    "reporter_id": "github_pr",
    "reported_args": {
        "repository": "My-Organization/my_project",
        "pr_id": 1234,
    }
}

These tasks may be generated by a watcher (e.g. a Github action, a HuggingFace webhook, etc.) and put in a queue. The CI/CD runner will then pick them up and run the pipeline.

Otherwise, a single task can be created to run a single-shot Github action, without queueing.

CI/CD Core

In pseudocode, the CI/CD core could look like this:

task = get_task_from_queue_or_envirnoment()

loader = get_loader(task.loader_id)
gsk_model, gsk_dataset = loader.load_model_dataset(
    task.model,
    task.dataset,
    **task.loader_args,
)

runner = PipelineRunner()
report = runner.run(gsk_model, gsk_dataset)

reporter = get_reporter(task.reporter_id)
reporter.push_report(report, **task.reporter_args)

Prototype

Current implementation has two loaders:

The github loader which can be run from the command line (after running python train.py in examples/github):

$ python cli.py --loader github --model examples/github/artifacts/model --dataset examples/github/artifacts/dataset

The huggingface loader which can be run from the command line:

$ python cli.py --loader huggingface --model distilbert-base-uncased-finetuned-sst-2-english --dataset_split validation --output demo_report.html

Automatically post to discussion area for a given repo

$ python cli.py --loader huggingface --model distilbert-base-uncased-finetuned-sst-2-english --dataset_split validation --output_format markdown --output_portal huggingface --discussion_repo [REPO_ID] --hf_token [HF_TOKEN]

$   python cli.py --loader huggingface --model distilbert-base-uncased-finetuned-sst-2-english --dataset_split validation --scan_config [Path to scan_config.yaml] --hf_token [Huggingface Token]

Manually input label and feature mapping

Label Mapping: map the dataset labels to model label ids. Use the labels2id or id2labels in model card to help you if needed. This should be idx to key. Example:
```
--label_mapping '{"0":"negative","1":"positive"}'
```
Feature Mapping: map the feature labels directly from key to key.
```
--feature-mapping '{"text": "sentence"}'
```

This will launch a pipeline that will load the model and dataset from the HuggingFace hub, run the scan and generate a report in HTML format (for now).

Name	Name	Last commit message	Last commit date
Latest commit andreybavt Merge pull request #56 from Giskard-AI/feature/gsk-3373-fix-empty-bas… Apr 19, 2024 de00d67 · Apr 19, 2024 History 196 Commits
.github/workflows	.github/workflows	Added credentials in order for pip to access cicd private repo	Sep 8, 2023
examples/github	examples/github	cleanup	Sep 13, 2023
giskard_cicd	giskard_cicd	Merge pull request #56 from Giskard-AI/feature/gsk-3373-fix-empty-bas…	Apr 19, 2024
.gitignore	.gitignore	Add github Python gitignore template	Dec 14, 2023
.models_and_datasets_to_be_skipped.csv	.models_and_datasets_to_be_skipped.csv	added functionality to skip models already scanned or errored	Sep 13, 2023
.pre-commit-config.yaml	.pre-commit-config.yaml	Add precommit hook	Dec 11, 2023
LICENSE	LICENSE	Create LICENSE	Dec 4, 2023
cli.py	cli.py	Re-organize project structure to export cli	Jan 2, 2024
pyproject.toml	pyproject.toml	[GSK-2665] Add possibility to view and download report artifacts (#50 )	Mar 1, 2024
readme.md	readme.md	update readme with correct mapping examples	Feb 19, 2024
retriever.py	retriever.py	enhanced retriever.py	Sep 13, 2023
scan_config_template.yaml	scan_config_template.yaml	added possibility to pass scan_config as yaml	Sep 13, 2023
scan_retrieved.py	scan_retrieved.py	updated error logging	Sep 13, 2023
setup.cfg	setup.cfg	poc of gh loader	Aug 10, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Giskard CI/CD runner (WIP)

Overview

Tasks

CI/CD Core

Prototype

About

Releases

Packages

Contributors 7

Languages

License

Giskard-AI/cicd

Folders and files

Latest commit

History

Repository files navigation

Giskard CI/CD runner (WIP)

Overview

Tasks

CI/CD Core

Prototype

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 7

Languages

Packages