Interface to `<: Supervised` MLJ models #69

pat-alt · 2022-10-13T08:34:22Z

This is an issue reserved for the TU Delft Student Software Project '23

MLJ is a popular machine learning framework for Julia. It ships with a large suite of common machine learning models, both supervised and unsupervised. It seems natural to interface this package to MLJ, although currently differentiability is a major challenge: to be able to use any of our counterfactual generators to explain MLJ models, those models need to be differentiable with respect to features. Still, this is worth exploring.

I propose the following steps:

Implement basic interface to MLJ (essentially have an AbstractFittedModel for MLJ.Supervised)
From the MLJ model list, identify which ML models fulfil the differentiability criterium. Note that some models, like decision trees, may be differentiable after probability calibration. See below for a potential starting point. Start by focusing on pure Julia models, before dealing with non-native models (like sklearn).
Ideally, I think we would like a single MLJModel<:AbstractFittedModel class that can handle all (compatible) supervised MLJ models. To this end, we will need a mechanism to differentiate between compatible and incompatible models.
Thoroughly test and document your contributions.

This is a challenging task and it is not critical that you succeed at everything. But we would like to aim for the following minimum achievements:

Add the basic interface (point 1)
Document your process and findings regarding point 2
If a complete interface turns out to be too challenging, work on a proof-of-concept at least for one particular MLJ model, ideally Evotrees (points 3 and 4)

Previous attempts

I have tried this in the past, which might or might not be a good starting point:

At this point all of the counterfactual generators need gradient-access and currently leverage Zygote.jl for auto-diff. Not sure if all MLJ models can just be "auto-diffed" in that sense, but some early experiments with EvoTrees has shown that in principal gradient-based counterfactual generators should be applicable (see here).
That being said, Zygote.jl didn't work in this case and I had to rely on ForwardDiff (see here). The problem with trees is that the counterfactual loss function is not smooth and hence taking gradients just resulted in gradients with all elements equal to zero (at least I think the non-smoothness was the issue here). Would still be preferable to use Zygote if possible.
(Non-)Differentiability of models may be a more general issue.

The text was updated successfully, but these errors were encountered:

pat-alt · 2023-04-03T06:49:53Z

MLJFlux is probably the most obvious place to start for this (see related discussion here)

pat-alt · 2024-05-19T06:04:21Z

This is in principle now implemented (#450), but by default MLJ models are assumed to be non-differentiable (the MLJBase.predict call and other functions don't play nicely with Zygote)

pat-alt added enhancement New feature or request help wanted Extra attention is needed labels Oct 13, 2022

pat-alt added the difficult This is expected to be difficult. label Nov 29, 2022

pat-alt self-assigned this Nov 29, 2022

This was referenced Dec 21, 2022

Dealing with count data #83

Closed

For classification, let y be a categorical vector #84

Closed

This was referenced Mar 20, 2023

interface to MLJ.jl #131

Closed

🚀 Beyond Deep Learning #130

Closed

pat-alt added the students 🎯 label Mar 20, 2023

pat-alt mentioned this issue Mar 31, 2023

Plans for autodiff on fitted models? FluxML/MLJFlux.jl#220

Open

RaunoArike mentioned this issue Jun 9, 2023

Add counterfactual generator support for EvoTrees #225

Closed

5 tasks

pat-alt removed the students 🎯 label Nov 8, 2023

pat-alt mentioned this issue Apr 18, 2024

[Interface to MLJ] Implement a single class to handle all MLJ models #178

Closed

pat-alt closed this as completed May 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Interface to `<: Supervised` MLJ models #69

Interface to `<: Supervised` MLJ models #69

pat-alt commented Oct 13, 2022 •

edited

Loading

pat-alt commented Apr 3, 2023

pat-alt commented May 19, 2024

Interface to <: Supervised MLJ models #69

Interface to <: Supervised MLJ models #69

Comments

pat-alt commented Oct 13, 2022 • edited Loading

Previous attempts

pat-alt commented Apr 3, 2023

pat-alt commented May 19, 2024

Interface to `<: Supervised` MLJ models #69

Interface to `<: Supervised` MLJ models #69

pat-alt commented Oct 13, 2022 •

edited

Loading