
Conversation

@toncho11 (Collaborator) commented Nov 10, 2025

A rewritten _inc_exc_datasets() that fixes several issues and adds much-needed checks. Previously, the input could fail to be recognized correctly as a string or a dataset object, and was then processed incorrectly. Fixes: #654. It also removes some of the confusion caused by the old version.
Might help with: #659
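For illustration, the kind of validation described above can be sketched as follows. This is a hypothetical standalone sketch, not MOABB's actual implementation: the function name `check_inc_exc` and its parameters are made up here, and it only mirrors the checks this PR discusses (rejecting simultaneous include/exclude, mixed string/object lists, and unknown dataset codes).

```python
def check_inc_exc(include_datasets=None, exclude_datasets=None, valid_codes=()):
    """Validate benchmark dataset filters (illustrative sketch only).

    Rejects: both arguments given at once, lists mixing strings and
    dataset objects, and string codes not present in valid_codes.
    """
    if include_datasets is not None and exclude_datasets is not None:
        raise ValueError("Cannot specify both include_datasets and exclude_datasets.")
    datasets = include_datasets if include_datasets is not None else exclude_datasets
    if datasets is None:
        return []  # nothing to filter; the benchmark uses all datasets

    all_str = all(isinstance(d, str) for d in datasets)
    all_obj = all(not isinstance(d, str) for d in datasets)
    if not (all_str or all_obj):
        # Mixing "BNCI2014-001" with Zhou2016() in one list is ambiguous
        raise ValueError("Do not mix dataset code strings and dataset objects.")

    if all_str:
        bad = [d for d in datasets if d not in valid_codes]
        if bad:
            raise ValueError(f"Invalid dataset codes: {bad}")
    return list(datasets)
```

With a list of valid codes, valid string input passes through, while mixed or unknown input raises early instead of silently filtering nothing.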

Below is the code I used for testing:

import os

from moabb import benchmark, set_download_dir, set_log_level
from moabb.pipelines.classification import SSVEP_CCA  # used by the commented-out pipeline below
from pyriemann.classification import MDM
from pyriemann.estimation import Covariances
from sklearn.pipeline import make_pipeline

# P300 databases
from moabb.datasets import (
    BI2013a,
    BNCI2014_008,
    BNCI2014_009,
    BNCI2015_003,
    EPFLP300,
    Lee2019_ERP,
    BI2014a,
    BI2014b,
    BI2015a,
    BI2015b,
)

# Motor imagery databases
from moabb.datasets import (
    BNCI2014_001,
    Zhou2016,
    BNCI2015_001,
    BNCI2014_002,
    BNCI2014_004,
    # BNCI2015_004,  # not tested
    AlexMI,
    Weibo2014,
    Cho2017,
    GrosseWentrup2009,
    PhysionetMI,
    Shin2017A,
    Lee2019_MI,  # new
    Schirrmeister2017,  # new
)

pipelines = [
    # {
    #     "name": "SSVEP_CCA",
    #     "pipeline": SSVEP_CCA(
    #         n_harmonics=3,
    #         interval=[1, 3],
    #         freqs={"13": 0, "17": 1},
    #     ),
    #     "paradigms": ["SSVEP"],
    # },
    {
        "name": "MDM",
        "pipeline": make_pipeline(
            Covariances(estimator="oas"),   # Estimate covariance matrices
            MDM(metric="riemann")           # Riemannian Minimum Distance to Mean classifier
        ),
        "paradigms": ["LeftRightImagery", "MotorImagery", "P300"],
        #"paradigms": ["P300"],  
        #"paradigms": ["SSVEP"],
    }
]

results = benchmark(
    pipelines=pipelines,
    evaluations=["WithinSession"],
    # include_datasets=["Kalunga2016", "Nakanishi2015"],  # should fail
    # include_datasets=["Nakanishi2015"],  # should fail
    # include_datasets=["fsdfsdfsdfs"],
    # exclude_datasets=[EPFLP300()],  # should be OK with 2 warnings
    # include_datasets=[Lee2019_ERP(), BI2015b()],  # should be OK, with 2 warnings
    # exclude_datasets=["Stieger2021", "fsdfsdfs"],  # gives warning
    # exclude_datasets=["Stieger2021", "Liu2024"],  # must be OK
    # include_datasets=["BNCI2014-001"],  # must be OK
    # exclude_datasets=[Zhou2016(), Weibo2014()],  # must be OK
    # include_datasets=[Zhou2016(), Weibo2014()],  # must be OK
    # include_datasets=[PhysionetMI(), Shin2017A(), Lee2019_MI()],  # must be OK
    # include_datasets=[PhysionetMI(), Shin2017A(), "BNCI2014-001"],  # should fail
    # exclude_datasets=[PhysionetMI(), Shin2017A(), "BNCI2014-001"],  # should fail
    # include_datasets=[Lee2019_ERP(), "fsdfsdfdwwww"],  # should fail
    # include_datasets=["fsdfsdfdwwww", "dasdasd"],  # should fail
    # exclude_datasets=None, include_datasets=None,  # should be OK

    # include_datasets=["Kalunga2016"],
    results="./results/",
    overwrite=True,
    plot=True,
    n_jobs=1,  # keep low to limit memory use; 4 is a good value if enough RAM is available
    output="./benchmark/",
)

print("Results:")
print(results.to_string())

print("Averaging the session performance:")
print(results.groupby("pipeline").mean("score")[["score", "time"]])

# save results
save_path = os.path.join(
    os.path.dirname(os.path.realpath(__file__)), "results_dataframe_test_SSVEP.csv"
)
results.to_csv(save_path, index=True)

print(results.groupby(["dataset", "pipeline"]).mean("score")[["score", "time"]].to_string())

@bruAristimunha (Collaborator)

Hey @toncho11,

Can you fix the tests, please:

FAILED moabb/tests/test_benchmark.py::TestBenchmark::test_benchmark_strdataset - ValueError: Invalid dataset codes in include_datasets: ['FakeDataset-p300-10-2--60-60--120-120--target-nontarget--c3-cz-c4', 'FakeDataset-ssvep-10-2--60-60--120-120--13-15--c3-cz-c4', 'FakeDataset-cvep-10-2--60-60--120-120--10-00--c3-cz-c4']
FAILED moabb/tests/test_benchmark.py::TestBenchmark::test_benchmark_objdataset - ValueError: Some datasets in include_datasets are not part of available datasets for the paradigms you requested in benchmark(): ['FakeDataset-p300-10-2--60-60--120-120--target-nontarget--c3-cz-c4', 'FakeDataset-ssvep-10-2--60-60--120-120--13-15--c3-cz-c4', 'FakeDataset-cvep-10-2--60-60--120-120--10-00--c3-cz-c4']
FAILED moabb/tests/test_benchmark.py::TestBenchmark::test_include_exclude - ValueError: Cannot specify both include_datasets and exclude_datasets.
===== 3 failed, 301 passed, 90 skipped, 207 war

@toncho11 toncho11 marked this pull request as draft November 13, 2025 10:33
@toncho11 (Collaborator, Author)
I need some more time on the code.

@gcattan (Collaborator) left a comment
Thanks @toncho11!
I think the issue is that you cannot rely on real datasets in CI/CD.

Your code remains unchanged. Let me know if you are ok with the edits.

@bruAristimunha (Collaborator)

Exactly, we don't have a compute instance to send jobs to, the way MNE and scikit-learn do with an Azure cluster. It's still too expensive for us.

@toncho11 toncho11 marked this pull request as ready for review November 18, 2025 16:27
@toncho11 (Collaborator, Author) commented Nov 18, 2025

Ready for merge. I added a few adjustments and improvements.
With this code there are many checks, so many future problems should be avoided.
With this PR and the latest MOABB code, #659 seems to be fixed.

@gcattan gcattan merged commit 9cabb89 into NeuroTechX:develop Nov 18, 2025
18 checks passed
@gcattan (Collaborator) commented Nov 18, 2025

Thank you for this nice contribution @toncho11 !



Successfully merging this pull request may close these issues.

Problem with exclude datasets in benchmark
