Create a base for managing groups of features. #357

lionelkusch · 2025-08-27T16:19:36Z

I created an Abstract class for the management of the group of features.

I update the name of groups by features_groups.

codecov · 2025-08-27T18:05:40Z

Codecov Report

❌ Patch coverage is 99.00000% with 1 line in your changes missing coverage. Please review.
✅ Project coverage is 97.95%. Comparing base (5292877) to head (0463a83).
⚠️ Report is 1 commits behind head on main.

Files with missing lines	Patch %	Lines
src/hidimstat/base_variable_importance.py	98.07%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #357      +/-   ##
==========================================
- Coverage   98.00%   97.95%   -0.05%     
==========================================
  Files          22       22              
  Lines        1200     1223      +23     
==========================================
+ Hits         1176     1198      +22     
- Misses         24       25       +1

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

bthirion

The PR is mostly OK, thx.

examples/plot_importance_classification_iris.py

src/hidimstat/base_perturbation.py

src/hidimstat/base_variable_importance.py

jpaillard

I have a high-level comment, related to #377
Groups could also be dealt with in a pipeline fashion.

It would give a pipeline GroupFeatures --> FeatureImportance, where GroupFeatures could either be pre-defined or computed in the fit with clustering. That would also solve #377 since the groups could be defined at the instantiation of any FeatureImportance method.

This approach would have similarities with the ColumnTransformer in sklearn.
Let me know what you think.

bthirion · 2025-09-09T07:45:06Z

Sorry, I'm missing what benefit you expect from it.
My main fear is that we are over-engineering the FeatureGroup object, making it hard to understand. But I may be off. I think that I'd like what you want to achieve in an example: what would it simplify to do it ?

lionelkusch · 2025-09-09T14:17:08Z

I have a high-level comment, related to #377 Groups could also be dealt with in a pipeline fashion.

It would give a pipeline GroupFeatures --> FeatureImportance, where GroupFeatures could either be pre-defined or computed in the fit with clustering. That would also solve #377 since the groups could be defined at the instantiation of any FeatureImportance method.

This approach would have similarities with the ColumnTransformer in sklearn. Let me know what you think.

I don't see what you mean.
The question is if the group is defined at the instantiation of the class or when the method fit is called.
This has a different impact on the implementation.
However, I am afraid but it won't simplify the complexities.

lionelkusch · 2025-09-09T14:17:58Z

The addition of groups is equivalent to add a new dimension of the method, this is normal that the complexity will increase due to this.
Nevertheless, the groups are important for Perturbation methods. I just decouple the BasePerturbation and GroupVariableImportance because I think that there will be in the future a desire to have all the possible method using groups.
However, I don't know what is the best moment for including this functionality in the library.

bthirion · 2025-09-09T20:30:43Z

We should not add a feature that is not motivated by a use case. We want to add a class to handle groups. Would it make any example significantly simpler ? What is the actual benefit ?

lionelkusch · 2025-09-10T07:44:21Z

We should not add a feature that is not motivated by a use case. We want to add a class to handle groups. Would it make any example significantly simpler ? What is the actual benefit ?

The group is used for LOCI (see PR #358) which is not inherited from BasePerturbation because BasePerturbation is the logic of marginal importance and feature importance is different.
The functionality of the group is the same for LOCI, PFI, LOCO and CFI. I don't see arguments for having the functionality of groups only for the method of BasePerturbation. From what I see in the domain of explainable AI, grouping features together is very common. I think that it's quite an easy bet that users will want this functionality as a future requirement for the library. In this case, it's better to adapt the architecture now than the latter.

bthirion · 2025-09-10T19:53:13Z

I think that there is a full agreement regarding the need to support feature groups. The question is whether adding a GroupFeatureImportance class is the right way to proceed. Two alternatives come to my mind:

First, natively support features groups in the BaseVariableImportance methods. feature_groups would be an optional parameter, if None, it means that each feature is its own group.
Maybe use a VariablesGroup mixin (not sure it would work).
Can someone explain what is the advantage of the proposed implementation ?

lionelkusch · 2025-09-11T09:08:26Z

I think that there is a full agreement regarding the need to support feature groups. The question is whether adding a GroupFeatureImportance class is the right way to proceed. Two alternatives come to my mind:
* First, natively support features groups in the BaseVariableImportance methods. feature_groups would be an optional parameter, if None, it means that each feature is its own group.

* Maybe use a VariablesGroup mixin (not sure it would work).
  Can someone explain what is the advantage of the proposed implementation ?

What do you mean by VariablesGroup mixing?

If I am correct, this was the spirit when I was coding this class.

For native support, I am a bit against because this means that most of the methods should support feature groups, which is not the case, actually.

bthirion · 2025-09-11T19:41:58Z

Mixin, not Mixing.
https://stackoverflow.com/questions/533631/what-is-a-mixin-and-why-is-it-useful

lionelkusch · 2025-10-02T14:49:27Z

@jpaillard @bthirion Last review?

bthirion · 2025-10-02T21:33:00Z

Good with me.

jpaillard

I have a few minor comments, but otherwise it looks good.

src/hidimstat/base_variable_importance.py

src/hidimstat/leave_one_covariate_out.py

Co-authored-by: Joseph Paillard <joseph.paillard@inria.fr>

bthirion

LGTM, thx.

* first commit * add section structure * [doc quick] [skip tests] skip * missing .rst? [doc quick] [skip tests] * fix link [doc quick] [skip tests] * add CFI fiirst draft and TSI [doc quick] [skip tests] * missing space? [doc quick] [skip tests] * replace space * [doc quick] [skip tests] * add ref and note section [doc quick] [skip tests] * add code snippets * typo cfi [doc quick] [skip tests] * add total sobol index ref [doc quick] [skip tests] * add copy button [doc quick] [skip tests] * missing sphinx requirements [quick doc] [skip tests] * add copybutton config * [doc quick] [skip tests] * solve example test * clarify "sub-model" for classif and regression * trry add figure + update note * add intro * try fix image path * trigger CI * try not to scale * definition * try image * [skip tests] trigger CI * [tests skip] another one * trigger CI * back to figure * add inference section * add reff * skip tests * [skip tests] * Update docs/src/model_agnostic_methods/conditional_feature_importance.rst Co-authored-by: lionel kusch <lionel.kusch@grenoble-inp.org> * [skip tests] format bullet * rephrase not * [skip tests] * [skip tests] * [skip tests] linkcheck generated ignore images * [skip tests] linkcheck generated ignore * review * trigger CI * Update docs/src/model_agnostic_methods/conditional_feature_importance.rst Co-authored-by: Ángel Reyero Lobo <angelreyerolobo@gmail.com> * Update docs/src/model_agnostic_methods/conditional_feature_importance.rst Co-authored-by: Ángel Reyero Lobo <angelreyerolobo@gmail.com> * Update docs/src/model_agnostic_methods/conditional_feature_importance.rst Co-authored-by: Ángel Reyero Lobo <angelreyerolobo@gmail.com> * Update docs/src/model_agnostic_methods/conditional_feature_importance.rst Co-authored-by: Ángel Reyero Lobo <angelreyerolobo@gmail.com> * genetic example * Update docs/src/model_agnostic_methods/conditional_feature_importance.rst Co-authored-by: Ángel Reyero Lobo <angelreyerolobo@gmail.com> * Update docs/src/model_agnostic_methods/conditional_feature_importance.rst Co-authored-by: Ángel Reyero Lobo <angelreyerolobo@gmail.com> * Update docs/src/model_agnostic_methods/conditional_feature_importance.rst Co-authored-by: Ángel Reyero Lobo <angelreyerolobo@gmail.com> * Update docs/src/model_agnostic_methods/total_sobol_index.rst Co-authored-by: Ángel Reyero Lobo <angelreyerolobo@gmail.com> * Update docs/src/model_agnostic_methods/conditional_feature_importance.rst Co-authored-by: bthirion <bertrand.thirion@inria.fr> * Update docs/src/model_agnostic_methods/conditional_feature_importance.rst Co-authored-by: bthirion <bertrand.thirion@inria.fr> * Update docs/src/model_agnostic_methods/total_sobol_index.rst Co-authored-by: Ángel Reyero Lobo <angelreyerolobo@gmail.com> --------- Co-authored-by: jpaillard <joseph.paillard@inria.fr> Co-authored-by: lionel kusch <lionel.kusch@grenoble-inp.org> Co-authored-by: Ángel Reyero Lobo <angelreyerolobo@gmail.com> Co-authored-by: bthirion <bertrand.thirion@inria.fr>

[skip tests]

lionelkusch added 3 commits August 27, 2025 16:52

separate the management of the group from base permutation

f3a5e0a

fix some error

508ce53

fix tests

08565c0

lionelkusch linked an issue Aug 27, 2025 that may be closed by this pull request

API contract for groups/feature_groups #337

Open

lionelkusch removed a link to an issue Aug 27, 2025

API contract for groups/feature_groups #337

Open

lionelkusch linked an issue Aug 27, 2025 that may be closed by this pull request

Find a name for "group" of feature #334

Closed

lionelkusch mentioned this pull request Aug 27, 2025

[FEAT] Add Leave One and Covarate In #358

Draft

lionelkusch added 2 commits August 27, 2025 19:48

fix examples

bb969c4

fix declaration in the API

3c290d3

Merge branch 'main' into PR_create_group_base

4881d1a

lionelkusch mentioned this pull request Sep 4, 2025

[API 2]: CFI, PFI, LOCO #372

Merged

Merge branch 'main' into PR_create_group_base

cdc963c

lionelkusch requested review from bthirion and jpaillard September 8, 2025 15:39

bthirion mentioned this pull request Sep 9, 2025

Simplify the examples #348

Open

bthirion reviewed Sep 9, 2025

View reviewed changes

examples/plot_importance_classification_iris.py Show resolved Hide resolved

src/hidimstat/base_perturbation.py Show resolved Hide resolved

src/hidimstat/base_variable_importance.py Outdated Show resolved Hide resolved

jpaillard reviewed Sep 9, 2025

View reviewed changes

lionelkusch added the API 2 Refactoring following the second version of API label Sep 9, 2025

lionelkusch added 3 commits September 10, 2025 09:46

change name of groups

71bb5e7

update docstring

7dae4f8

fix doc

a3b2da8

lionelkusch added 2 commits October 1, 2025 11:55

move check_fetaure type in CFI

4134bde

Merge branch 'main' into PR_create_group_base

b42eb16

lionelkusch requested review from bthirion and jpaillard October 1, 2025 12:16

lionelkusch mentioned this pull request Oct 2, 2025

Remove feature_group_key in base_perturbation #472

Open

Merge branch 'main' into PR_create_group_base

d7c2b55

jpaillard approved these changes Oct 3, 2025

View reviewed changes

lionelkusch and others added 11 commits October 3, 2025 11:35

Update src/hidimstat/leave_one_covariate_out.py

639247b

Co-authored-by: Joseph Paillard <joseph.paillard@inria.fr>

Update src/hidimstat/leave_one_covariate_out.py

3dded6f

Co-authored-by: Joseph Paillard <joseph.paillard@inria.fr>

Update src/hidimstat/leave_one_covariate_out.py

956b177

Co-authored-by: Joseph Paillard <joseph.paillard@inria.fr>

Update src/hidimstat/leave_one_covariate_out.py

b29c48a

Co-authored-by: Joseph Paillard <joseph.paillard@inria.fr>

Update src/hidimstat/leave_one_covariate_out.py

1aa36f8

Co-authored-by: Joseph Paillard <joseph.paillard@inria.fr>

Update src/hidimstat/leave_one_covariate_out.py

7d79d87

Co-authored-by: Joseph Paillard <joseph.paillard@inria.fr>

add option for parameters

269fba1

Merge branch 'main' into PR_create_group_base

dcf3d5f

homogeneis name for features

5c21a11

update

77e00d7

fix minimal version for test

93285bc

lionelkusch requested a review from jpaillard October 3, 2025 12:47

jpaillard approved these changes Oct 3, 2025

View reviewed changes

bthirion approved these changes Oct 3, 2025

View reviewed changes

jpaillard and others added 5 commits October 5, 2025 14:18

Merge branch 'main' into PR_create_group_base

0463a83

add an exception

40c2fd4

Merge branch 'main' into PR_create_group_base

cdd28e1

Add a black line at this of the file of documentation

a483f06

[skip tests]

lionelkusch merged commit ae16933 into mind-inria:main Oct 9, 2025
22 checks passed

lionelkusch deleted the PR_create_group_base branch October 9, 2025 13:04

Create a base for managing groups of features. #357

Create a base for managing groups of features. #357

Uh oh!

Conversation

lionelkusch commented Aug 27, 2025

Uh oh!

codecov bot commented Aug 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

bthirion left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jpaillard left a comment

Choose a reason for hiding this comment

Uh oh!

bthirion commented Sep 9, 2025

Uh oh!

lionelkusch commented Sep 9, 2025

Uh oh!

lionelkusch commented Sep 9, 2025

Uh oh!

bthirion commented Sep 9, 2025

Uh oh!

lionelkusch commented Sep 10, 2025

Uh oh!

bthirion commented Sep 10, 2025

Uh oh!

lionelkusch commented Sep 11, 2025

Uh oh!

bthirion commented Sep 11, 2025

Uh oh!

lionelkusch commented Oct 2, 2025

Uh oh!

bthirion commented Oct 2, 2025

Uh oh!

jpaillard left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

bthirion left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

codecov bot commented Aug 27, 2025 •

edited

Loading