Feature/generic ensemble#366

Open
skywardfire1 wants to merge 3 commits into smartcorelib:development from skywardfire1:feature/generic_ensemble

Conversation

@skywardfire1
Contributor

This PR implements:

The Generic Ensemble Subsystem, as I call it

"What and why, Mr. Anderson?"

It allows a user to build their own custom ensemble models. More than that, since members are stored as Box<dyn Predictor>, a user can even combine models of different kinds!
In my project I use only 18 kNNs, but I was mainly interested in creating a universal ensemble, so... here is my attempt.

And that brings us to two limitations:

  • The ensemble does not support cloning, at least for now.
  • NaiveBayes and SVC cannot be added to the ensemble at this point. The reasons differ, and I believe each can be worked out. Still, the whole thing doesn't look as bright as planned...

Alright, with that being said...

🔑 Key Features

  • 🔄 Allows creation of heterogeneous predictor ensembles: KNN, Random Forest, and Decision Tree are currently supported out of the box.
    (Almost) any type implementing Predictor<X, Y> can be a member.

  • ⚖️ Two voting strategies, uniform or weighted: simple majority or confidence-based aggregation.
    Switch strategies at runtime with set_voting_strategy(); weights are validated on insertion.

  • 🎛️ Dynamic enable/disable of members at runtime: toggle models without retraining.
    Useful for A/B testing, fallback logic, or excluding underperforming models on-the-fly. My own idea!

  • 🏷️ Metadata: descriptions, tags...: document and organize your ensemble.
    Attach human-readable notes and group models by tags. I have no idea whether anyone will use it, but implementing it was fun and easy.

  • ⚖️ Set weights at any time
    Adjust voting influence with set_weight()

  • ✂️ Feature slicing via predict_using_names(): different inputs per model.
    Train models on disjoint feature subsets and combine predictions — ideal for multi-view learning.
    Again, this was crucial in my project, which is why I added it to Smartcore, though I'm unsure how broadly useful it is.

  • 📊 Built-in scoring: quick accuracy evaluation with score().
    Equivalent to accuracy(y, predict(x)) — included mainly to be more sklearn-ish
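Since score() is described as equivalent to accuracy(y, predict(x)), it is just the fraction of positions where prediction and label match. A tiny std-only illustration of that metric (not the PR's actual code; names are illustrative):

```rust
/// Fraction of positions where the prediction matches the label —
/// the quantity score() computes from predict()'s output.
fn accuracy(y_true: &[i32], y_pred: &[i32]) -> f64 {
    assert_eq!(y_true.len(), y_pred.len(), "label/prediction length mismatch");
    let hits = y_true.iter().zip(y_pred).filter(|(a, b)| a == b).count();
    hits as f64 / y_true.len() as f64
}

fn main() {
    // 3 of 4 predictions match the labels
    let acc = accuracy(&[1, 0, 1, 1], &[1, 0, 0, 1]);
    println!("Accuracy: {:.4}", acc); // 0.7500
}
```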

Documentation

📦 Model Management

  • 🔄 Heterogeneous ensembles: Mix KNN, Random Forest, Decision Tree, or any custom model implementing Predictor<X, Y>.
    No common base class required — trait-based composition.

  • 🎯 Three ways to add models (3 public methods total for model management):

    | Method | Use Case | Auto-name | Custom Name | Weight | Description | Tags |
    |---|---|---|---|---|---|---|
    | add(model) | Quick start | ✅ | ❌ | ❌ (Uniform only) | ❌ | ❌ |
    | add_named(name, model) | Named models for debugging | ❌ | ✅ | ❌ (Uniform only) | ❌ | ❌ |
    | add_with_params(name?, model, weight?, desc?, tags?) | Full control | ✅ | ✅ | ✅ | ✅ | ✅ |
  • 🏷️ Rich metadata: Attach descriptions, tags, and voting weights to each member. Query voting weight via weight(name).

    ℹ️ description and tags are stored internally but not exposed via public getters yet (reserved for future API).

  • ⚙️ Dynamic runtime control: Enable/disable individual models without retraining via enable(), disable(), enabled(). Perfect for A/B testing, fallback logic, or excluding underperformers on-the-fly.

🗳️ Voting Strategies

  • ⚖️ Uniform or Weighted voting: Simple majority or confidence-based aggregation. Switch at runtime with set_voting_strategy().

  • 🛡️ Rust-style strictness in Weighted mode:

    "Explicit is better than implicit."
    When using VotingStrategy::Weighted, every member must have an explicit, finite, non-negative weight. The API will fail fast with a clear error if you try to add a model without a weight — no silent defaults, no hidden magic. This prevents subtle bugs and makes ensemble behavior predictable.

  • 🔧 Weight management: Set or update weights anytime via set_weight(). Weights are validated on insertion and on strategy switch.
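The weighted aggregation described above boils down to summing, per candidate label, the weights of the members that voted for it, then picking the label with the largest total. A minimal std-only sketch of that tallying logic (illustrative only, not the PR's actual implementation):

```rust
use std::collections::HashMap;

/// Weighted majority vote over (label, weight) pairs.
/// With all weights equal, this reduces to uniform/simple majority voting.
fn weighted_vote(votes: &[(i32, f64)]) -> Option<i32> {
    let mut tally: HashMap<i32, f64> = HashMap::new();
    for &(label, weight) in votes {
        // accumulate each member's weight behind the label it predicted
        *tally.entry(label).or_insert(0.0) += weight;
    }
    tally
        .into_iter()
        // pick the label with the largest accumulated weight
        .max_by(|a, b| a.1.partial_cmp(&b.1).unwrap())
        .map(|(label, _)| label)
}

fn main() {
    // model_a (weight 1.0) votes 0, model_b (weight 2.5) votes 1 — model_b wins
    let winner = weighted_vote(&[(0, 1.0), (1, 2.5)]);
    println!("{:?}", winner); // Some(1)
}
```

This is also why the strict weight validation matters: a missing or non-finite weight would silently corrupt the tally, so failing fast at insertion is the safer design.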

🔮 Prediction & Evaluation

  • 🎯 Two prediction modes (+ 1 scoring helper):
    | Method | Input | Use Case |
    |---|---|---|
    | predict(&x) | Single X for all models | Standard ensemble: all models see the same features |
    | predict_using_names(&HashMap<String, X>) | Per-model X via name | Feature slicing: each model gets its own feature subset |
    | score(&x, &y) -> f64 | Single X + labels Y | Quick accuracy evaluation (sklearn-style convenience) |
  • 📊 Built-in scoring: score() returns accuracy in [0.0, 1.0] — equivalent to accuracy(y, predict(x)), but convenient for cross-validation loops and hyperparameter tuning.

  • ✅ Type-safe predictions: All models in an ensemble must share the same X: Array2<f64> and Y: Array1<i32> + Clone, enforced at compile time via generics + PhantomData.
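The compile-time guarantee mentioned above comes from making the ensemble generic over X and Y and pinning those parameters with PhantomData, since no field stores them directly. A simplified sketch of the pattern (illustrative only, not the PR's actual code):

```rust
use std::marker::PhantomData;

// Each member only needs to satisfy this trait-object interface.
trait Predictor<X, Y> {
    fn predict(&self, x: &X) -> Y;
}

// The ensemble is generic over the shared input/output types; PhantomData
// keeps X and Y in the type even though members are behind trait objects.
struct Ensemble<X, Y> {
    members: Vec<Box<dyn Predictor<X, Y>>>,
    _marker: PhantomData<(X, Y)>,
}

impl<X, Y> Ensemble<X, Y> {
    fn new() -> Self {
        Ensemble { members: Vec::new(), _marker: PhantomData }
    }
    fn add(&mut self, model: Box<dyn Predictor<X, Y>>) {
        // A member with a different X or Y simply does not type-check here.
        self.members.push(model);
    }
    fn len(&self) -> usize {
        self.members.len()
    }
}

// A toy member that always predicts the same class.
struct ConstModel(i32);
impl Predictor<Vec<f64>, i32> for ConstModel {
    fn predict(&self, _x: &Vec<f64>) -> i32 {
        self.0
    }
}

fn main() {
    let mut e: Ensemble<Vec<f64>, i32> = Ensemble::new();
    e.add(Box::new(ConstModel(1)));
    println!("{} member(s), first predicts {}", e.len(), e.members[0].predict(&vec![0.0]));
}
```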

🧰 Introspection & Utilities

  • 🔍 Ensemble state: names(), len(), is_empty(), strategy(), get_ensemble_info() — query structure and configuration anytime.

  • 🏷️ Metadata queries: weight(name) — get voting weight for a member.

  • 🔄 Strategy switching: set_voting_strategy() validates all weights when switching to Weighted, ensuring consistency.


📚 Usage Guide: From Simple to Advanced

🎯 Scenario 1: The "Just Works" Way (3 lines)

Want the easy way? No problem! Just do:

use smartcore::ensemble::general_ensemble::Ensemble;
use smartcore::neighbors::knn_classifier::KNNClassifier;

// 1. Create ensemble (defaults to Uniform voting)
let mut ensemble = Ensemble::new();

// 2. Train and add models — names auto-generated ("model_0", "model_1", ...)
let knn1 = KNNClassifier::fit(&x_train, &y_train, params_k3)?;
let knn2 = KNNClassifier::fit(&x_train, &y_train, params_k5)?;
ensemble.add(knn1)?;
ensemble.add(knn2)?;

// 3. Predict — uniform voting out of the box
let predictions = ensemble.predict(&x_valid)?;
let acc = ensemble.score(&x_valid, &y_valid)?;
println!("Accuracy: {:.4}", acc);

✅ That's it. No weights, no names, no config.


🎯 Scenario 2: Name Your Models

Use add_named() when you want explicit control and better observability in your ensemble. Meaningful names make it easier to:

  • Debug individual model behavior
  • Enable/disable specific members at runtime
  • Log and audit which models contributed to a prediction
  • Manage A/B tests or canary deployments
let mut ensemble = Ensemble::new();

let knn_small = KNNClassifier::fit(&x_train, &y_train, KNNClassifierParameters::default().with_k(3))?;
let knn_large = KNNClassifier::fit(&x_train, &y_train, KNNClassifierParameters::default().with_k(15))?;

// Give them meaningful names — easier to debug and manage
ensemble.add_named("knn_k3".into(), knn_small)?;
ensemble.add_named("knn_k15".into(), knn_large)?;

// Later: inspect, enable/disable, debug by name
println!("Models: {:?}", ensemble.names());  // ["knn_k3", "knn_k15"]
ensemble.disable("knn_k3")?;                  // temporarily exclude from voting
let active = ensemble.enabled();              // ["knn_k15"]

💡 add_named() is syntactic sugar over add_with_params(Some(name), model, None, None, vec![]).


🎯 Scenario 3: Control Voting — Full Lifecycle

Step-by-step: Uniform → assign weights → switch to Weighted

// Step 1: Start with Uniform (default)
let mut ensemble = Ensemble::new();

let model_a = train_model_a()?;
let model_b = train_model_b()?;

// Add models without weights — works in Uniform mode
ensemble.add_named("model_a".into(), model_a)?;
ensemble.add_named("model_b".into(), model_b)?;

// Predict with simple majority voting
let preds_uniform = ensemble.predict(&x_valid)?;

// Step 2: Assign weights to models (prepare for Weighted mode)
ensemble.set_weight("model_a", 1.0)?;  // baseline
ensemble.set_weight("model_b", 2.5)?;  // higher confidence

// Step 3: Switch to Weighted strategy
// ⚠️ This will fail if any enabled member lacks a weight
ensemble.set_voting_strategy(VotingStrategy::Weighted)?;

// Now predictions use weighted voting
let preds_weighted = ensemble.predict(&x_valid)?;

// Compare results: switch back to Uniform for a fair side-by-side
ensemble.set_voting_strategy(VotingStrategy::Uniform)?;
println!("Uniform acc: {:.4}", ensemble.score(&x_valid, &y_valid)?);
ensemble.set_voting_strategy(VotingStrategy::Weighted)?;
println!("Weighted acc: {:.4}", ensemble.score(&x_valid, &y_valid)?);

🔁 Tip: You can switch strategies multiple times at runtime. Just ensure all members have valid weights before activating Weighted mode.


🎯 Scenario 4: Feature Slicing — Different Inputs per Model

For advanced use-cases like training on different feature subsets (multi-view learning).

let mut ensemble = Ensemble::with_strategy(VotingStrategy::Uniform);

// Train models on different feature slices
let model_a = train_on_features(&x_train, &[0,1,2])?;  // features 0-2
let model_b = train_on_features(&x_train, &[3,4,5])?;  // features 3-5

ensemble.add_named("slice_A".into(), model_a)?;
ensemble.add_named("slice_B".into(), model_b)?;

// Prepare inputs: each model gets its own feature slice
let mut inputs = HashMap::new();
inputs.insert("slice_A".into(), extract_features(&x_valid, &[0,1,2])?);
inputs.insert("slice_B".into(), extract_features(&x_valid, &[3,4,5])?);

// Predict with per-model inputs
let predictions = ensemble.predict_using_names(&inputs)?;

✂️ Constraint: All input matrices must have the same number of samples (rows), but can have different numbers of features (columns).
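extract_features and train_on_features above are user-supplied helpers, not part of this PR. For a row-major matrix, column selection can be as simple as picking the requested indices from each row; a std-only sketch of such a helper (names and types are illustrative):

```rust
/// Select a subset of columns from a row-major matrix.
/// Every slice keeps the same number of rows, which satisfies the
/// same-sample-count constraint stated above.
fn extract_features(x: &[Vec<f64>], cols: &[usize]) -> Vec<Vec<f64>> {
    x.iter()
        .map(|row| cols.iter().map(|&c| row[c]).collect())
        .collect()
}

fn main() {
    let x = vec![vec![1.0, 2.0, 3.0, 4.0], vec![5.0, 6.0, 7.0, 8.0]];
    let slice_a = extract_features(&x, &[0, 1]); // features 0-1
    let slice_b = extract_features(&x, &[2, 3]); // features 2-3
    assert_eq!(slice_a.len(), slice_b.len());    // same row count, different columns
    println!("{:?}", slice_a); // [[1.0, 2.0], [5.0, 6.0]]
}
```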


🎯 Scenario 5: Full Control — Metadata, Tags, Dynamic Management

let mut ensemble = Ensemble::with_strategy(VotingStrategy::Weighted);

// Add with full metadata
ensemble.add_with_params(
    Some("rf_prod_v2".into()),
    rf_model,
    Some(1.5),
    Some("Random Forest, depth=20, trained on Q1 data".into()),
    vec!["tree".into(), "production".into(), "v2".into()]
)?;

// Query metadata
assert_eq!(ensemble.weight("rf_prod_v2"), Some(1.5));
// ℹ️ description()/tags() getters are planned for future release

// Dynamic control at runtime
ensemble.disable("rf_prod_v2")?;   // exclude from voting
ensemble.enable("rf_prod_v2")?;    // re-include
ensemble.set_weight("rf_prod_v2", 2.0)?; // adjust influence

// Introspect state
let info = ensemble.get_ensemble_info();
println!("Strategy: {:?}, Members: {}/{}", info.strategy, info.enabled_members, info.total_members);

🏭 Real-World Usage Patterns

These come from my SAAN project.

Pattern 1: Auto-disable underperforming models

// Disable models with quality score below threshold
let threshold = 0.4;
for model_name in ensemble.names() {
    // Note: weight() here is repurposed as a model quality score, not a voting weight
    if let Some(w) = ensemble.weight(&model_name) {
        if w < threshold {
            ensemble.disable(&model_name)?;
            debug!("Disabled underperforming model: {}", model_name);
        }
    }
}

Pattern 2: Compare voting strategies on the same ensemble

// Evaluate Weighted voting
let predictions_weighted = ensemble.predict_using_names(&valid_x_combined)?;
let (acc_w, prec_w, rec_w, f1_w) = count_metrics!(&y_valid, &predictions_weighted);

// Switch to Uniform and re-evaluate
ensemble.set_voting_strategy(VotingStrategy::Uniform)?;
let predictions_uniform = ensemble.predict_using_names(&valid_x_combined)?;
let (acc_u, prec_u, rec_u, f1_u) = count_metrics!(&y_valid, &predictions_uniform);

println!("Weighted — Acc: {:.4}, F1: {:.4}", acc_w, f1_w);
println!("Uniform  — Acc: {:.4}, F1: {:.4}", acc_u, f1_u);

Pattern 3: Dynamically add a strong model and boost its influence

// Add Random Forest as a new ensemble member
let rf_params = RandomForestClassifierParameters::default()
    .with_n_trees(20)
    .with_max_depth(u16::MAX); // effectively unlimited depth
let rf_model = RandomForestClassifier::fit(&x_train, &y_train, rf_params)?;
ensemble.add_with_params(
    Some("random_forest".into()), 
    rf_model, 
    Some(1.0),  // initial weight
    Some("Random Forest, 20 trees".into()), 
    vec![]
)?;

// Include RF in the input map for feature-sliced prediction
valid_x_combined.insert("random_forest".into(), x_valid.clone());

// Predict with Uniform voting first
ensemble.set_voting_strategy(VotingStrategy::Uniform)?;
let preds_uniform_rf = ensemble.predict_using_names(&valid_x_combined)?;

// Then boost RF's influence in Weighted mode
ensemble.set_weight("random_forest", 0.9)?;
ensemble.set_voting_strategy(VotingStrategy::Weighted)?;
let preds_weighted_rf = ensemble.predict_using_names(&valid_x_combined)?;

📊 Interpreting Ensemble Logs

When running ensembles in production, you'll see structured output like this:

Ensemble: Strategy=Uniform, Active=19/19 members
After pruning (precision < 0.4): Strategy=Weighted, Active=13/19 members

=== Fold 4 Results ===
Baseline (mean):     Acc: 0.480, Prec: 0.467, Rec: 0.420, F1: 0.417
Uniform ensemble:    Acc: 0.697, Prec: 0.804, Rec: 0.606, F1: 0.667
Weighted ensemble:   Acc: 0.742, Prec: 0.792, Rec: 0.668, F1: 0.703  ← +5.3% F1
+RF, Uniform:        Acc: 0.727, Prec: 0.805, Rec: 0.658, F1: 0.705
+RF, Weighted:       Acc: 0.818, Prec: 0.855, Rec: 0.765, F1: 0.794  ← +12.6% F1 vs baseline

🔍 How to read this:

  1. Member count: Active=13/19 means 6 models were disabled due to low precision
  2. Strategy impact: Weighted voting improved F1 by 5.3% over Uniform on the same models
  3. Model addition: Adding Random Forest boosted performance further
  4. Weight tuning: Giving RF higher weight (0.9) in Weighted mode yielded the best result

💡 Pro tips:

  • Always log get_ensemble_info() before/after major changes
  • Compare metrics across strategies to validate your weighting scheme
  • Use enabled() to verify which models actually contributed to a prediction

🧪 Testing Philosophy

Our test suite covers:

| Test Category | Examples |
|---|---|
| ✅ Basic functionality | add(), add_named(), auto-names |
| ✅ Heterogeneous ensembles | KNN + RF + Decision Tree in one ensemble |
| ✅ Voting strategies | Uniform vs Weighted, weight validation |
| ✅ Feature slicing | predict_using_names() with per-model inputs |
| ✅ Runtime management | enable()/disable() affecting predictions |
| ✅ Metadata | Weights query/update |
| ✅ Error handling | Duplicate names, missing weights, empty ensemble |
| ✅ Scoring | score() validity across model additions/removals |

📊 Test coverage: 13 focused tests covering basic usage, error paths, voting strategies, feature slicing, and runtime management.

All tests use minimal, reproducible dummy data and verify both success and failure paths.


📋 Public API Summary

| Category | Methods | Count |
|---|---|---|
| Construction | new(), with_strategy() | 2 |
| Add models | add(), add_named(), add_with_params() | 3 |
| Metadata | set_weight(), set_description(), weight() | 3 |
| Runtime control | enable(), disable(), enabled() | 3 |
| Prediction | predict(), predict_using_names(), score() | 3 |
| Introspection | names(), len(), is_empty(), strategy(), get_ensemble_info() | 5 |
| Strategy | set_voting_strategy() | 1 |
| **Total public methods** | | **20** |

🎯 Of these, 3 methods add models, 3 methods predict, and 1 method scores — the core workflow in 7 calls.


⚠️ Common Pitfalls & How We Prevent Them

| Pitfall | Our Solution |
|---|---|
| Forgetting weights in Weighted mode | Runtime validation on add and on strategy switch; clear error message |
| Duplicate model names | add_with_params() checks HashMap keys; fails fast |
| Mismatched input dimensions in predict_using_names() | Array2 trait enforces shape; Failed error on mismatch |
| Using disabled models in voting | enabled() filter applied automatically in predict() |
| Type mismatches between models | ✅ Generics Ensemble<X, Y> enforce same input/output types at compile time |
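The duplicate-name fail-fast check can be expressed with HashMap's entry API, which distinguishes "name already taken" from "name free" in one lookup. A hedged sketch of that pattern (illustrative only, not the PR's actual code):

```rust
use std::collections::hash_map::Entry;
use std::collections::HashMap;

/// Register a member name, failing fast on duplicates instead of
/// silently overwriting the existing entry.
fn register(members: &mut HashMap<String, usize>, name: &str, id: usize) -> Result<(), String> {
    match members.entry(name.to_string()) {
        // name already present — reject with a clear error
        Entry::Occupied(_) => Err(format!("duplicate model name: {}", name)),
        // name free — claim it
        Entry::Vacant(slot) => {
            slot.insert(id);
            Ok(())
        }
    }
}

fn main() {
    let mut members = HashMap::new();
    assert!(register(&mut members, "knn_k3", 0).is_ok());
    assert!(register(&mut members, "knn_k3", 1).is_err()); // fails fast
    println!("{} member(s) registered", members.len()); // 1 member(s) registered
}
```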

🚀 What's Next? (Roadmap)

| Feature | Status | Priority |
|---|---|---|
| predict_proba() support | 🟡 Planned | High |
| UUID names | ⚪ Idea | Medium |
| Public getters for description() and tags() | ⚪ Idea | Medium |
| Auto-reset weights to None when switching Weighted → Uniform | ⚪ Idea | Low |

ℹ️ Note on strategy switching: When switching from Weighted to Uniform, weights are preserved but ignored. If you want a "clean slate", manually set weights to None via a future helper method (planned).

Ready for review. 🦀✨

@skywardfire1 skywardfire1 requested a review from Mec-iS as a code owner April 3, 2026 10:18
@codecov

codecov bot commented Apr 3, 2026

Codecov Report

❌ Patch coverage is 29.19255% with 114 lines in your changes missing coverage. Please review.
✅ Project coverage is 44.24%. Comparing base (70d8a0f) to head (469e9ee).
⚠️ Report is 14 commits behind head on development.

Files with missing lines Patch % Lines
src/ensemble/generic_ensemble.rs 29.19% 114 Missing ⚠️
Additional details and impacted files
@@               Coverage Diff               @@
##           development     #366      +/-   ##
===============================================
- Coverage        45.59%   44.24%   -1.35%     
===============================================
  Files               93       96       +3     
  Lines             8034     8190     +156     
===============================================
- Hits              3663     3624      -39     
- Misses            4371     4566     +195     

☔ View full report in Codecov by Sentry.

@skywardfire1
Contributor Author

Around 10 days of work. I'll fix the 2 failing builds soon.

@Mec-iS
Collaborator

Mec-iS commented Apr 3, 2026

wow this is great! thanks.
it will take some time to unroll.

it would be nice to have also #365 fixed with this so we can bump to v0.5.0
