Implement missing 4th-edition algorithms (game theory, EM, Kalman, DBN, DDN, SARSA) with tests and demo notebooks, plus bug fixes and CI modernization by dmeoli · Pull Request #1323 · aimacode/aima-python

dmeoli · 2026-06-23T11:23:39Z

This PR fills in several algorithms from the 4th edition that were not yet implemented, adds tests (including cases taken from the book and the 3rd-edition solutions manual) and runnable demo notebooks, fixes a few bugs, and modernizes the CI. The full test suite passes (446 tests).

New algorithms

game_theory.py (new module) - Chapter 18, Multiagent Decision Making

18.2 Non-cooperative: dominates / dominant_strategy, iterated_dominance, pure_nash_equilibria, solve_zero_sum_game (von Neumann minimax via linear programming)
18.3 Cooperative: shapley_value, is_in_core
18.4 Collective decisions: plurality_winner, borda_winner, condorcet_winner, vickrey_auction, contract_net, alternating_offers_bargaining
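
As a rough illustration of two of the ideas listed above, the sketch below enumerates the pure-strategy Nash equilibria of a two-player game and computes Shapley values by averaging marginal contributions over all player orderings. It is not the code from this PR: the function names, the dictionary payoff layout, and the stag-hunt and glove-game payoffs are all assumptions chosen for the example.

```python
from fractions import Fraction
from itertools import permutations, product


def pure_nash_sketch(payoffs):
    """payoffs[(row_action, col_action)] = (row_payoff, col_payoff). Hypothetical layout."""
    rows = sorted({r for r, _ in payoffs})
    cols = sorted({c for _, c in payoffs})
    equilibria = []
    for r, c in product(rows, cols):
        u_row, u_col = payoffs[(r, c)]
        # Neither player can gain by deviating unilaterally.
        row_ok = all(payoffs[(r2, c)][0] <= u_row for r2 in rows)
        col_ok = all(payoffs[(r, c2)][1] <= u_col for c2 in cols)
        if row_ok and col_ok:
            equilibria.append((r, c))
    return equilibria


def shapley_sketch(players, value):
    """Average each player's marginal contribution over every join order."""
    totals = {p: Fraction(0) for p in players}
    orders = list(permutations(players))
    for order in orders:
        coalition = frozenset()
        for p in order:
            joined = coalition | {p}
            totals[p] += value(joined) - value(coalition)
            coalition = joined
    return {p: total / len(orders) for p, total in totals.items()}


# Illustrative stag-hunt payoffs (assumed numbers).
stag_hunt = {
    ("stag", "stag"): (4, 4),
    ("stag", "hare"): (0, 3),
    ("hare", "stag"): (3, 0),
    ("hare", "hare"): (3, 3),
}
nash = pure_nash_sketch(stag_hunt)


# Illustrative glove game: A owns a left glove, B and C each own a right glove.
def glove_value(coalition):
    return 1 if "A" in coalition and ({"B", "C"} & coalition) else 0


shares = shapley_sketch(["A", "B", "C"], glove_value)
```

Enumerating all orderings is exponential in the number of players, so this is only practical for small coalitional games.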

probability.py

KalmanFilter / kalman_filter - Kalman filtering (Section 15.4)
DynamicBayesNet - dynamic Bayesian networks with unroll() and exact filtering (Section 15.5)
baum_welch - EM for learning HMM parameters (Section 20.3.3)
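
For readers unfamiliar with the Kalman update, here is a minimal one-dimensional sketch for a random-walk model with Gaussian transition noise and Gaussian sensor noise. It does not reproduce the KalmanFilter class added here; the function names, parameters, and measurement values are assumptions for illustration. It also shows that the posterior variance converges to a steady state that does not depend on the observations.

```python
import math


def kalman_1d_update(mean, var, z, var_x, var_z):
    """One predict-then-update step for a 1-D random walk (hypothetical helper)."""
    predicted_var = var + var_x            # predict: transition noise inflates the variance
    gain = predicted_var / (predicted_var + var_z)
    new_mean = mean + gain * (z - mean)    # move toward the measurement z
    new_var = (1 - gain) * predicted_var   # update: the measurement shrinks the variance
    return new_mean, new_var


def steady_state_variance(var_x, var_z):
    """Positive root of s^2 + var_x * s - var_x * var_z = 0, the fixed point of the update."""
    return (-var_x + math.sqrt(var_x ** 2 + 4 * var_x * var_z)) / 2


mean, var = 0.0, 1.0
for z in [2.5, 2.0, 2.2, 1.9, 2.1] * 10:   # made-up measurements
    mean, var = kalman_1d_update(mean, var, z, var_x=1.0, var_z=1.0)
```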

learning.py

gaussian_mixture_em - EM for a mixture of Gaussians (Section 20.3.1)
naive_bayes_em - EM for a Bayes net with a hidden variable, the candy-bags model (Section 20.3.2)
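
The E-step and M-step for a Gaussian mixture can be sketched in one dimension with the standard library alone. This is an illustrative sketch, not the gaussian_mixture_em implementation from this PR; the function names, the variance floor, the starting parameters, and the data are assumptions.

```python
import math


def normal_pdf(x, mean, var):
    return math.exp(-(x - mean) ** 2 / (2 * var)) / math.sqrt(2 * math.pi * var)


def em_step(data, weights, means, variances):
    """One EM iteration for a 1-D Gaussian mixture (hypothetical helper)."""
    k = len(weights)
    # E-step: responsibility of each component for each point.
    resp = []
    for x in data:
        joint = [weights[i] * normal_pdf(x, means[i], variances[i]) for i in range(k)]
        total = sum(joint)
        resp.append([j / total for j in joint])
    # M-step: re-estimate weights, means, and variances from the soft counts.
    new_w, new_m, new_v = [], [], []
    for i in range(k):
        n_i = sum(r[i] for r in resp)
        mean = sum(r[i] * x for r, x in zip(resp, data)) / n_i
        var = sum(r[i] * (x - mean) ** 2 for r, x in zip(resp, data)) / n_i
        new_w.append(n_i / len(data))
        new_m.append(mean)
        new_v.append(max(var, 1e-6))  # assumed floor to avoid a collapsing component
    return new_w, new_m, new_v


# Two well-separated made-up clusters around -2 and 3.
data = [-2.1, -1.9, -2.0, -1.8, -2.2, 2.9, 3.1, 3.0, 3.2, 2.8]
weights, means, variances = [0.5, 0.5], [-1.0, 1.0], [1.0, 1.0]
for _ in range(25):
    weights, means, variances = em_step(data, weights, means, variances)
```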

mdp4e.py

pomdp_lookahead and update_belief - online POMDP agent over a dynamic decision network via belief-state expectimax look-ahead (Section 17.4)
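
The belief update at the heart of such an agent is a predict step through the action-conditioned transition model followed by reweighting with the sensor model and normalizing. The sketch below is an assumption-laden illustration, not the update_belief function from this PR: the function name, the table layouts, and the two-state model (actions succeed with probability 0.9, the sensor is right with probability 0.6) are chosen only for the example.

```python
def belief_update_sketch(belief, action, observation, transition, sensor):
    """b'(s') is proportional to P(o | s') * sum_s P(s' | s, a) * b(s). Hypothetical helper."""
    n = len(belief)
    predicted = [
        sum(transition[action][s][s2] * belief[s] for s in range(n))
        for s2 in range(n)
    ]
    unnormalized = [sensor[s2][observation] * predicted[s2] for s2 in range(n)]
    total = sum(unnormalized)
    return [p / total for p in unnormalized]


# transition[action][s][s2] = P(s2 | s, action); assumed two-state model.
transition = {
    "stay": [[0.9, 0.1], [0.1, 0.9]],
    "go": [[0.1, 0.9], [0.9, 0.1]],
}
# sensor[s][o] = P(o | s); the sensor reports the true state with probability 0.6.
sensor = [[0.6, 0.4], [0.4, 0.6]]

b0 = [0.5, 0.5]
b1 = belief_update_sketch(b0, "stay", 1, transition, sensor)
b2 = belief_update_sketch(b1, "go", 0, transition, sensor)
```

A look-ahead agent would apply this update along every action and observation branch of the expectimax tree, weighting each observation branch by its probability.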

reinforcement_learning.py and reinforcement_learning4e.py

SARSALearningAgent - on-policy temporal-difference control (Section 21.3)
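
What makes SARSA on-policy is that its target uses the action the agent actually takes next, rather than the greedy maximum used by Q-learning. A minimal sketch of the update rule follows; the function name, the state and action labels, and the parameter values are illustrative assumptions, not the SARSALearningAgent interface.

```python
from collections import defaultdict


def sarsa_update(q, s, a, r, s_next, a_next, alpha, gamma):
    """Q(s, a) += alpha * (r + gamma * Q(s', a') - Q(s, a)). Hypothetical helper."""
    target = r + gamma * q[(s_next, a_next)]  # uses the action actually chosen in s'
    q[(s, a)] += alpha * (target - q[(s, a)])


q = defaultdict(float)  # unseen state-action pairs start at 0.0
sarsa_update(q, "B", "right", 1.0, "C", "right", alpha=0.5, gamma=0.9)
sarsa_update(q, "A", "right", 0.0, "B", "right", alpha=0.5, gamma=0.9)
```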

Bug fixes

return NotImplementedError changed to raise NotImplementedError in deep_learning4e.py and ipyviews.py (was returning the exception class instead of raising it)
Fixed SyntaxWarning from invalid escape sequences in docstrings (csp.py, logic.py) by using raw strings
Keras 3 compatibility: optimizers.SGD(lr=...) changed to learning_rate=... in deep_learning4e.py
Made test_value_iteration robust to floating-point rounding across numpy/BLAS versions (pytest.approx)
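
The first fix matters because returning the exception class hands the caller an ordinary object and the error goes unnoticed. A small standalone illustration (the class and method names are hypothetical, not taken from deep_learning4e.py or ipyviews.py):

```python
class Layer:
    def forward_broken(self, inputs):
        # Bug: returns the exception class, so the caller sees a normal return value.
        return NotImplementedError

    def forward_fixed(self, inputs):
        # Fix: raising signals that a subclass must override this method.
        raise NotImplementedError


layer = Layer()
silently_wrong = layer.forward_broken([1, 2, 3])

try:
    layer.forward_fixed([1, 2, 3])
    raised = False
except NotImplementedError:
    raised = True
```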

Tests

New tests/test_game_theory.py and new tests for the Kalman filter, DBN, Baum-Welch, both EM variants, the DDN look-ahead agent, and SARSA
Cases taken from the book / 3rd-edition solutions manual: the zero-sum game of exercise 17.17 (optimal mixed strategy [1/9, 1/9, 1/9, 1/3, 1/3], value 0), a saddle-point game, battle of the sexes, stag hunt, the umbrella DBN cross-checked against the HMM forward algorithm, and the Kalman steady-state variance
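
The kind of cross-check described for the umbrella model can be sketched by comparing forward filtering against brute-force enumeration of all hidden state sequences. This is not the test code from this PR; the function names and table layout are assumptions, and the numbers are the textbook umbrella-world parameters (transition 0.7, sensor 0.9 and 0.2, uniform prior).

```python
from itertools import product

TRANSITION = [[0.7, 0.3], [0.3, 0.7]]   # state 0 = rain, state 1 = no rain
SENSOR = [{True: 0.9, False: 0.1}, {True: 0.2, False: 0.8}]  # P(umbrella | state)
PRIOR = [0.5, 0.5]


def forward_filter(evidence):
    """Recursive filtering: predict with the transition model, then weight by the sensor."""
    belief = PRIOR[:]
    for e in evidence:
        predicted = [sum(TRANSITION[i][j] * belief[i] for i in range(2)) for j in range(2)]
        unnormalized = [SENSOR[j][e] * predicted[j] for j in range(2)]
        total = sum(unnormalized)
        belief = [u / total for u in unnormalized]
    return belief


def enumerate_filter(evidence):
    """Same posterior by summing the joint over every state sequence (exponential cost)."""
    totals = [0.0, 0.0]
    for states in product(range(2), repeat=len(evidence) + 1):
        p = PRIOR[states[0]]
        for t, e in enumerate(evidence, start=1):
            p *= TRANSITION[states[t - 1]][states[t]] * SENSOR[states[t]][e]
        totals[states[-1]] += p
    norm = sum(totals)
    return [x / norm for x in totals]


day1 = forward_filter([True])
day2 = forward_filter([True, True])
check = enumerate_filter([True, True])
```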

Demo notebooks

Added executed notebooks with output (and plots): game_theory.ipynb, expectation_maximization.ipynb, kalman_filter.ipynb, dynamic_decision_network.ipynb, sarsa.ipynb

Cleanup

Fixed spelling typos in comments, docstrings, and data labels (codespell)
Updated the README algorithm index for all the new entries and corrected the Python version section (now Python 3.9 and up)

CI

Added a GitHub Actions workflow running the full suite on Python 3.9, 3.10, 3.11, and 3.12
Bumped the Python versions in .travis.yml to match (3.9-3.12)
Updated the build-status badge


dmeoli · 2026-06-23T11:24:13Z

@antmarakis @norvig

Cover the previously untested non-book algorithms so they match the baseline test coverage:
- test_dpll_branching_heuristics / test_cdcl_restart_strategies exercise every branching heuristic (moms, momsf, posit, dlis, dlcs, jw, jw2, zm) and restart strategy (no_restart, luby, glucose) on small SAT/UNSAT instances.
- test_nary_csp, test_ac_solver_classes, test_crossword, test_kakuro cover NaryCSP/Constraint, the ACSolver/ACSearchSolver classes directly, and the Crossword/Kakuro models.

- game_theory.py -> game_theory4e.py (module written against 4th-edition chapter 18 numbering, consistent with the other 4e-structured modules); test file, notebook and README links updated accordingly.
- making_simple_decision4e.py -> making_simple_decisions4e.py to match the book chapter title 'Making Simple Decisions'.
- planning_graphPlan.ipynb -> planning_graph_plan.ipynb and knowledge_FOIL.ipynb -> knowledge_foil.ipynb (snake_case).
- images/pluralityLearner_plot.png and images/knowledge_FOIL_grandparent.png renamed to snake_case; notebook references updated.

Make function and variable names consistent with the snake_case style used across the rest of the codebase:
- planning.py: the *_graphPlan example helpers become *_graph_plan, plus graphPlan_solution, initialPlan, nConstraints, nPartial.
- nlp.py: loadPageHTML, initPages, stripRawHTML, determineInlinks, findOutlinks, onlyWikipediaURLS, getInLinks, getOutLinks and their local variables (pagesIndex, pagesContent, inLinks, outLinks, etc.).
- Updated all call sites in tests and notebooks, and snake_cased the affected test names and test-local variables.

Class names (PascalCase) and the canvas strokeWidth method, which mirrors the JS canvas API, are intentionally left unchanged.

dmeoli added 30 commits August 23, 2019 19:37

Merge remote-tracking branch 'upstream/master'

396c38b

defined the PlanningProblem as a specialization of a search.Problem &…

776c131

… fixed typo errors

fixed doctest in logic.py

ccc7de1

fixed doctest for cascade_distribution

7e98afb

added ForwardPlanner and tests

061cba1

added __lt__ implementation for Expr

8c10d9f

added more tests

aa61869

renamed forward planner

c4139e5

Revert "renamed forward planner"

e4c4343

This reverts commit c4139e5.

renamed forward planner class & added doc

6e084c0

added backward planner and tests

b6a0cbd

Merge remote-tracking branch 'upstream/master'

1131f4d

fixed mdp4e.py doctests

1af8978

removed ignore_delete_lists_heuristic flag

a4ad133

fixed heuristic for forward and backward planners

26f2b5d

added SATPlan and tests

9faf17a

fixed ignore delete lists heuristic in forward and backward planners

0be0f5d

fixed backward planner and added tests

2cc2d3f

updated doc

4222176

Merge remote-tracking branch 'upstream/master'

30af352

added nary csp definition and examples

b69a907

added CSPlan and tests

6ff465a

fixed CSPlan

d3c291c

added book's cryptarithmetic puzzle example

785850a

fixed typo errors in test_csp

7249058

fixed aimacode#1111

42e9cbc

added sortedcontainers to yml and doc to CSPlan

0fb48f6

added tests for n-ary csp

5cce7d9

fixed utils.extend

b567a6d

updated test_probability.py

2eba772

dmeoli added 26 commits June 23, 2020 01:16

removed not allowed imports

c1ba725

fixed

76452f4

fixed keras

7f817ec

fixed

fa68504

updated requirements.txt

0240551

Merge remote-tracking branch 'upstream/master'

31a09c9

Merge branch 'aimacode:master' into master

05ca823

Merge remote-tracking branch 'upstream/master'

3094b8d

Fix NotImplementedError raises, docstring escapes, keras lr arg and f…

597131e

…laky value_iteration test

Add Kalman filter (Section 15.4)

d62e21a

Add EM for Gaussian mixtures (Section 20.3)

0a70586

Add Baum-Welch for HMM learning (Section 20.3)

d4899a3

Add dynamic decision network POMDP look-ahead agent (Section 17.4)

fd985f8

Add EM for Bayes net with hidden variables (Section 20.3.2)

36054b0

Add non-cooperative game theory: dominance, Nash equilibria, zero-sum…

f95540c

… LP (Chapter 18)

Add cooperative game theory (Shapley, core) and social choice (voting…

a149b53

…, Vickrey auction)

Add contract net protocol and alternating-offers bargaining (Section …

b9d7374

…18.4)

Add dynamic Bayesian network with unrolling and exact filtering (Sect…

e5055d7

…ion 15.5)

Add SARSA on-policy TD-learning agent (Section 21.3)

46240c2

Add iterated elimination of dominated strategies (Section 18.2)

33ea0b9

Add executed demo notebooks for the new 4e algorithms

1f5dfb1

Add tests from the book and solutions manual (zero-sum 17.17, saddle …

51fb6c5

…point, battle of the sexes, stag hunt, DBN sequence, Kalman steady state)

Fix spelling typos in comments, docstrings and data labels

774263e

Update README: modern Python version support and typo fix

ead0c1b

Clean up game_theory.py line lengths for flake8

28018e3

Modernize CI: bump Travis Python versions and add GitHub Actions work…

1128866

…flow

dmeoli added 3 commits June 23, 2026 14:54

dmeoli wants to merge 283 commits into aimacode:master from dmeoli:master
