PythonOT · hinanohart · May 16, 2026 · Jun 1, 2026
diff --git a/RELEASES.md b/RELEASES.md
@@ -23,13 +23,15 @@ This new release adds support for sparse cost matrices and a new lazy EMD solver
 - Add "BSP-OT: Sparse transport plans between discrete measures in loglinear time" (PR #768)
 - Added UOT1D with Frank-Wolfe in `ot.unbalanced.uot_1d` (PR #765)
 - Add Sliced UOT and Unbalanced Sliced OT in `ot/unbalanced/_sliced.py` (PR #765)
+- Add a numerically stable log-domain solver for entropic partial Wasserstein, selectable via the new `method` parameter of `entropic_partial_wasserstein` (`method='sinkhorn_log'`) or directly through `entropic_partial_wasserstein_logscale` (Issue #723)
 - Add cost functions between linear operators following  
   [A Spectral-Grassmann Wasserstein metric for operator representations of dynamical systems](https://arxiv.org/pdf/2509.24920),  
   implemented in `ot.sgot` (PR #792)
 - Build wheels on ubuntu ARM to avoid QEMU emulation (PR #818)
 
 #### Closed issues
 
+- Mitigate NaN regime of `entropic_partial_wasserstein` at small `reg` via a new log-domain solver, reachable with `entropic_partial_wasserstein(..., method='sinkhorn_log')` (Issue #723; the default `method='sinkhorn'` path is unchanged — callers opt into the log-domain variant)
 - Fix NumPy 2.x compatibility in Brenier potential bounds (PR #788)
 - Fix MSVC Windows build by removing __restrict__ keyword (PR #788)
 - Fix O(n³) performance bottleneck in sparse bipartite graph arc iteration (PR #785)

diff --git a/docs/source/user_guide.rst b/docs/source/user_guide.rst
@@ -791,8 +791,13 @@ Interestingly the problem can be casted into a regular OT problem by adding rese
 in which the surplus mass is sent [29]_. We provide a solver for partial OT
 in :any:`ot.partial`. The exact resolution of the problem is computed in :any:`ot.partial.partial_wasserstein`
 and :any:`ot.partial.partial_wasserstein2` that return respectively the OT matrix and the value of the
-linear term. The entropic solution of the problem is computed in :any:`ot.partial.entropic_partial_wasserstein` 
-(see [3]_).
+linear term. The entropic solution of the problem is computed in :any:`ot.partial.entropic_partial_wasserstein`
+(see [3]_). Following the convention of :any:`ot.sinkhorn`, this solver takes a ``method`` parameter:
+``method='sinkhorn'`` (default) runs the classical multiplicative-domain iterations, while
+``method='sinkhorn_log'`` switches to a numerically stable log-domain solver
+(:any:`ot.partial.entropic_partial_wasserstein_logscale`) for small regularisation values where the
+standard solver returns NaN. Both solve exactly the same problem; the log-domain variant is slower
+because it computes everything in log-space.
 
 The partial Gromov-Wasserstein formulation of the problem 
 

diff --git a/examples/unbalanced-partial/plot_entropic_partial_wasserstein_logscale.py b/examples/unbalanced-partial/plot_entropic_partial_wasserstein_logscale.py
@@ -0,0 +1,120 @@
+# -*- coding: utf-8 -*-
+"""
+==========================================================================
+Numerically-stable entropic partial Wasserstein (log-domain solver)
+==========================================================================
+
+.. note::
+    Example added in release: 0.9.7.
+
+`ot.partial.entropic_partial_wasserstein` is numerically unstable at small
+regularisation: the iterates underflow to zero and the returned plan
+contains NaNs (see PythonOT/POT issue #723). This example reproduces the
+failure mode on a small problem and shows that the log-domain solver,
+selected with ``entropic_partial_wasserstein(..., method='sinkhorn_log')``
+(equivalently :any:`ot.partial.entropic_partial_wasserstein_logscale`),
+produces a finite plan over the same sweep, agreeing with the original
+solver at large ``reg`` and degrading gracefully at small ``reg``.
+
+Following the :any:`ot.sinkhorn` convention, the solver to use is chosen
+through the ``method`` parameter: ``'sinkhorn'`` (default) for the classical
+solver and ``'sinkhorn_log'`` for the log-domain one. The log-domain solver
+is slower per iteration than the standard one, so the recommendation is to
+use the standard solver by default and fall back to the log-domain solver
+when ``reg`` is small enough to risk underflow.
+"""
+
+# Author: wzm2256 <wzm2256@qq.com> (original PR #724)
+# License: MIT License
+
+import numpy as np
+import scipy as sp
+import matplotlib.pylab as pl
+
+import ot
+
+##############################################################################
+# Construct a 50x50 cost matrix
+# -----------------------------
+#
+# Mirrors the cost-matrix scale (~50) used in PythonOT/POT issue #723.
+
+rng = np.random.RandomState(0)
+n = 50
+xs = rng.rand(n, 2)
+xt = rng.rand(n, 2)
+M = sp.spatial.distance.cdist(xs, xt) * 50.0
+
+a = np.ones(n) / n
+b = np.ones(n) / n
+m = 0.6  # transport ~60% of the mass
+
+##############################################################################
+# Sweep regularisation
+# --------------------
+#
+# Run both solvers across a range of ``reg`` values. On this 50×50 problem
+# at cost-scale 50 the standard solver returns NaN at the ``reg`` values
+# closest to the underflow boundary (typically ``reg`` ~0.05–0.01 in our
+# runs, though the exact transition depends on the BLAS / platform's
+# float64 underflow behaviour); the log-domain solver stays finite over
+# the whole sweep, including the very small ``reg`` regime where the
+# standard exp(−M/reg) path would underflow to zero everywhere.
+
+regs = [1.0, 0.5, 0.1, 0.05, 0.01, 5e-3, 1e-3, 5e-4]
+standard_finite = []
+logscale_finite = []
+standard_mass = []
+logscale_mass = []
+
+for reg in regs:
+    G_std = ot.partial.entropic_partial_wasserstein(
+        a, b, M, reg=reg, m=m, numItermax=2000
+    )
+    G_log = ot.partial.entropic_partial_wasserstein(
+        a, b, M, reg=reg, m=m, method="sinkhorn_log", numItermax=2000
+    )
+    standard_finite.append(bool(np.isfinite(G_std).all()))
+    logscale_finite.append(bool(np.isfinite(G_log).all()))
+    standard_mass.append(float(G_std.sum()) if np.isfinite(G_std).all() else np.nan)
+    logscale_mass.append(float(G_log.sum()))
+
+print(
+    "reg          standard_finite logscale_finite  std_mass logscale_mass (target m={:.2f})".format(
+        m
+    )
+)
+for reg, sf, lf, sm, lm in zip(
+    regs, standard_finite, logscale_finite, standard_mass, logscale_mass
+):
+    print(f"{reg:>10.4g}   {str(sf):<14}  {str(lf):<14}  {sm:>8.3f}      {lm:>8.3f}")
+
+##############################################################################
+# Plot the resulting plans at large vs. small reg
+# -----------------------------------------------
+
+fig, axes = pl.subplots(2, 2, figsize=(9, 8))
+for ax, reg in zip(axes[:, 0], (1.0, 0.01)):
+    G_std = ot.partial.entropic_partial_wasserstein(
+        a, b, M, reg=reg, m=m, numItermax=2000
+    )
+    if not np.isfinite(G_std).all():
+        G_std = np.zeros_like(G_std)
+        ax.set_title(f"standard, reg={reg}  (NaN)")
+    else:
+        ax.set_title(f"standard, reg={reg}")
+    ax.imshow(G_std, cmap="viridis", aspect="auto")
+    ax.set_xlabel("target")
+    ax.set_ylabel("source")
+
+for ax, reg in zip(axes[:, 1], (1.0, 0.01)):
+    G_log = ot.partial.entropic_partial_wasserstein(
+        a, b, M, reg=reg, m=m, method="sinkhorn_log", numItermax=2000
+    )
+    ax.set_title(f"logscale, reg={reg}")
+    ax.imshow(G_log, cmap="viridis", aspect="auto")
+    ax.set_xlabel("target")
+    ax.set_ylabel("source")
+
+fig.tight_layout()
+pl.show()
diff --git a/ot/partial/__init__.py b/ot/partial/__init__.py
@@ -13,6 +13,7 @@
     partial_wasserstein,
     partial_wasserstein2,
     entropic_partial_wasserstein,
+    entropic_partial_wasserstein_logscale,
     gwgrad_partial,
     gwloss_partial,
     partial_gromov_wasserstein,
@@ -28,6 +29,7 @@
     "partial_wasserstein",
     "partial_wasserstein2",
     "entropic_partial_wasserstein",
+    "entropic_partial_wasserstein_logscale",
     "gwgrad_partial",
     "gwloss_partial",
     "partial_gromov_wasserstein",