Make networkx dependency optional #1301

MariusWirtz · 2025-10-31T09:13:24Z

Switch verify_edges default to False

onefloid · 2025-11-06T05:17:47Z

I hate to ask it, but should we deprecate this first?

This isn’t a real breaking change, but rather a behavioral change with lower risk.

A gradual change could make sense here because the check only affects early error detection, not core functionality. Keeping it enabled for now avoids surprising users while still allowing the dependency to become optional later. For heavy workflows, failing early may also save resources. Also, if TM1 refuses to build the hierarchy, the call should ideally raise the same ValueError; otherwise, differences in error handling in the calling code increase the risk of breaking existing code.
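To make the trade-off concrete, here is a minimal sketch of how a check could stay on by default while networkx remains an optional install. The helper names `networkx_available` and `verify_edges_or_skip` are hypothetical and not taken from this PR, and the sketch assumes the check looks for circular references in (parent, child) edges.

```python
import importlib.util


def networkx_available() -> bool:
    """Return True if networkx can be imported in this environment."""
    return importlib.util.find_spec("networkx") is not None


def verify_edges_or_skip(edges, verify_edges=True):
    """Hypothetical helper: run the early check only when it is requested.

    Assumes `edges` is an iterable of (parent, child) pairs and that the
    check is a cycle test; the real check in the library may differ.
    """
    if not verify_edges:
        return "skipped"
    if not networkx_available():
        # Fail with a clear message instead of an unexplained import crash.
        raise ImportError("verify_edges=True requires the optional networkx package")

    import networkx as nx  # imported lazily so the dependency stays optional

    graph = nx.DiGraph()
    graph.add_edges_from(edges)
    if not nx.is_directed_acyclic_graph(graph):
        # Same exception type that existing callers already handle.
        raise ValueError("Circular reference found in edges")
    return "verified"
```

With this shape, callers who never pass `verify_edges=True` do not need networkx at all, and callers who do still get a ValueError on bad input.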

vmitsenko · 2025-11-07T16:32:38Z

Hi both, just a suggestion: it might be worth replacing the networkx library with a depth-first search implementation to maintain backward compatibility and simplify dependencies.
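As a rough illustration of the idea, the sketch below detects a cycle with an iterative depth-first search using only the standard library. It is not the code from this PR; the function names `find_cycle` and `verify_edges_dfs` are hypothetical, and it assumes the edges arrive as (parent, child) pairs and that the goal is to reject circular references.

```python
from collections import defaultdict

# Node states for the traversal.
UNVISITED, IN_PROGRESS, DONE = 0, 1, 2


def find_cycle(edges):
    """Return one cycle as a list of nodes, or None if the edges are acyclic."""
    children = defaultdict(list)
    nodes = set()
    for parent, child in edges:
        children[parent].append(child)
        nodes.update((parent, child))

    state = dict.fromkeys(nodes, UNVISITED)

    for root in nodes:
        if state[root] != UNVISITED:
            continue
        state[root] = IN_PROGRESS
        path = [root]  # nodes on the current DFS path
        stack = [iter(children.get(root, ()))]  # one child iterator per path node
        while stack:
            for child in stack[-1]:
                if state[child] == IN_PROGRESS:
                    # Back edge to a node on the current path: cycle found.
                    return path[path.index(child):] + [child]
                if state[child] == UNVISITED:
                    state[child] = IN_PROGRESS
                    path.append(child)
                    stack.append(iter(children.get(child, ())))
                    break
            else:
                # All children explored: this node cannot be part of a cycle.
                state[path.pop()] = DONE
                stack.pop()
    return None


def verify_edges_dfs(edges):
    """Raise ValueError if the edges contain a circular reference."""
    cycle = find_cycle(edges)
    if cycle is not None:
        raise ValueError(f"Circular reference found: {cycle}")
```

The explicit stack avoids Python's recursion limit on deep hierarchies, which matters for chains of many thousands of elements.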

onefloid · 2025-11-19T05:36:16Z

@vmitsenko I like this approach. In theory, it could be faster and use less memory than networkx. Have you tested it on a large dataframe?

vmitsenko · 2025-11-21T14:55:25Z

@onefloid The current code is definitely not the most efficient implementation of depth-first search; it's simply the easiest and most straightforward to write. However, it appears to run faster and use less memory than networkx.

I ran tests using the following dataframe, and these are the results:

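import numpy as np
import pandas as pd


def generate_df(n_rows=100_000, n_parents=10):
    # Elements are numbered 1..n_rows
    elements = np.arange(1, n_rows + 1)

    data = {"Element": elements}

    # Parent_p of element e is e - p; values below 1 are clamped to 0
    for p in range(1, n_parents + 1):
        parents = elements - p
        parents[parents < 1] = 0
        data[f"Parent_{p}"] = parents

    return pd.DataFrame(data)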

depth-first search
Execution time: 7.2776 seconds
Peak memory usage: 65.047 MB

networkx
Execution time: 14.5668 seconds
Peak memory usage: 185.592 MB
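The thread does not say how these numbers were collected. One way to take comparable measurements with the standard library is sketched below; the `measure` helper is hypothetical, and the use of `time.perf_counter`, `tracemalloc`, and 1 MB = 1,000,000 bytes are assumptions rather than the method used above.

```python
import time
import tracemalloc


def measure(func, *args, **kwargs):
    """Run func once and return (result, elapsed_seconds, peak_megabytes)."""
    tracemalloc.start()
    start = time.perf_counter()
    try:
        result = func(*args, **kwargs)
    finally:
        elapsed = time.perf_counter() - start
        # get_traced_memory() returns (current, peak) in bytes.
        _, peak = tracemalloc.get_traced_memory()
        tracemalloc.stop()
    return result, elapsed, peak / 1_000_000


# Usage with a stand-in workload; swap in the edge check under test.
result, seconds, peak_mb = measure(sorted, range(100_000, 0, -1))
print(f"Execution time: {seconds:.4f} seconds")
print(f"Peak memory usage: {peak_mb:.3f} MB")
```

Note that tracemalloc only sees allocations made through Python's allocator and adds overhead of its own, so timings taken this way are best compared against each other rather than read as absolute values.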

MariusWirtz · 2025-11-24T21:01:36Z

Amazing! Thank you @vmitsenko

Switch `verify_edges` default to `False` Fixes #1300

MariusWirtz and others added 3 commits November 24, 2025 22:02

Make networkx dependency optional

934ce74

Switch `verify_edges` default to `False` Fixes #1300

Replacing the networkx library with a depth-first search

ac2a5b6

Set verify_edges default back to True

c78bbe9

MariusWirtz force-pushed the 1300-make-networkx-optional-dependency branch from 63c5174 to c78bbe9 on November 24, 2025 21:02

MariusWirtz merged commit 4b09446 into master Nov 24, 2025
