gh-93714 Create fast version of match_keys for exact dict by da-woods · Pull Request #93752 · python/cpython

da-woods · 2022-06-12T15:29:53Z

When the type is an exact dict, there's a real speed-up to be gained by skipping the call to "get" and using PyDict_GetItemWithError instead. The implementation is just a slightly modified copy of the existing match_keys function.

When the type is an exact dict, there's a real speed-up to be gained by skipping the call to "get" and using PyDict_GetItemWithError instead.

sweeneyde · 2022-06-13T00:39:16Z

Do you have any benchmark results to prove whether this is worth it?

da-woods · 2022-06-13T06:33:57Z

I'd made a set of microbenchmarks for pattern matching. This change obviously only affects the first test subject (the dict). I've also included the second subject (an inexact dict) below for comparison.

Without:

subject {'a': 'xxx', 1: 500} - time (μs): 1.29 1.28 1.28 1.29 1.28
subject D({'a': 'xxx', 1: 500}) - time (μs): 1.29 1.28 1.29 1.28 1.29
...

With:

subject {'a': 'xxx', 1: 500} - time (μs): 1.05 1.04 1.05 1.04 1.06
subject D({'a': 'xxx', 1: 500}) - time (μs): 1.33 1.27 1.27 1.27 1.27
...

It's about a 20% speed up here. If I increase the number of keys to be tested so that the subject is {'a': 'xxx', 1: 500, 'b': 1, 'c': 2, 'd': 3} and the pattern is {"a": "xxx", A.b: x, "b": _, "c": _, "d": _, **extra} then the speed-up remains about 0.2 μs per call, suggesting that it's the constant work of getting subject.get and setting up the dummy object that's being eliminated.

So the benefit is probably "real, but not huge".

(I should add: I'm just compiling Python with make and no special C flags. The Python 3.10 version pre-installed on my system is notably quicker than both versions here, so I suspect I'm not getting the right compiler flags to really test it)

da-woods · 2022-06-13T07:35:23Z

With --enable-optimizations --with-ltoI get:

With this change

subject {'a': 'xxx', 1: 500} - time (μs): 0.506 0.505 0.506 0.504 0.506
subject D({'a': 'xxx', 1: 500}) - time (μs): 0.593 0.596 0.594 0.594 0.597

and without

subject {'a': 'xxx', 1: 500} - time (μs): 0.569 0.574 0.574 0.577 0.578
subject D({'a': 'xxx', 1: 500}) - time (μs): 0.578 0.58 0.585 0.583 0.584

So a fairly similar level of speed-up.

markshannon · 2022-06-13T15:30:13Z

I think this is heading in the wrong direction.
We should be improving the performance of pattern matching by producing better bytecode, not by making the interpreter more complex.

da-woods · 2022-06-14T17:13:23Z

I think this is heading in the wrong direction. We should be improving the performance of pattern matching by producing better bytecode, not by making the interpreter more complex.

I agree at least in principle - I known @brandtbucher said he'd done some work on that. I was mainly proposing this PR as a fairly simple short-term improvement. But I'd ultimately expect it to be replaced with bytecode.

Please do reject it if you don't think the extra code is worth the improvement though.

markshannon · 2022-06-15T08:51:01Z

I won't close this just yet.
We might choose to use it as a temporary measure, until the compiler generates better code for matching mappings.
@brandtbucher thoughts?

github-actions · 2026-04-11T06:21:34Z

This PR is stale because it has been open for 30 days with no activity.

eendebakpt · 2026-06-01T18:42:32Z

I think this is heading in the wrong direction. We should be improving the performance of pattern matching by producing better bytecode, not by making the interpreter more complex.

There PR has been inactive for almost 3 years now, so maybe time to consider closing or picking up again. I explored a bit, here is a summary:

The current PR (once rebased onto main) still gives a 1.20x speedup for matching dicts with few keys (1 to 6)
Another optimization is to detect at compile time that the keys are not duplicates (this works for literal keys) . With a dedicated opcode MATCH_KEYS_UNIQUE we can skip the duplicate key check. Gives a 1.22x speedup. In tier2 the changes for this are quite small.
Combination of the above gives a 1.6x speedup (that seems large, probably there is some noise in the benchmarks)
I implemented Marks idea (or my interpretation of it). When compiling

def f(d):
    match d:
        case {"a": a, "b": b}:
            return a + b
        case _:
            return -1

we can optimize

MATCH_MAPPING
POP_JUMP_IF_FALSE   -> L2
GET_LEN; LOAD_SMALL_INT 2; COMPARE_OP >=; POP_JUMP_IF_FALSE -> L2
LOAD_CONST 1 (('a', 'b'))      ← the keys as a constant tuple
MATCH_KEYS                     ← one C call: builds a *values tuple* (or None)
COPY 1; POP_JUMP_IF_NONE -> L1 ← "did the whole thing match?"
UNPACK_SEQUENCE 2              ← explode the values tuple back onto the stack
STORE_FAST_STORE_FAST (a, b)
POP_TOP; POP_TOP              ← pop keys-tuple + subject

to

MATCH_MAPPING
POP_JUMP_IF_FALSE   -> L3
GET_LEN; LOAD_SMALL_INT 2; COMPARE_OP >=; POP_JUMP_IF_FALSE -> L3
COPY 1; LOAD_CONST 1 ('b'); MATCH_KEY; POP_JUMP_IF_FALSE -> L2   ← key 'b' → value on stack
COPY 2; LOAD_CONST 2 ('a'); MATCH_KEY; POP_JUMP_IF_FALSE -> L1   ← key 'a' → value on stack
STORE_FAST_STORE_FAST (a, b)
POP_TOP                       ← pop subject only

This requires a new tier1 opcode (the MATCH_KEY), but skips the LOAD_CONST, UNPACK_SEQUENCE. This approach is fast for few keys (factor 1.9x), but scales not so good for dozens of keys (but I suspect that case is rare). Branch eendebakpt#63

@markshannon @da-woods Any opinion on which way to go, and on whether this is worth the changes?

da-woods · 2026-06-01T18:54:42Z

@eendebakpt It sounds like you have up-to-date rebased versions of this + some other worthwhile optimizations. So even if we pick this up we should probably do it from your branch and close this one.

I don't have a view on MATCH_KEY vs MATCH_KEYS.

My personal view is the same as it was 3 years ago - that there are some fairly cheap optimizations here so it's worth doing something. But I'm not sure how much weight that opinion has.

Create fast version of match_keys for exact dict

128517b

When the type is an exact dict, there's a real speed-up to be gained by skipping the call to "get" and using PyDict_GetItemWithError instead.

da-woods requested a review from markshannon as a code owner June 12, 2022 15:29

bedevere-bot added the awaiting review label Jun 12, 2022

📜🤖 Added by blurb_it.

0996d52

da-woods mentioned this pull request Jun 12, 2022

Specialized match_keys for exact dictionary type #93714

Open

AA-Turner added performance Performance or resource usage interpreter-core (Objects, Python, Grammar, and Parser dirs) labels Jun 12, 2022

iritkatriel requested a review from brandtbucher September 7, 2022 09:40

github-actions Bot added the stale Stale PR or inactive for long period of time. label Apr 11, 2026

github-actions Bot removed the stale Stale PR or inactive for long period of time. label Jun 5, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

gh-93714 Create fast version of match_keys for exact dict#93752

gh-93714 Create fast version of match_keys for exact dict#93752
da-woods wants to merge 2 commits into
python:mainfrom
da-woods:fast-match-keys

da-woods commented Jun 12, 2022

Uh oh!

sweeneyde commented Jun 13, 2022

Uh oh!

da-woods commented Jun 13, 2022 •

edited

Loading

Uh oh!

da-woods commented Jun 13, 2022

Uh oh!

markshannon commented Jun 13, 2022

Uh oh!

da-woods commented Jun 14, 2022

Uh oh!

markshannon commented Jun 15, 2022

Uh oh!

github-actions Bot commented Apr 11, 2026

Uh oh!

eendebakpt commented Jun 1, 2026

Uh oh!

da-woods commented Jun 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Uh oh!

Conversation

da-woods commented Jun 12, 2022

Uh oh!

sweeneyde commented Jun 13, 2022

Uh oh!

da-woods commented Jun 13, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

da-woods commented Jun 13, 2022

Uh oh!

markshannon commented Jun 13, 2022

Uh oh!

da-woods commented Jun 14, 2022

Uh oh!

markshannon commented Jun 15, 2022

Uh oh!

github-actions Bot commented Apr 11, 2026

Uh oh!

eendebakpt commented Jun 1, 2026

Uh oh!

da-woods commented Jun 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

da-woods commented Jun 13, 2022 •

edited

Loading