sql: post repr types: Rename `output_type()` to `output_sql_type()` on all scalar funcs by mgree · Pull Request #35219 · MaterializeInc/materialize

mgree · 2026-02-25T19:52:20Z

Motivation

Cleanup stacked on #35084.

Description

The various scalar functions compute their SQL output types given SQL input types. This PR renames `output_type()` to `output_sql_type()` to make that explicit, and adds a shim that converts repr types up to canonical SQL types, computes the SQL output type, and then converts the resulting SQL type back to a repr type.
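The shim described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the code in this PR: `SqlScalarType` and `ReprScalarType` are cut down to a few variants, `Lower` is a hypothetical scalar function, and the shim's name (`output_type`) is assumed.

```rust
// Hypothetical, simplified stand-ins for the SQL-level and repr-level scalar types.
#[derive(Debug, Clone, PartialEq, Eq)]
enum SqlScalarType {
    String,
    VarChar { max_length: Option<u32> },
    Int32,
}

#[derive(Debug, Clone, PartialEq, Eq)]
enum ReprScalarType {
    String,
    Int32,
}

impl<'a> From<&'a SqlScalarType> for ReprScalarType {
    // Lowering forgets SQL-only detail: String and VarChar share one representation.
    fn from(t: &'a SqlScalarType) -> Self {
        match t {
            SqlScalarType::String | SqlScalarType::VarChar { .. } => ReprScalarType::String,
            SqlScalarType::Int32 => ReprScalarType::Int32,
        }
    }
}

impl SqlScalarType {
    // Lifting picks one canonical SQL type per repr type.
    fn from_repr(t: &ReprScalarType) -> Self {
        match t {
            ReprScalarType::String => SqlScalarType::String,
            ReprScalarType::Int32 => SqlScalarType::Int32,
        }
    }
}

// Hypothetical scalar function whose output type equals its input type.
struct Lower;

impl Lower {
    // SQL types in, SQL type out (the renamed method).
    fn output_sql_type(&self, inputs: &[SqlScalarType]) -> SqlScalarType {
        inputs[0].clone()
    }

    // Assumed shim: lift repr inputs to canonical SQL types, reuse the SQL-level
    // logic, then lower the resulting SQL type back to a repr type.
    fn output_type(&self, inputs: &[ReprScalarType]) -> ReprScalarType {
        let sql_inputs: Vec<SqlScalarType> =
            inputs.iter().map(SqlScalarType::from_repr).collect();
        ReprScalarType::from(&self.output_sql_type(&sql_inputs))
    }
}
```

With this shape, each function keeps a single source of truth for its typing logic (`output_sql_type`), and repr-level callers never see SQL-only distinctions such as `VarChar` versus `String`.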

Verification

All the tests.

These fire when repr-type canonicalization does not fully prevent type mismatches, providing CI visibility without crashing production.

- scalar.rs: base_eq_or_repr_eq_for_assertion panic -> soft_panic_or_log
- context.rs: arrangement key fallback panic -> soft_panic_or_log
- relation.rs: try_union trace -> soft_panic_or_log
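The "panic in CI, log in production" behavior can be sketched as below. This is a hypothetical stand-in written as a plain function, not the actual `soft_panic_or_log` implementation: the `SOFT_ASSERTIONS` switch and the `check_types_match` helper are assumptions made for illustration.

```rust
use std::sync::atomic::{AtomicBool, Ordering};

// Hypothetical global switch: enabled in CI and tests, disabled in production.
static SOFT_ASSERTIONS: AtomicBool = AtomicBool::new(false);

// Panics when soft assertions are enabled; otherwise logs an error and continues.
// Returns the logged line so callers can observe what was reported.
fn soft_panic_or_log(msg: &str) -> String {
    if SOFT_ASSERTIONS.load(Ordering::Relaxed) {
        panic!("{}", msg);
    }
    let line = format!("ERROR: {}", msg);
    eprintln!("{}", line);
    line
}

// Hypothetical call site: report a type mismatch without taking the process down
// in production, and let the caller fall back to a safe path.
fn check_types_match(expected: &str, actual: &str) -> bool {
    if expected != actual {
        soft_panic_or_log(&format!(
            "type mismatch: expected {}, got {}",
            expected, actual
        ));
        return false;
    }
    true
}
```

The point of the pattern is that the mismatch becomes a hard failure where it is cheap to fail (CI) and a logged, recoverable event where it is not.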

This tweaks `MirScalarExpr::typ` function case and `try_col_with_input_cols`'s TableFunc and AggregateFunc cases, to prevent function return types from reintroducing non-canonical types, e.g. CastOidToRegProc or AclExplode.
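As a rough illustration of why a function's declared return type can reintroduce a non-canonical type, and how a round trip through the repr type removes it, consider the sketch below. The type enums and the specific mapping (for example, that `RegProc` canonicalizes to `Oid`) are assumptions for illustration only, not a statement of the real mapping.

```rust
// Hypothetical, simplified SQL-level and repr-level types.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum SqlType {
    Oid,
    RegProc,
    String,
    VarChar,
}

#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum ReprType {
    UInt32,
    String,
}

// Several SQL types can share one representation.
fn to_repr(t: SqlType) -> ReprType {
    match t {
        SqlType::Oid | SqlType::RegProc => ReprType::UInt32,
        SqlType::String | SqlType::VarChar => ReprType::String,
    }
}

// Each repr type lifts to exactly one (assumed) canonical SQL type.
fn from_repr(t: ReprType) -> SqlType {
    match t {
        ReprType::UInt32 => SqlType::Oid,
        ReprType::String => SqlType::String,
    }
}

// Canonicalization is the round trip through the repr type.
fn canonicalize(t: SqlType) -> SqlType {
    from_repr(to_repr(t))
}

// A cast in the style of CastOidToRegProc declares a non-canonical SQL output type.
fn cast_oid_to_reg_proc_output_sql_type() -> SqlType {
    SqlType::RegProc
}

// What a `typ`-style method would report once return types are canonicalized.
fn canonical_output_type() -> SqlType {
    canonicalize(cast_oid_to_reg_proc_output_sql_type())
}
```

Because the round trip is idempotent, applying it at every function boundary is safe even when the input types were already canonical.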

…text

Add a ReduceContext enum to MirScalarExpr::reduce to distinguish between SQL-level callers (which need exact types like VarChar, RegProc) and optimizer callers (which should work with repr-canonical types). When context is ReduceContext::Optimizer, column types are canonicalized via repr round-trip at the start of reduce(). This ensures consistency after ReprizeSqlTypes has run.

Note: The ReduceContext approach introduced in this commit has been superseded by the reduce_repr() approach in a later commit. The ReduceContext enum and its usage have been removed.
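For reference, the superseded dispatch might have looked roughly like the sketch below. Only `ReduceContext::Optimizer` is named in the commit message; the `Sql` variant name, the `ColType` enum, and the `reduce_input_types` helper are hypothetical.

```rust
// Hypothetical, simplified column type.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum ColType {
    String,
    VarChar,
    Int32,
}

// Who is calling reduce: the `Sql` variant name is an assumption.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum ReduceContext {
    // SQL-level callers need exact types such as VarChar.
    Sql,
    // Optimizer callers should only see repr-canonical types.
    Optimizer,
}

// Stand-in for the repr round trip: VarChar collapses to String.
fn canonicalize(t: ColType) -> ColType {
    match t {
        ColType::VarChar => ColType::String,
        other => other,
    }
}

// The column types that reduction would start from in each context.
fn reduce_input_types(types: &[ColType], ctx: ReduceContext) -> Vec<ColType> {
    match ctx {
        ReduceContext::Sql => types.to_vec(),
        ReduceContext::Optimizer => types.iter().cloned().map(canonicalize).collect(),
    }
}
```

Having two entry points with distinct input types (`reduce` and `reduce_repr`) moves this choice from a runtime flag into the type signature, which is presumably part of why the later commit describes it as simpler.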

Add infrastructure:

- SqlRelationType::from_repr(&ReprRelationType) in src/repr/src/relation.rs
- MirRelationExpr::repr_typ() -> ReprRelationType in src/expr/src/relation.rs
- MirRelationExpr::repr_typ_with_input_types(&[ReprRelationType]) in src/expr/src/relation.rs

Switch the following transform files to work with ReprRelationType/ReprColumnType:

- column_knowledge.rs: optimize() now takes &[ReprColumnType], all .typ() sites produce Vec<ReprColumnType>, scalar.repr_typ() for output types.
- predicate_pushdown.rs: repr_typ() for nullable checks, extract_equal_or_both_null and helpers take &[ReprColumnType], uses repr_typ()/repr_typ_with_input_types throughout.
- fusion/filter.rs: uses input.repr_typ().column_types directly.
- join_implementation.rs: input_types is Vec<ReprRelationType> via repr_typ(), uses new_from_input_repr_types.
- redundant_join.rs: same pattern as join_implementation.rs.
- literal_constraints.rs: inp_typ switched to ReprRelationType, converts back to SqlRelationType only for MIR node construction (Constant, take_safely).

Convert back to SqlColumnType/SqlRelationType only where structurally required (reduce() calls, MIR node construction).

Add take_safely_repr(Option<ReprRelationType>) and take_safely_with_repr_col_types(Vec<ReprColumnType>) to MirRelationExpr. These are thin wrappers that convert to SQL types internally, providing a cleaner API for optimizer transforms that work with repr types natively.

Update 4 call sites:

- equivalence_propagation.rs (2): eliminate SqlColumnType::from_repr conversions
- predicate_pushdown.rs: eliminate SqlRelationType::from_repr
- literal_constraints.rs: eliminate SqlRelationType::from_repr
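The thin-wrapper pattern can be sketched as follows. Everything here is a simplified assumption: `Expr` stands in for `MirRelationExpr`, the column types carry only a scalar kind and a nullability flag, and `take_safely_with_col_types` is modeled as "replace the expression with an empty constant of the given type and return the original".

```rust
// Hypothetical, simplified scalar and column types.
#[derive(Debug, Clone, PartialEq, Eq)]
enum SqlScalar {
    String,
    Int32,
}

#[derive(Debug, Clone, PartialEq, Eq)]
enum ReprScalar {
    String,
    Int32,
}

#[derive(Debug, Clone, PartialEq, Eq)]
struct SqlColumnType {
    scalar: SqlScalar,
    nullable: bool,
}

#[derive(Debug, Clone, PartialEq, Eq)]
struct ReprColumnType {
    scalar: ReprScalar,
    nullable: bool,
}

impl SqlColumnType {
    // Lift a repr column type to its (assumed) canonical SQL column type.
    fn from_repr(r: &ReprColumnType) -> Self {
        let scalar = match r.scalar {
            ReprScalar::String => SqlScalar::String,
            ReprScalar::Int32 => SqlScalar::Int32,
        };
        SqlColumnType { scalar, nullable: r.nullable }
    }
}

// Hypothetical stand-in for a relation expression.
#[derive(Debug, PartialEq, Eq)]
enum Expr {
    Get,
    Constant { rows: Vec<i64>, typ: Vec<SqlColumnType> },
}

impl Expr {
    // SQL-typed primitive: leave an empty constant behind, return the original.
    fn take_safely_with_col_types(&mut self, typ: Vec<SqlColumnType>) -> Expr {
        std::mem::replace(self, Expr::Constant { rows: Vec::new(), typ })
    }

    // Thin wrapper: callers pass repr types, the conversion happens in one place.
    fn take_safely_with_repr_col_types(&mut self, typ: Vec<ReprColumnType>) -> Expr {
        let sql_typ: Vec<SqlColumnType> = typ.iter().map(SqlColumnType::from_repr).collect();
        self.take_safely_with_col_types(sql_typ)
    }
}
```

The benefit at call sites is that transforms already holding repr types no longer need their own `from_repr` conversions.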

Change the signature of on_unique and its helpers (on_unique_ranking_window_funcs, on_unique_window_agg) from &[SqlColumnType] to &[ReprColumnType]. Internally, convert to SqlColumnType once at the top for self.typ() calls that need SQL types. This eliminates the SqlColumnType::from_repr conversion in reduce_elision.rs, which was the only external caller.

Do the following easy switches from reduce(&[SqlColumnType]) to reduce_repr(&[ReprColumnType]):

- simplify_to_literal / simplify_to_literal_with_result: It's ok to do the constant folding in MIR, because the result is untyped.
- canonicalize.rs: 3 calls in canonicalize_equivalences, canonicalize_predicates, and replace_subexpr_and_reduce. These already had ReprColumnType available and were converting to SqlColumnType just to call reduce. The conversion is now removed entirely.
- plan_index_exprs in query.rs: Index key expressions should have repr types so optimizer code can find matching indexes.
- WebhookValidation::reduce_expression in plan.rs: Webhook validation expressions are used at runtime where repr types are the native currency.

Several transform files were storing SqlRelationType obtained via typ() (which converts stored ReprRelationType to SqlRelationType), only to convert back to ReprRelationType later. Switch these to use repr_typ() and store ReprRelationType directly, eliminating unnecessary round-trips. Files changed: projection_lifting.rs, projection_pushdown.rs, cse/anf.rs, union_cancel.rs, demand.rs, dataflow.rs.

Switch Scope struct in expr-test-util to use ReprRelationType instead of SqlRelationType. Update all insert/set/get/iter signatures accordingly. build_get no longer needs conversion, build_let uses repr_typ(), and reverse_syntax_override stores ReprRelationType directly.

Update doc comments in fusion/filter.rs and predicate_pushdown.rs to construct MirRelationExpr::Constant directly with ReprRelationType instead of using MirRelationExpr::constant() with SqlRelationType.

github-actions · 2026-02-25T19:52:31Z

Thanks for opening this PR! Here are a few tips to help make the review process smooth for everyone.

PR title guidelines

Use imperative mood: "Fix X" not "Fixed X" or "Fixes X"
Be specific: "Fix panic in catalog sync when controller restarts" not "Fix bug" or "Update catalog code"
Prefix with area if helpful: `compute:`, `storage:`, `adapter:`, `sql:`

Pre-merge checklist

The PR title is descriptive and will make sense in the git log.
This PR has adequate test coverage / QA involvement has been duly considered. (trigger-ci for additional test/nightly runs)
If this PR includes major user-facing behavior changes, I have pinged the relevant PM to schedule a changelog post.
This PR has an associated up-to-date design doc, is a design doc (template), or is sufficiently small to not require a design.
If this PR evolves an existing $T ⇔ Proto$T mapping (possibly in a backwards-incompatible way), then it is tagged with a T-proto label.
If this PR will require changes to cloud orchestration or tests, there is a companion cloud PR to account for those changes that is tagged with the release-blocker label (example).

mgree and others added 30 commits February 25, 2026 09:15

MIR transform to canonicalize SQL types by round tripping through repr types

2c48487

make arrangement resolution try to fall back; rewrite tests

a327198

have persist_fast_path_order actually look up catalog types instead of trusting `Get`s, which now have canonicalized types

10d19d2

expect that Get's ID is also imported

0ea031f

Canonicalize function return types

e2cb30c

This tweaks `MirScalarExpr::typ` function case and `try_col_with_input_cols`'s TableFunc and AggregateFunc cases, to prevent function return types from reintroducing non-canonical types, e.g. CastOidToRegProc or AclExplode.

Add ReprizeSqlTypes to constant_optimizer

c370fa9

Switch optimizer transforms to use ReprRelationType instead of SqlRelationType analysis

b631f66

Partially inspired by MaterializeInc#35073

Canonicalize type in ColumnKnowledge before creating MirScalarExpr::Literal

c138ac6

Canonicalize types in index key MIR expressions

92e0c50

Canonicalize types in export_index

92a2e6f

Simplify DatumKnowledge to use ReprScalarType, inspired by MaterializeInc#35073

5ab379b

Switch canonicalize_predicates and canonicalize_equivalences to use ReprColumnType

563ef55

Create reduce_repr instead of ReduceContext; simplifies a lot of transform code

a1133c6

Change MIR structs to repr types

9642554

Fix mzreflect glitch

c4aacdf

Eliminate SQL types from FoldConstants

7575ddb

Eliminate the repr type fallback in CollectionBundle::arrangement

9cd7d57

drop reprize, push repr types through index imports and exports

f3da11a

expr-parser: more repr types

d9158af

custom eq and ord instances, drop Repr*::base_eq

a23a5d7

canonicalize typ() methods; drop repr_canonicalize; drop SqlRelationType analysis

646e87d

Use smart constructors for MirScalarExpr::Literal

fabc87f

change MRE::constant to take a repr type

37d185e

ggevay and others added 6 commits February 25, 2026 18:27

fix lints

b272ca2

renames and cleanup

d3b1609

turn mz_repr::... into imports and unqualified uses

7fcfcde

Switch HIR lowering and anti_lookup/lookup to repr types

7a9286f

rename output_type to output_sql_type throughout

6ac684f

mgree mentioned this pull request Feb 25, 2026

Repr types: Introduce repr types in all MIR code, and then change the MIR structs #35084

Open

mgree added 5 commits February 25, 2026 15:17

fix doctests, rewrite insta snapshots, satisfy linter

e078e76

rewrite tests

b386013

cleanup usage of output_sql_type and sql_typ

3931934

drop unused DatumKnowledge sql type interactions

92b1733

stray output_sql_type

cb7e9c7

mgree wants to merge 41 commits into MaterializeInc:main from mgree:repr-type-canonicalize-xform-rename-output-type
