Block Matrix Linear Operator by corwinjoy · Pull Request #67 · cornellius-gp/linear_operator

corwinjoy · 2023-06-05T21:43:32Z

Idea

Represent [TN, TM] tensors by TxT blocks of NxM lazy tensors. While block matrices are currently supported, the efficient representation is only when there is a diagonal structure over the T dimensions.

Pitch

Add a block linear operator class that can keep track of the [T, T] block structure, represented as T^2 lazy tensors of the same shape. Implement matrix multiplication between block matrices as the appropriate linear operators on the blocks.

Previous Discussion

Issue #54

Additional Considerations

In pursuing this, it seems that the base test class checks for many operations beyond what is required to create a LinearOperator. I propose a refactoring of the test class into required / core operations and optional operations. For now, I have created a new core test class CoreLinearOperatorTestCase and have shown what has been excluded by commenting out the relevant code. This idea could also use a review for accuracy.

Co-authored-by: Danny Friar <dannyfriar@hotmail.co.uk>

…make it clear what we are skipping.

corwinjoy · 2023-06-05T21:51:05Z

@Balandat After more discussion with @hughsalimbeni and @dannyfriar here is our initial proposal to expand the library as per the previous discussion.
We think it represents a good way to generalize the operator structure to capture nested operations.
We also look forward to any suggestions you may have.
Also tagging: @gpleiss, @jacobrgardner

linear_operator/test/linear_operator_core_test_case.py

corwinjoy · 2023-07-12T10:06:53Z

Hi, I just wanted to check in and see if you had time to take a look at our proposal and if you had any thoughts? Thanks again for the feedback so far!
@Balandat @gpleiss

gpleiss · 2023-07-15T01:21:50Z

Sorry for the delay @corwinjoy - this PR is on my todo list for Monday.

linear_operator/operators/block_matrix_linear_operator.py

linear_operator/test/linear_operator_core_test_case.py

…re advanced tests. This allows us to create and test operators that only support core operations.

…or_lo

corwinjoy · 2023-07-19T22:44:14Z

@gpleiss Thanks for all the great feedback! I believe I have addressed all these and would appreciate a second look when you have time.

linear_operator/operators/block_matrix_linear_operator.py

gpleiss · 2023-07-27T19:27:13Z

@corwinjoy after playing around with this PR some more, and dealing with the typeguard errors, I made 2 commits that change around some of the internals. The first is hopefully not to controversial, the second one might be :)

Rearranging the logic of _matmul. The purpose of the _matmul function is defining a matrix-vector product that can be used by iterative algorithms (e.g. CG), so - upon further reflection - I think that _matmul should only accept Tensor rhs, and it should also output a Tensor as well.

I rearranged some of the logic, in what I hope won't affect anything that you've already written.

The previous base case of _matmul now lives in a private function called _matmul_two_block_matrix_linear_operators, which takes in two BlockMatrixLOs and outputs a BlockMatrixLO
The public matmul function now has logic to call this new private function if other is an appropriate BlockMatrixLO. Otherwise it calls the _matmul function.

Changing the constructor. (slightly more controversial). There were some typing issues with the constructor, since it was taking in a list of lists of linear operators. The arguments to all other LinearOperators are expected to be linear operators or Tensors. (I noticed that you had to overwrite the representation function to accommodate this). There are unfortunately a lot of (necessary) hacks throughout LinearOperator that do expect the arguments to be LinearOperators and Tensors, and I'm a bit nervous that there could be some unintended bugs by not adhering to this pattern. (Historically, these bugs emerge as gradient issues or type-casting issues.)

My proposed solution: the user passes in a flattened list of T^2 linear operators into the constructor. It's a little more unintuitive for the user (passing in a list of lists is more natural, I agree), but it actually made some of the code quite a bit simpler.

Anyways, both commits are now part of this branch. Let me know your thoughts, and we can always revert and figure out different solutions.

corwinjoy · 2023-07-28T05:30:40Z

@gpleiss Thanks for the detailed review l. It all sounds good except for possibly the constructor which I need to think about. I am traveling right now but should be able to take a detailed look by Wednesday. @dannyfriar any thoughts? @hughsalimbeni

dannyfriar · 2023-07-31T13:47:48Z

@gpleiss Thanks for the detailed review l. It all sounds good except for possibly the constructor which I need to think about. I am traveling right now but should be able to take a detailed look by Wednesday. @dannyfriar any thoughts? @hughsalimbeni

I also think that passing in a list of lists is more natural. One thing we'd like to do with this is have the ability to rotate the blocks - this is simpler with the list of lists but I'm sure it can be made to work with a flat list too. @gpleiss out of curiosity what are the hacks that you mentioned that require LOs/tensors?

corwinjoy · 2023-08-01T22:03:05Z

@gpleiss Thanks for the code changes! Looking at these, they make sense. I like the better matmul function. The flattened list of operators for the constructor is not my first choice, but if that is what needs to happen for compatibility I can live with it. We do have a from_tensor helper constructor already and if we can add a nested list helper later if we find out we need it. Like @dannyfriar I am a bit curious about the hacks that require this format if you do have time to explain. At any rate, I am happy with the changes as they are.

CarloGraziani · 2025-02-20T18:12:26Z

This looks pretty close to something that I also need: a block-Toeplitz linear operator, which arises in multi-output GPs with kernels that are stationary over their 1-D input space. This is relevant to LLM work, where the input space is sequence location and the output space is token categorical probability.

If I were to simply replicate LazyTensor blocks as required in the list of N**2 blocks to make a block-Toeplitz structure, would that work, or would it break something (like autograd treating the blocks as independent variables when they are not)?

Corwin Joy and others added 15 commits May 30, 2023 15:17

Rebase vs. main

e2e32f1

Fix block tensor type signatures

15effdf

Get simple test running for BlockTensor

4055a7f

Add simple property implementations

2cf2db0

Upgrade linear operator to add block / sparse test

61d9b8a

Add and document core test cases

bbc12ea

Cleanup dead comments

bfc843a

Update linear_operator/operators/block_tensor_linear_operator.py

516b369

Co-authored-by: Danny Friar <dannyfriar@hotmail.co.uk>

Update linear_operator/operators/block_tensor_linear_operator.py

9d12d7c

Co-authored-by: Danny Friar <dannyfriar@hotmail.co.uk>

Improve construction tests and types

56809b2

Rename class to MatrixLinearOperator

2f9b4b7

Add parts omitted from base test case. Show them as commented out to …

7cbbc54

…make it clear what we are skipping.

Rename class to BlockMatrixLinearOperator

32e9a52

Fix type signature

30ad0ed

Improve comments

889ce0f

corwinjoy commented Jun 5, 2023

View reviewed changes

linear_operator/test/linear_operator_core_test_case.py Show resolved Hide resolved

Merge branch 'main' into block_tensor_lo

d15f368

gpleiss reviewed Jul 17, 2023

View reviewed changes

Corwin Joy added 3 commits July 18, 2023 16:34

Refactor linear_operator_test_case.py into a set of core tests and mo…

c557aa3

…re advanced tests. This allows us to create and test operators that only support core operations.

Merge remote-tracking branch 'origin/block_tensor_lo' into block_tens…

d2cb1cc

…or_lo

Incorporate review suggestions from Geoff Pleiss.

67b8fe9

gpleiss reviewed Jul 27, 2023

View reviewed changes

linear_operator/operators/block_matrix_linear_operator.py Show resolved Hide resolved

Add comment explaining matmul override.

75b565a

gpleiss approved these changes Jul 27, 2023

View reviewed changes

gpleiss and others added 2 commits July 27, 2023 17:48

Add jaxtyping requirement for conda

ff0b6a2

Merge branch 'main' into block_tensor_lo

ffc3116

gpleiss added 4 commits July 27, 2023 17:55

Fix linter

58e8686

Hopefully fix weird CI errors

7f803e3

Refactor BlockMatrixLO._matmul to better adhere to type signatures

c7d094d

BlockMatrixLO takes in a flattened represetation

8d29dd7

Conversation

corwinjoy commented Jun 5, 2023

Idea

Pitch

Previous Discussion

Additional Considerations

Uh oh!

corwinjoy commented Jun 5, 2023

Uh oh!

Uh oh!

corwinjoy commented Jul 12, 2023

Uh oh!

gpleiss commented Jul 15, 2023

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

corwinjoy commented Jul 19, 2023

Uh oh!

Uh oh!

gpleiss commented Jul 27, 2023

Uh oh!

corwinjoy commented Jul 28, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dannyfriar commented Jul 31, 2023

Uh oh!

corwinjoy commented Aug 1, 2023

Uh oh!

CarloGraziani commented Feb 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

corwinjoy commented Jul 28, 2023 •

edited

Loading