GPT-5.4 & fast mode live #552
-
Working great! I have a couple of issues I want to bring up; I'll file issues or PRs tomorrow.

First, TUI selection for compaction. I believe there are config values you can set for this, but 5.4 does really well even with larger context use; I was testing close to the 1M limit.

Second, there's an issue where large console output from a process Every Code is watching causes the session to become basically unresponsive. Just a few minutes ago I had a session I really wanted to recover, which is how we found the problem.

Anyway, 100% on gpt-5.4 and your choice to hide the other models. I'll write something up tomorrow for these, unless you fix them before I have time, as you usually do.

GUI/remote view is coming along. I've had to trash it and start over a few times. I did have it fully working at one point with a decent UI, and I really like it. It's so nice to be able to step away and answer quick messages or questions, or just check on progress.
-
@cbusillo Yeah, good ideas! I'm adding the 1M compaction toggle. I'm also adding a "1M Auto" model (probably the new default) which will automatically pick the best time to compact between 100k and 1M. Basically, at the start of each turn it'll decide, based on the turn (new user input plus prior convo), whether the turn will use most of the prior context and/or run out of context before completing, while also keeping in mind the 2x cost above 290k. This will allow sessions to go above 290k when needed, but avoid it most of the time. I've had quite a few sessions keep looping on compaction on difficult tasks, so having the ability to automatically expand into 1M when useful should help.
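The turn-level decision described above could look roughly like this sketch. All names and thresholds here are hypothetical illustrations of the heuristic, not Code's actual internals:

```python
# Hypothetical sketch of a "1M Auto" turn-level compaction decision.
# Names, thresholds, and the estimate are illustrative only.

BASE_WINDOW = 100_000   # default working context
PRICE_STEP = 290_000    # tokens above this are billed at ~2x
MAX_WINDOW = 1_000_000  # hard 1M limit

def estimate_turn_tokens(new_input_tokens: int, prior_convo_tokens: int) -> int:
    """Rough guess at how many tokens the coming turn will touch."""
    # Assume the model re-reads the prior conversation plus the new input,
    # and produces output on the order of the new input.
    return prior_convo_tokens + 2 * new_input_tokens

def choose_window(prior_convo_tokens: int, new_input_tokens: int) -> tuple[int, bool]:
    """Return (context window to use, whether to compact first)."""
    needed = estimate_turn_tokens(new_input_tokens, prior_convo_tokens)
    if needed <= BASE_WINDOW:
        return BASE_WINDOW, False   # plenty of room, no action
    if needed <= PRICE_STEP:
        return PRICE_STEP, False    # grow, but stay under the 2x price step
    # The turn would cross into 2x pricing: compact if shrinking the prior
    # convo can plausibly keep us under the step, otherwise go to 1M.
    if prior_convo_tokens > needed - PRICE_STEP:
        return PRICE_STEP, True
    return MAX_WINDOW, False
```

The point of the sketch is just the shape of the tradeoff: prefer staying under the price step, compact when that's enough, and only pay for the large window when a compaction wouldn't save the turn.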
-
Are you aware of the memory leaking into assistant messages? I assume you want those cleaned up somehow? Or do you not use memory? I sometimes love new features just because they're new, and tend to turn everything on. I did turn off fast mode; I already burn through two Pro accounts' weekly limits in 4-6 days.
-
I've cleaned up the /model list as well, to show just GPT-5.4, GPT-5.3-Codex, and GPT-5.3-Codex-Spark.
You can still launch Code with the model param (e.g. -m gpt-5.2) to access earlier models.
Open to feedback if you think this is too early!