Fix OpenAI-compatible STT for "Speech to text selected lines" by dkakaie · Pull Request #11291 · SubtitleEdit/subtitleedit

dkakaie · 2026-05-31T09:32:52Z

Two fixes to the OpenAI-compatible STT engine when transcribing selected lines:

Empty transcription text: the result was matched back to the audio clip by the transcoded temp file name instead of the original clip's file name, so it never attached and the selected line came back empty. It is now matched by `_videoFileName`, like the whisper engines.
Multi-segment results not split: a selected line whose audio the engine split into several segments was only split into multiple subtitle lines when the line was longer than 10 s; otherwise the segments were concatenated into one line. Removed the 10 s gate so any multi-segment result splits into one line per segment.
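
The first fix can be illustrated with a small, language-neutral sketch. This is not the SubtitleEdit code: `Clip`, `fake_transcribe`, and `attach_results` are hypothetical names, and the file names are made up. It only shows why a result recorded under the temp upload name is never found by a lookup that uses the original clip name.

```python
from dataclasses import dataclass


@dataclass
class Clip:
    """One selected line's extracted audio (hypothetical type for illustration)."""
    video_file_name: str    # original clip name, the key the caller knows
    upload_file_name: str   # transcoded temp file sent to the STT endpoint
    text: str = ""


def fake_transcribe(clip, key_by_original):
    """Stand-in for the STT call: returns (name the result is recorded under, text)."""
    name = clip.video_file_name if key_by_original else clip.upload_file_name
    return name, f"text for {clip.video_file_name}"


def attach_results(clips, key_by_original):
    """Attach each transcription to its clip, looking results up by original name."""
    results = dict(fake_transcribe(c, key_by_original) for c in clips)
    for clip in clips:
        # A result recorded under the temp name is missed here, leaving the text empty.
        clip.text = results.get(clip.video_file_name, "")
    return [c.text for c in clips]


clips = [Clip("line1.wav", "tmp_a.mp3")]
print(attach_results(clips, key_by_original=False))  # lookup misses
print(attach_results(clips, key_by_original=True))   # lookup hits
```

With `key_by_original=False` the selected line comes back empty, which mirrors the reported symptom. With `key_by_original=True` the text attaches.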

Copilot

Pull request overview

This PR fixes OpenAI-compatible speech-to-text behavior when transcribing selected subtitle lines, ensuring transcriptions attach to the correct extracted clip and multi-segment results become multiple subtitle lines.

Changes:

Matches OpenAI-compatible STT results back to the original clip filename instead of the transcoded temporary upload file.
Splits selected-line transcription output whenever multiple paragraphs/segments are returned, rather than only for lines longer than 10 seconds.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated no comments.

File	Description
`src/ui/Features/Video/SpeechToText/SpeechToTextViewModel.cs`	Updates OpenAI-compatible transcription result assignment to use `_videoFileName`, aligning with other STT paths.
`src/ui/Features/Main/MainViewModel.cs`	Removes the duration gate so multi-paragraph transcription results replace the selected line with multiple timed lines.

niksedk · 2026-06-01T07:35:56Z

Thanks for the PR! Just a quick heads up regarding the logic here:

The condition `selectedLine.Duration.TotalSeconds > 10` is used to switch the logic from single line mode to one-selection-to-many-lines mode. Your changes might break this distinction I think...

dkakaie · 2026-06-01T07:53:29Z

Thanks for the reply. What user scenario was the `Duration.TotalSeconds > 10` check originally intended to handle? Was it mainly to avoid creating many short subtitle segments from a single selection?

My thinking was that modern ASR models are generally quite good at segmentation and timestamping. When a transcription contains multiple timestamped segments, concatenating them back into a single subtitle line, especially for longer clips, can discard some of that structure.

If the 10-second threshold was added as a UX choice rather than due to limitations of the transcription engine, perhaps we could consider relying on the model's segmentation directly and remove the single line mode branching. Another option would be to make the threshold configurable, although that may be unnecessary if the segmentation quality is consistently good.
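
To make the options concrete, here is a rough sketch of the branching under discussion. It is not taken from `MainViewModel.cs`: `Segment`, `lines_for_selection`, and `min_split_seconds` are hypothetical names, and it assumes segment timestamps are relative to the start of the extracted clip. Passing `min_split_seconds=10` reproduces the existing gate, and passing `None` lets the model's segmentation drive the split.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    start: float  # seconds
    end: float    # seconds
    text: str


def lines_for_selection(line_start, line_end, segments, min_split_seconds=None):
    """Return the subtitle lines that should replace one selected line."""
    if not segments:
        return []
    duration = line_end - line_start
    # None means "no gate": any multi-segment result is split.
    gate_open = min_split_seconds is None or duration > min_split_seconds
    if len(segments) > 1 and gate_open:
        # Assumption: segment times are clip-relative, so shift them onto the timeline.
        return [
            Segment(line_start + s.start, line_start + s.end, s.text.strip())
            for s in segments
        ]
    # Single line mode: keep the original timing and concatenate the text.
    merged = " ".join(s.text.strip() for s in segments)
    return [Segment(line_start, line_end, merged)]


segs = [Segment(0.0, 2.0, "Hello."), Segment(2.5, 4.0, "How are you?")]
gated = lines_for_selection(10.0, 14.0, segs, min_split_seconds=10)
ungated = lines_for_selection(10.0, 14.0, segs)
print(len(gated), len(ungated))
```

For this 4-second selection the gated call returns one merged line and the ungated call returns two timed lines, which is the behavioral difference the threshold controls.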

I'm interested in the original rationale and would be curious to hear your thoughts on whether the model's segmentation should drive subtitle splitting here.

dkakaie force-pushed the main branch from 5484f5a to b73ac05 on May 31, 2026 09:43

niksedk requested a review from Copilot May 31, 2026 15:04

Copilot started reviewing on behalf of niksedk May 31, 2026 15:04

Copilot AI reviewed May 31, 2026


dkakaie added 2 commits June 1, 2026 10:37

fix empty transcription text

189e4fd

fix multi-segment results not split regardless of source duration

635d649

dkakaie force-pushed the main branch from b73ac05 to 635d649 on June 1, 2026 07:09

dkakaie wants to merge 2 commits into SubtitleEdit:main from dkakaie:main
