English language output is default, with no indication on how to change it #5

FredrikKarlssonSpeech · 2024-06-15T08:56:11Z

The transcription system works and the output is fine content-wise, but the language is always English.
Unless I am missing some way of forcing the output language, I suspect that I am not alone in assuming that if I give this command:

Whisper.transcribe(
    "/Users/frkkan96/Desktop/kaa_yw_pb_16000.flac",
    "/Users/frkkan96/Desktop/kaa_yw_pb_16000.srt";
    model_name="large-v3", language="swedish", dev=cpu, precision=f32)

I would get an .srt file with the Swedish language transcription in it?

pxl-th · 2024-06-16T22:28:09Z

Yes, by default the model transcribes to English, and if the audio is non-English, it also translates it to English.

To actually transcribe to a non-English language, you have to specify the language keyword argument.

To do this automatically, we'd have to run language detection first, before transcribing.
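
A minimal sketch of the "detect first, then transcribe" idea, assuming the convention of the reference OpenAI Whisper decoder, which is conditioned on a start token, a language token and a task token. Whether Whisper.jl builds its prompt this way is an assumption. `LANGUAGE_CODES`, `pick_language` and `decoder_prefix` are hypothetical names used only for illustration and are not part of Whisper.jl, and the probabilities in the usage lines are made up.

```julia
# Hypothetical helpers, NOT part of Whisper.jl. They only illustrate the
# special-token prefix used by the reference OpenAI Whisper decoder.

# Small illustrative subset of language name => language code.
const LANGUAGE_CODES = Dict("english" => "en", "swedish" => "sv", "german" => "de")

# Pick the most probable language code from per-language probabilities,
# e.g. as produced by a language detection pass over the audio.
function pick_language(probs::Dict{String,Float64})
    isempty(probs) && throw(ArgumentError("no language probabilities given"))
    best_code, best_p = "", -Inf
    for (code, p) in probs
        if p > best_p
            best_code, best_p = code, p
        end
    end
    return best_code
end

# Build the decoder prefix. `task` is :transcribe (keep the source language)
# or :translate (output English).
function decoder_prefix(code::String; task::Symbol=:transcribe)
    task in (:transcribe, :translate) || throw(ArgumentError("unknown task: $(task)"))
    code in values(LANGUAGE_CODES) || throw(ArgumentError("unsupported language code: $(code)"))
    return ["<|startoftranscript|>", "<|$(code)|>", "<|$(task)|>"]
end

# Explicit language, as with language="swedish":
prefix_explicit = decoder_prefix(LANGUAGE_CODES["swedish"])

# Automatic: detect first (made-up probabilities), then transcribe.
detected = pick_language(Dict("en" => 0.05, "sv" => 0.90, "de" => 0.05))
prefix_auto = decoder_prefix(detected)
```

Under these assumptions, Swedish text in the output requires both the `<|sv|>` language token and the `<|transcribe|>` task token. With `<|translate|>`, the result would be English regardless of the language given.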

FredrikKarlssonSpeech · 2024-06-20T12:06:38Z

> To actually transcribe to a non-English language, you have to specify the language keyword argument.

Sure, and I did (see the original post). When you transcribe audio that is, in my case, Swedish, you usually also want the output as Swedish text, not English text in the SRT, which is what I got.

FredrikKarlssonSpeech mentioned this issue Jun 15, 2024

English #6

Open
