English language output is default, with no indication on how to change it #5

Open
FredrikKarlssonSpeech opened this issue Jun 15, 2024 · 2 comments

@FredrikKarlssonSpeech

The transcription works and the content of the output is fine, but the output language is always English.
Unless I am missing some way of forcing the output language, I suspect I am not alone in assuming that if I run this command

Whisper.transcribe(
           "/Users/frkkan96/Desktop/kaa_yw_pb_16000.flac", "/Users/frkkan96/Desktop/kaa_yw_pb_16000.srt";
           model_name="large-v3", language="swedish", dev=cpu, precision=f32)

I would get an .srt file containing the Swedish transcription?

@pxl-th
Member

pxl-th commented Jun 16, 2024

Yes, by default the model transcribes to English, and if the audio is non-English, it also translates it to English.

To actually transcribe to a non-English language, you have to specify the language keyword argument.

To do this automatically, we'd have to run language detection first, before transcribing.
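As a rough illustration of the "detect first, then transcribe" idea, here is a minimal sketch. The detection step and the transcription step are passed in as functions, because this thread does not show a language-detection API in the package; `transcribe_auto`, `fake_detect` and `fake_transcribe` are hypothetical names used only for this example.

```julia
# Sketch only: `detect_language` and `transcribe` are supplied by the caller.
# Neither is claimed to be an existing function of the package.
function transcribe_auto(detect_language, transcribe,
                         input::AbstractString, output::AbstractString;
                         language::Union{Nothing,AbstractString}=nothing,
                         kwargs...)
    # An explicit language wins; otherwise run detection before transcribing.
    lang = isnothing(language) ? detect_language(input) : language
    # Forward the chosen language together with any other keyword arguments.
    transcribe(input, output; language=lang, kwargs...)
    return lang
end

# Hypothetical stand-ins so the sketch runs without loading a model.
fake_detect(path) = "swedish"
fake_transcribe(input, output; language, kwargs...) = nothing

# No language given, so the (fake) detector decides.
transcribe_auto(fake_detect, fake_transcribe, "in.flac", "out.srt")
```

In real use, the stand-ins would be replaced by whatever detection routine is available and by a call that forwards to the actual transcription function.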

@FredrikKarlssonSpeech
Author

> To actually transcribe to a non-English language, you have to specify the language keyword argument.

Sure, and I did (see the original post). When you transcribe audio, Swedish in my case, you usually also want the output text in Swedish, not the English text that I got in the SRT file.
