Add support for open source models #38

bodhish · 2024-10-22T04:05:53Z

The current version uses OPEN AI models to transcribe the audio and OAI LLM to transform the text, it would be great if we can add options to use open source LLM's and transcription models.

selamanse · 2024-10-31T11:25:58Z

@bodhish I'd like to try this if it is still of interest. can you assign me please?

selamanse · 2024-10-31T11:30:55Z

also some questions:

did you have any specific llms in mind that you already now could be suitable?
how do I test your service, do you have integration tests?
I've roughly looked into he code and can see that the transctriptions controller is taking care of the transcription to form by using fill_form function of the openai api. can you point out other AI interaction relevant sections in the code?

bodhish · 2024-11-09T09:08:34Z

Hi @selamanse! I was travelling, sorry for the late replay.

For LLM recommendations:
- Transcription: WhisperX/Faster-Whisper/Self-Hosted Whisper
- Form Processing: Llama/Mistral
The project doesn't have a proper automated testing workflow, I will add a few for the services in the coming week.
The main AI interactions happen in:

OpenaiHelper module (currently handles both transcription and completion)
transcriptions_controller.rb (workflow orchestration)

I'd suggest extracting our AI interactions into a provider-based pattern that would allow easy swapping of models based on environment configuration that will let users pick a model for transcription and another for completion.

bodhish added the hacktoberfest label Oct 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for open source models #38

Add support for open source models #38

bodhish commented Oct 22, 2024

selamanse commented Oct 31, 2024

selamanse commented Oct 31, 2024 •

edited

Loading

bodhish commented Nov 9, 2024 •

edited

Loading

Add support for open source models #38

Add support for open source models #38

Comments

bodhish commented Oct 22, 2024

selamanse commented Oct 31, 2024

selamanse commented Oct 31, 2024 • edited Loading

bodhish commented Nov 9, 2024 • edited Loading

selamanse commented Oct 31, 2024 •

edited

Loading

bodhish commented Nov 9, 2024 •

edited

Loading