Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for open source models #38

Open
bodhish opened this issue Oct 22, 2024 · 3 comments
Open

Add support for open source models #38

bodhish opened this issue Oct 22, 2024 · 3 comments

Comments

@bodhish
Copy link
Member

bodhish commented Oct 22, 2024

The current version uses OPEN AI models to transcribe the audio and OAI LLM to transform the text, it would be great if we can add options to use open source LLM's and transcription models.

@selamanse
Copy link

@bodhish I'd like to try this if it is still of interest. can you assign me please?

@selamanse
Copy link

selamanse commented Oct 31, 2024

also some questions:

  • did you have any specific llms in mind that you already now could be suitable?
  • how do I test your service, do you have integration tests?
  • I've roughly looked into he code and can see that the transctriptions controller is taking care of the transcription to form by using fill_form function of the openai api. can you point out other AI interaction relevant sections in the code?

@bodhish
Copy link
Member Author

bodhish commented Nov 9, 2024

Hi @selamanse! I was travelling, sorry for the late replay.

  1. For LLM recommendations:

    • Transcription: WhisperX/Faster-Whisper/Self-Hosted Whisper
    • Form Processing: Llama/Mistral
  2. The project doesn't have a proper automated testing workflow, I will add a few for the services in the coming week.

  3. The main AI interactions happen in:

I'd suggest extracting our AI interactions into a provider-based pattern that would allow easy swapping of models based on environment configuration that will let users pick a model for transcription and another for completion.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants