In our Application We Are utilizing Retrieval Augmented Generation (RAG) for querying and interacting with your data,
either locally or deployed via cloud.
In every question we give you the option to see The context That was Build from the DB(Faiss) with The original text orgenized with the Priority From the
closest text to the farthest, mark with #
We also Added the Distance Strategy option of the DB we use (cosine is the default)
The Gui is written with Streamlit
Streamlit lets you transform Python scripts into interactive web apps in minutes, instead of weeks. Build dashboards, generate reports, or create chat apps. Once you’ve created an app, you can use Community Cloud platform to deploy, manage, and share your app.
- for Dicta - download the dictalm2.0-instruct.Q4_K_M.gguf version
- for Aya - download the aya-23-8B-Q5_K_M.gguf version
- for ChatGpt we Implement 'gpt-4o'
you need to create a .env file with the Following Parameters:
Environment Variable | Value | Description |
---|---|---|
OPENAI_API_KEY | open a count in OPENAI and get your key openAI | api key for ChatGpt |
EMBEDDING_MODEL | the local path for Embedding Model | embedding model for the Faiss DB Embedding Function |
MODEL_1 | the local path for dictalm2.0-instruct.Q4_K_M.gguf | Dicta LLM with GGUF format of Tensor size Q4_K_M |
MODEL_2 | the local path for aya-23-8B-Q5_K_M.gguf | Aya LLM with GGUF format of Tensor size Q5_K_M |
- clone [email protected]:Eliyahou/HebrewChatBot.git
- cd HebrewChatBot
- conda create -n HebrewChatBot python=3.12.4 anaconda
- conda activate HebrewChatBot
- code .
- Install dependencies with
pip install -r requirements.txt
- create folder name files and put there all your files you want to investigate
download Visual Studio Installar
You need to install the desktop c++ block with visual studio to get cmake properly installed.Open the Visual Studio Installer and click Modify, then check Desktop development with C++ and click Modify to start the install.
pip install llama-cpp-python
conda install pytorch torchvision torchaudio pytorch-cuda=12.4 -c pytorch -c nvidia
Download your cuda version CUDA 12.4
Take The Files from C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.4\extras\visual_studio_integration\MSBuildExtensions
and Paste in C:\Program Files (x86)\Microsoft Visual Studio\2022\BuildTools\MSBuild\Microsoft\VC\v170\BuildCustomizations
$env:CMAKE_ARGS = "-DGGML_BLAS=ON -DGGML_BLAS_VENDOR=OpenBLAS"
$env:CUDATOOLKITDIR="C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.4"
pip install --force-reinstall --no-cache-dir llama-cpp-python
pip install numpy== 1.26.4
streamlit run rag/app.py