Fix Dataprep Ingest Data Issue. #1271
base: main
Trace:
1. An update to `langchain_huggingface.HuggingFaceEndpointEmbeddings` caused embedding vectors of the wrong size.
2. The wrongly sized vectors are saved into the Redis database as type `byte`, and the indices are not created correctly.
3. As a result, the retriever cannot retrieve data from Redis using the index.
4. RAG then appears not to work, because the uploaded file cannot be retrieved from the database.

Solution: Replace all uses of `langchain_huggingface.HuggingFaceEndpointEmbeddings` with `langchain_community.embeddings.HuggingFaceInferenceAPIEmbeddings`, and update the related READMEs and scripts.

Related issue: opea-project/GenAIExamples#1482

Signed-off-by: letonghan <[email protected]>
for more information, see https://pre-commit.ci
One more thing, I noticed that retriever is also using the |
Signed-off-by: letonghan <[email protected]>
The |
Signed-off-by: letonghan <[email protected]>
Description
Fix Dataprep Ingest Data Issue.
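Root Cause:
The `langchain_huggingface` package was updated, which changed the output of `HuggingFaceEndpointEmbeddings.embed_documents`.
Trace:
1. An update to `langchain_huggingface.HuggingFaceEndpointEmbeddings` caused embedding vectors of the wrong size.
2. The wrongly sized vectors are saved into the Redis database as type `byte`, and the indices are not created correctly.
3. As a result, the retriever cannot retrieve data from Redis using the index.
4. RAG then appears not to work, because the uploaded file cannot be retrieved from the database.
Solution:
Replace all uses of `langchain_huggingface.HuggingFaceEndpointEmbeddings` with `langchain_community.embeddings.HuggingFaceInferenceAPIEmbeddings`, and update the related READMEs and scripts.
Issues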
opea-project/GenAIExamples#1473
opea-project/GenAIExamples#1482
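The failure described in the trace starts with embedding vectors of the wrong size reaching Redis. A guard of the following kind, run on the output of `embed_documents` before ingestion, could surface that early. This is only a sketch: `check_embeddings` and `expected_dim` are hypothetical names that are not part of this PR, and the correct dimension depends on the embedding model being served.

```python
from numbers import Real
from typing import Sequence


def check_embeddings(vectors: Sequence[Sequence[float]], expected_dim: int) -> None:
    """Raise ValueError unless every vector is a flat sequence of expected_dim numbers.

    Hypothetical helper for illustration; expected_dim must match the
    dimension of the embedding model in use.
    """
    for i, vec in enumerate(vectors):
        # Each embedding should be a plain list or tuple, not a scalar or string.
        if not isinstance(vec, (list, tuple)):
            raise ValueError(f"vector {i} is {type(vec).__name__}, expected a list")
        # A length mismatch also catches an extra level of nesting,
        # e.g. [[...]] where [...] was expected.
        if len(vec) != expected_dim:
            raise ValueError(
                f"vector {i} has length {len(vec)}, expected {expected_dim}"
            )
        # Entries must be real numbers (bool is excluded on purpose).
        if not all(isinstance(x, Real) and not isinstance(x, bool) for x in vec):
            raise ValueError(f"vector {i} contains non-numeric entries")
```

Calling `check_embeddings(embedder.embed_documents(chunks), expected_dim)` before writing to the vector store would turn a silent indexing failure into an immediate error; `embedder` and `chunks` are likewise placeholder names.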
Type of change
List the type of change like below. Please delete options that are not relevant.
Dependencies
None
Tests
Tested locally.