Workaround for dynamic indexing on CPU distribution #3802
-
Hey, Have any of you ever had to use the CPU distribution but with the need to index progressively? Even if it is designed specifically for static dataset, is there any possible workaround (obviously at the expense of performance)? Thanks, Matteo |
Beta Was this translation helpful? Give feedback.
Replies: 4 comments 2 replies
-
Hi, The data is stored in segment, and the size of each segment is the |
Beta Was this translation helpful? Give feedback.
-
Thank you @shiyu22 for such a quick response! To make sure that I understood you correctly before I try it, let me describe my flow of operations at a high level: For each vector
Is it right? If so, how does the CreateIndex function work? Does it add to the index only the current vector of each iteration ( I hope I've been clear enough :) |
Beta Was this translation helpful? Give feedback.
-
Thanks, I think I have now enough info to start the implementation! By the way, is it then a bit confusing saying that the CPU version can deal with static dataset only? However, if the "loop" I mentioned earlier is performed in parallel, let's say by different pods in a k8s environment, I assume that in order to find the vectors they must be indexed (indexing for me equals storing, somehow). That's why I was thinking to index vectors one by one. |
Beta Was this translation helpful? Give feedback.
-
That phrase is really confusing and is it on the website? Please submit a doc issue and point the place where we can change it. |
Beta Was this translation helpful? Give feedback.
That phrase is really confusing and is it on the website? Please submit a doc issue and point the place where we can change it.