Stored data de-duplication #330
cayolblake started this conversation in Ideas
Replies: 2 comments, 4 replies
-
It's an interesting idea. We will have a discussion internally. If you have enough time, contributions are very welcome. 😀
0 replies
-
De-duplication is a hot topic in storage and is useful in some cases, but as a general feature its benefit may not justify the high cost, so we have not put it on the roadmap. JuiceFS does provide some features for de-duplicating data at the application level, for example hard links and a faster CopyFileRange (which does not copy the data).
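As a rough illustration of what application-level de-duplication with those two features could look like, here is a minimal Python sketch. The helper names `link_duplicate` and `copy_range` are hypothetical and not part of JuiceFS; the sketch only uses the standard `os.link` and `os.copy_file_range` calls (the latter is Linux-only and needs Python 3.8+), and whether the copy actually avoids duplicating data depends on the underlying file system.

```python
import os
import shutil


def link_duplicate(existing, duplicate):
    """Create `duplicate` as a hard link: a second name for the same inode."""
    os.link(existing, duplicate)


def copy_range(src, dst):
    """Copy src to dst with copy_file_range(2), letting the file system do the work.

    Falls back to an ordinary copy if the call is unavailable or unsupported.
    """
    remaining = os.path.getsize(src)
    with open(src, "rb") as fin, open(dst, "wb") as fout:
        try:
            while remaining > 0:
                copied = os.copy_file_range(fin.fileno(), fout.fileno(), remaining)
                if copied == 0:  # source ended earlier than expected
                    break
                remaining -= copied
        except (AttributeError, OSError):
            # Assumption: restart from scratch with a plain userspace copy.
            fin.seek(0)
            fout.seek(0)
            fout.truncate()
            shutil.copyfileobj(fin, fout)
```

A hard link shares all data between the two names, so later writes through either name are visible through both; `copy_range` produces an independent file, which is the safer choice when the copies may diverge.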
4 replies
-
Hello,
I wonder if data de-duplication is on your roadmap.
Basically, an attempt to store the same file multiple times would first check whether the file's chunks already exist somewhere (using some sort of fast hashing algorithm). If they do, the new file would just point to the already existing data without consuming additional space and effort.
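To make the idea concrete, here is a minimal in-memory sketch in Python of what I have in mind. Everything in it is an assumption for illustration: the `DedupStore` class is hypothetical, the chunks are fixed-size, and SHA-256 stands in for "some sort of fast hashing algorithm". It says nothing about how JuiceFS actually stores chunks.

```python
import hashlib


class DedupStore:
    """Toy content-addressed store: identical chunks are kept only once."""

    def __init__(self, chunk_size=4 * 1024 * 1024):
        self.chunk_size = chunk_size
        self.chunks = {}  # digest -> chunk bytes
        self.files = {}   # file name -> ordered list of chunk digests

    def put(self, name, data):
        digests = []
        for start in range(0, len(data), self.chunk_size):
            chunk = data[start:start + self.chunk_size]
            digest = hashlib.sha256(chunk).hexdigest()
            # Store the chunk only if this digest has not been seen before.
            self.chunks.setdefault(digest, chunk)
            digests.append(digest)
        self.files[name] = digests

    def get(self, name):
        # Rebuild the file by following its chunk references in order.
        return b"".join(self.chunks[d] for d in self.files[name])

    def stored_bytes(self):
        return sum(len(chunk) for chunk in self.chunks.values())
```

Storing the same content under two names keeps one copy of each chunk, and the second `put` only records references. The price is a hash computation and an index lookup on every write.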