Optimize/configure experiment commits for which to push cache #6593
dberenbaum
started this conversation in
New Features & Ideas
Replies: 1 comment 1 reply
-
I think this is a real issue. One of the motivations behind #6549. Having a granular control over "cache-depth" is also important, though not as immediate as experiments. I believe we shouldn't push any artifact to a Git repository if it's not already tracked by Git. We are trying to push 50+MB model files to Git and only notice this when Github refuses. These artifacts can quickly fill up the repository space. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
dvc exp push/pull
transfer all cached outputs for all experiment commits on which the experiment is based, including the baseline commit (see #6592).Potential issues:
dvc exp branch
.I can understand the default behavior because otherwise the user risks losing all of the cached data from the experiment commits if the local experiments get destroyed. However, if a user wants to reproduce or build on top of an existing experiment commit, it may be simpler to ask them to follow the typical workflow of having to
dvc pull
first. Users may want granular control to say to pull the cache for the last n commits of the branch/experiment.Is this really even an experiment-specific functionality? Being able to push the cache for multiple commits in a regular Git branch would enable more consistency between pushing/pulling experiments and
dvc exp branch
branches and seems useful outside of experiments, too. Even for a single commit, this would resolve user complaints of having to separately dogit push
anddvc push
.@pmrowla @daavoo @iesahin
Beta Was this translation helpful? Give feedback.
All reactions