-
Notifications
You must be signed in to change notification settings - Fork 394
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cmd ref: improve run
and commit
/ how to add outs/deps without re-running stage?
#460
Comments
dvc commit
/ how to add output without re-running stage?dvc run
doc / how to add output without re-running stage?
dvc run
doc / how to add output without re-running stage?run
and commit
/ how to add output without re-running stage?
already started working on it. @jorgeorpinel |
This would make a great note in the command descriptions and possibly a new example, at least in |
I agree - it can be even a section in user guide - editing the pipeline. And of course, |
run
and commit
/ how to add output without re-running stage?run
and commit
/ how to add output and dependencies without re-running stage?
Per #612 (comment) this wasn't fully addressed by #1840. @imhardikj can you confirm what part this is still pending? Thanks |
"How to add missing dependency to existing pipeline." part is remaining. |
run
and commit
/ how to add output and dependencies without re-running stage?run
and commit
/ how to add outs/deps without re-running stage?
@imhardikj the same doc should be enough. I see you opened #1913, let's follow up there. Thanks |
Often when I run a command with
dvc run
, I realise that I have forgotten to specify one of the outputs. I would therefore like to update the DVC file with an additional output, but without re-running the (potentially expensive) command.With the help of @efiop on the discourse channel, I was able to figure out that this can be achieved with the following steps:
dvc run
with the additional output and the--no-exec
flagdvc commit
to add the new output to the dvc cache, compute its checksum and add it to the dvc file.This works perfectly, but looking at the documentation, it wasn't obvious that this is what
dvc commit
would do. In particular, the opening line “Record changes to the repository by updating DVC-files and saving outputs to cache.” It wasn't clear to me that “updating DVC-files” meant recomputing the checksums of the outputs.In the step-by-step explanation of what
dvc commit
does:I would suggest the first bullet could be reworded as “computes the checksum of each output file/directory, as well as the checksum of the DVC-file itself” (if my understanding is correct). The second bullet should read “enter the checksums of the outputs and of the DVC-file into the DVC-file”. I'm actually still unsure what is meant by “enters the file name”. Aren't all file names already present in the DVC-file?
UPDATE (From #612)
Dependencies can also be added to a stage without re-running a stage , using the same steps as described above.
The text was updated successfully, but these errors were encountered: