As a user, I want to exclude files from harvest run based upon PDS4 product class #219

Open
jordanpadams opened this issue Nov 14, 2024 · 0 comments

jordanpadams commented Nov 14, 2024

Checked for duplicates

Yes - I've already checked

πŸ§‘β€πŸ”¬ User Persona(s)

Node Operator

💪 Motivation

...so that I can ignore specific product classes that I do not want loaded into the Registry

📖 Additional Details

From user:

We have non-archive files in our service directories to support serving the data. Things like DOI landing pages, for example, and pre-made bulk download files. If the loader is going to assume that everything in a directory is either a PDS4 label or something pointed to by a PDS4 label, it's going to choke. We could make an exclusion list of directory names, file names, and file extensions to ignore, if that would help.
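
To make the user's suggestion concrete, here is a minimal sketch of a path-based exclusion list covering directory names, file names, and file extensions. It is not Harvest code or an existing Harvest option: the function `is_excluded` and every entry in the three sets are hypothetical and chosen only for illustration.

```python
from pathlib import PurePosixPath

# Hypothetical exclusion lists; a real deployment would supply its own values.
EXCLUDED_DIRS = {"doi", "bulk_download"}
EXCLUDED_FILES = {"index.html"}
EXCLUDED_EXTENSIONS = {".zip", ".html"}


def is_excluded(relative_path,
                dirs=EXCLUDED_DIRS,
                files=EXCLUDED_FILES,
                extensions=EXCLUDED_EXTENSIONS):
    """Return True if a path (relative to the data root) should be skipped."""
    path = PurePosixPath(relative_path)
    # Skip anything located under an excluded directory name.
    if any(part in dirs for part in path.parts[:-1]):
        return True
    # Skip exact file-name matches.
    if path.name in files:
        return True
    # Skip by extension, compared case-insensitively.
    return path.suffix.lower() in extensions
```

With these example lists, `is_excluded("doi/landing.xml")` and `is_excluded("data/bundle.ZIP")` are both true, while `is_excluded("data/obs.xml")` is false.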

Acceptance Criteria

Given a root/ directory containing two XML products within its sub-directory structure: one Product_Observational product, urn:nasa:pds:observational_x, and one Product_Browse product, urn:nasa:pds:browse_y
When I perform harvest run with dataPath = root/ and productFilter = Product_Browse
Then I expect urn:nasa:pds:observational_x to be loaded into the Registry, and urn:nasa:pds:browse_y to be ignored.
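
The scenario above can be sketched as a small, self-contained script. This is an illustration under stated assumptions, not Harvest's implementation: it treats `productFilter` as a set of product classes to exclude, takes the product class from the local name of the label's root element, and uses hypothetical helper names (`read_label`, `select_labels`, `demo`). The demo labels are stripped down to the identifier only and are not valid PDS4 labels.

```python
import os
import tempfile
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip the "{namespace}" prefix ElementTree puts on qualified tags."""
    return tag.rsplit("}", 1)[-1]


def read_label(path):
    """Return (product_class, logical_identifier), or None if not parseable XML."""
    try:
        root = ET.parse(path).getroot()
    except ET.ParseError:
        return None
    lid = None
    for elem in root.iter():
        if local_name(elem.tag) == "logical_identifier":
            lid = (elem.text or "").strip()
            break
    return local_name(root.tag), lid


def select_labels(data_path, excluded_classes):
    """Walk data_path and return the LIDs of labels whose class is not excluded."""
    selected = []
    for dirpath, _dirs, files in os.walk(data_path):
        for name in files:
            if not name.lower().endswith(".xml"):
                continue
            info = read_label(os.path.join(dirpath, name))
            if info is None or info[0] in excluded_classes:
                continue
            selected.append(info[1])
    return sorted(selected)


def demo():
    """Build the two-product tree from the acceptance criteria and filter it."""
    template = (
        '<{cls} xmlns="http://pds.nasa.gov/pds4/pds/v1">'
        "<Identification_Area>"
        "<logical_identifier>{lid}</logical_identifier>"
        "</Identification_Area></{cls}>"
    )
    products = [
        ("obs.xml", "Product_Observational", "urn:nasa:pds:observational_x"),
        ("browse.xml", "Product_Browse", "urn:nasa:pds:browse_y"),
    ]
    with tempfile.TemporaryDirectory() as root:
        sub = os.path.join(root, "data", "sub")
        os.makedirs(sub)
        for fname, cls, lid in products:
            with open(os.path.join(sub, fname), "w", encoding="utf-8") as fh:
                fh.write(template.format(cls=cls, lid=lid))
        # productFilter = Product_Browse, interpreted here as an exclusion
        return select_labels(root, {"Product_Browse"})
```

Calling `demo()` returns only the observational product's identifier, which matches the expected outcome in the Then clause.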

βš™οΈ Engineering Details

No response

🎉 I&T

No response

Projects
Status: ToDo
Development

No branches or pull requests

1 participant