You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The Variable Catalog tends to get filled up with descriptive-type variables (site location, treatment ID, block ID, latitude, longitude, etc.) where most researchers are likely looking for measurement-type variables (stand count, biomass, percent carbon, volumetric water content). We should consider providing a way to differentiate these variable types for better filtering.
This can easily get out of hand (see here and here).
Best option is to be consistent with other such filters (e.g. zone, processing, quality control) and allow variable types to be defined in the app-config.json file.
We also will need to provide default values. Maybe follow statistics and go with the upper level: Quantitative/Numeric, Qualitative/Categorical? Or go with an extra step and do: Discrete, Continuous, Nominal, Ordinal.
Keeping things simple (for speed-to-metadata): Numeric, Categorical.
The text was updated successfully, but these errors were encountered:
Take a cue from dimensional modeling. A "dimension" describes "who, what, where, when, why, and how". A "metric" is a quantitative measurement.
I think it's safe to say the context of the dataset can determine what is a dimension vs a metric -- so we have some leeway here. For example, measuring crop height, "height" will be a metric. But if we are describing the height of a sensor then "height" will be a dimension.
Nominal and ordinal variables pose some confusion. There are arguments that ordinal vars can be considered continuous vars (https://www.frontiersin.org/articles/10.3389/feduc.2020.589965/full). So maybe we can handwave that. (although there are best practices that state "metrics" should allow the calculation of means, min, max, etc.)
I'm not sure about nominal though. Should we always treat those as dimensions? If so, something like a management dataset, which is mostly nominal values, will pose an issue. Would we want a "drill type" or "tractorId" grouped with "plotId", "NearestTown", and other descriptive variables? Maybe this is fine?
The Variable Catalog tends to get filled up with descriptive-type variables (site location, treatment ID, block ID, latitude, longitude, etc.) where most researchers are likely looking for measurement-type variables (stand count, biomass, percent carbon, volumetric water content). We should consider providing a way to differentiate these variable types for better filtering.
This can easily get out of hand (see here and here).
Best option is to be consistent with other such filters (e.g. zone, processing, quality control) and allow variable types to be defined in the app-config.json file.
We also will need to provide default values. Maybe follow statistics and go with the upper level: Quantitative/Numeric, Qualitative/Categorical? Or go with an extra step and do: Discrete, Continuous, Nominal, Ordinal.
Keeping things simple (for speed-to-metadata): Numeric, Categorical.
The text was updated successfully, but these errors were encountered: