Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create cohort-based standard table 1 #169

Open
schuemie opened this issue Jun 7, 2022 · 7 comments
Open

Create cohort-based standard table 1 #169

schuemie opened this issue Jun 7, 2022 · 7 comments

Comments

@schuemie
Copy link
Member

schuemie commented Jun 7, 2022

The current createTable1() function relies on concept-based covariates. We should create an alternative table 1 based on cohorts. For example, instead of 'Diabetes type 2' being based on the occurrence of a single condition concept, it should be based on a more complex definition, possibly involving antidiabetes treatments, lab values, etc.

I propose to use the same set of items as the current table 1 (see specs here), since those were selected before to be 'good enough'. For each item (except possible demographics) we should find a corresponding cohort definition. We could then

  1. load those cohort definitions into FeatureExtraction.
  2. add a function in FeatureExtraction to retrieve all these cohort definitions (in a format suitable for CohortGenerator),
  3. add a function for creating cohort-based covariates specifically for this set of cohorts.
  4. adapt the createTable1() function for working with this new table 1.

@gowthamrao: I'd imagine the cohort definitions would come from the OHDSI Phenotype Library.

Also looping in @pbr6cornell .

@schuemie
Copy link
Member Author

I have extracted the concepts used in Table 1 here, so people know which cohort definitions we need.

@gowthamrao
Copy link
Member

gowthamrao commented Jun 30, 2022

Yes - here is a link to candidate cohorts

https://docs.google.com/spreadsheets/d/1pdGYoH2bwr_fjFvvAWUiSJIVzY1_7FTdfkDHq1sO5dk/edit?usp=sharing

(updated link)

@gowthamrao
Copy link
Member

gowthamrao commented Jul 8, 2022

The version 3.1.1 of PhenotypeLibrary R package has the above referenced candidate cohorts. We can use those cohorts with FeatureExtraction to proof test this idea.

@schuemie
Copy link
Member Author

schuemie commented Aug 8, 2022

Hi @gowthamrao. The Excel sheet you referenced contains a lot of entries labeled 'not started' or status = 'yes'. Would it be possible to create a clean table. where one column contains the cohort IDs as found the PhenotypeLibrary package?

@gowthamrao
Copy link
Member

@schuemie yes

Since building and maintaining these cohorts are now the responsibility of OHDSI Phenotype Development and Evaluation workgroup using the OHDSI Peer review process - i suggest we move this issue to https://github.com/OHDSI/PhenotypeLibrary

We can circle back to technical implementation once peer reviewed cohorts are available

Thoughts?

@gowthamrao
Copy link
Member

FYI - Kevin Haynes and @gowthamrao are revisiting this effort now

@anthonysena
Copy link
Collaborator

I think we'll want to get #167 merged into FeatureExtraction and use that work to help drive the Table 1 functionality described in this issue.

I suppose once this is in place, we could potentially have a dependency on the PhenotypeLibrary (PL) package such that defining the Table 1 specs simply references the cohort(s) in the PL. Hoping this is what you had in mind @gowthamrao but let me know if I've misunderstood here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants