Load an entire genotype dataset as a nada-numpy secret and run lookup operations on it #74

Open
2 of 4 tasks
vishakh opened this issue Oct 1, 2024 · 2 comments
Assignees
Labels
enhancement New feature or request

Comments

@vishakh
Contributor

vishakh commented Oct 1, 2024

  • Create a branch off feature/nada_array_ops
  • Study the contents of large_dataset-test.ipynb
  • Complete the work in the above notebook so it successfully invokes programs/muscle-perform-array.py
  • After getting a small subset of the sample dataset working, try to load 5,000 rows as a secret and operate on it.
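A minimal plaintext sketch of the last task, before any secret sharing is involved: cap the input at 5,000 rows and turn each row into integers that could later be packed into an array. The tab-separated layout (rsid, chromosome, position, genotype), the `load_rows` and `encode_genotype` helpers, and the base-to-integer codes are all assumptions made for illustration; none of them comes from the notebook or from nada-numpy.

```python
import csv
import io

MAX_ROWS = 5000  # row cap taken from the task list above

# Hypothetical integer codes for bases; the real encoding may differ.
BASE_CODES = {"A": 1, "C": 2, "G": 3, "T": 4}


def encode_genotype(call: str) -> int:
    """Map a two-letter call such as 'CT' to an integer; unknown bases map to 0."""
    codes = [BASE_CODES.get(base, 0) for base in call.strip().upper()[:2]]
    while len(codes) < 2:
        codes.append(0)
    return codes[0] * 10 + codes[1]


def load_rows(text: str, max_rows: int = MAX_ROWS):
    """Return parallel lists (snp_ids, genotype_codes) for the first max_rows rows.

    Assumes tab-separated rows with the SNP identifier in the first column and
    the genotype call in the last column; lines starting with '#' are skipped.
    """
    snp_ids, genotypes = [], []
    for row in csv.reader(io.StringIO(text), delimiter="\t"):
        if not row or row[0].startswith("#"):
            continue
        if len(snp_ids) >= max_rows:
            break
        digits = "".join(ch for ch in row[0] if ch.isdigit())
        if not digits:
            continue  # identifier has no numeric part, so it cannot be encoded
        snp_ids.append(int(digits))
        genotypes.append(encode_genotype(row[-1]))
    return snp_ids, genotypes


# Made-up sample rows; only the SNP numbers appear in this thread.
sample = (
    "# rsid\tchromosome\tposition\tgenotype\n"
    "rs548049170\t1\t100\tTT\n"
    "rs1815739\t11\t200\tCT\n"
)
snp_ids, genotypes = load_rows(sample)
```

Keeping identifiers and genotypes as two parallel integer lists is one possible layout; it keeps the later conversion to an array a single step.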
@vishakh vishakh added the enhancement New feature or request label Oct 1, 2024
@vishakh
Contributor Author

vishakh commented Oct 2, 2024

After consulting the Nillion team, I'm changing the last task to load only 5,000 rows instead of the whole dataset.

@amardeepio
Member

For muscle performance, snp1815739 isn't present once I reduce the number of processed rows to 5,000. I tried to pick some popular SNPs from the list on the SNPedia homepage.

Screenshot from 2024-10-07 14-15-15

But I wasn't able to find any of the popular SNPs listed there in these 5,000 rows, so I picked one at random from the processed rows (snp 548049170) and was able to run the lookup with it.

Screenshot from 2024-10-07 14-18-50
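The fallback described above can be sketched in plain Python: check whether a preferred SNP is in the truncated rows, fall back to one that is, then look up its genotype with a compare-and-sum pass that touches every row. The compare-and-sum form is a common way to express lookups over secret arrays, but whether the notebook or nada-numpy does it this way is an assumption; `pick_target`, `lookup`, and the sample values are hypothetical.

```python
def pick_target(snp_ids, preferred):
    """Return the first preferred SNP id present in snp_ids, else the first row's id."""
    if not snp_ids:
        raise ValueError("no rows were loaded")
    present = set(snp_ids)
    for candidate in preferred:
        if candidate in present:
            return candidate
    return snp_ids[0]  # fall back to a SNP that is actually in the subset


def lookup(snp_ids, genotypes, target_id):
    """Compare-and-sum lookup: every row is visited, only the match contributes.

    Assumes ids are unique; returns 0 when the target is absent.
    """
    return sum(int(s == target_id) * g for s, g in zip(snp_ids, genotypes))


# Made-up genotype codes; the ids are the two SNP numbers mentioned in this thread.
snp_ids = [548049170, 111]
genotypes = [44, 12]

target = pick_target(snp_ids, preferred=[1815739])  # 1815739 is not in the subset
result = lookup(snp_ids, genotypes, target)
```

Because an absent target and a genotype code of 0 both produce 0, a membership check such as the one in `pick_target` is worth running before trusting the result.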

Now, is the goal to write a Nada program in the notebook and execute it?


2 participants