Default to UMAP dimensionality reduction method? #138

Open
PaulHax opened this issue Oct 18, 2024 · 1 comment

Comments

@PaulHax
Collaborator

PaulHax commented Oct 18, 2024

UMAP is much slower than PCA, but its plots are much more interesting.

With 1000 images, dim reduction time in seconds:
PCA: 0.08
UMAP: 10.4

Perhaps we just need a faster/better way to "scrub" around the tightly clustered clouds made by PCA?

Ideas for coping with the slow embeddings + dim-reduction plot:

  • Cache embedding computation.
  • Allow pre-computing of UMAP, then Explorer loads it from "dataset."
  • Spinner over whole plot, not just the "Compute" button.
  • Instead of a spinner, a rough (BS) progress bar with an estimated duration of image_count * seconds_per_image * 1/cpu_power
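
As a minimal sketch of the first and last ideas above: the embedding cache below keys results on a hash of the image bytes, and `estimate_seconds` is the progress-bar formula written out. The names `EmbeddingCache`, `embed_fn`, and `estimate_seconds` are hypothetical and not part of the Explorer code; `embed_fn` is assumed to take raw image bytes and return a sequence of floats.

```python
import hashlib
import json
import tempfile
from pathlib import Path


class EmbeddingCache:
    """Disk cache from image bytes to embedding vector (hypothetical helper)."""

    def __init__(self, embed_fn, cache_dir=None):
        # embed_fn is an assumed callable: bytes -> sequence of floats.
        self.embed_fn = embed_fn
        # Fall back to a fresh temp directory when no location is given.
        self.cache_dir = Path(cache_dir or tempfile.mkdtemp(prefix="embedding-cache-"))
        self.cache_dir.mkdir(parents=True, exist_ok=True)
        self.misses = 0  # number of times embed_fn actually ran

    def _path(self, image_bytes):
        # Content hash, so a renamed or re-loaded image still hits the cache.
        digest = hashlib.sha256(image_bytes).hexdigest()
        return self.cache_dir / f"{digest}.json"

    def get(self, image_bytes):
        path = self._path(image_bytes)
        if path.exists():
            return json.loads(path.read_text())
        self.misses += 1
        embedding = [float(x) for x in self.embed_fn(image_bytes)]
        path.write_text(json.dumps(embedding))
        return embedding


def estimate_seconds(image_count, seconds_per_image, cpu_power=1.0):
    """Rough progress-bar duration: image_count * seconds_per_image * 1/cpu_power."""
    return image_count * seconds_per_image / cpu_power
```

The cache only avoids recomputing embeddings; the dimensionality reduction itself would still run on every "Compute" unless its output is stored as well, which is what the pre-computing idea covers.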
@PaulHax
Collaborator Author

PaulHax commented Jan 3, 2025

The output of UMAP.transform changes with batch effects (the result for an image depends on what else is in the batch), which is a problem for our lazily computed transformed images:
lmcinnes/umap#158
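
One way to measure how much this affects a given reducer is to transform the same item alone and as part of a larger batch, then compare the coordinates. The sketch below is only an illustration: `max_batch_drift` is a hypothetical helper, and `transform` is a stand-in callable that takes a list of vectors and returns a list of coordinates, not umap-learn's actual API.

```python
def max_batch_drift(transform, items, probe_index=0):
    """Largest per-coordinate difference for one item when it is
    transformed alone versus as part of the full batch."""
    alone = transform([items[probe_index]])[0]
    in_batch = transform(items)[probe_index]
    return max(abs(a - b) for a, b in zip(alone, in_batch))


# Toy stand-ins (assumptions for illustration only).
def independent(batch):
    # Each output depends only on its own input, so there is no drift.
    return [[2 * v[0], 2 * v[1]] for v in batch]


def batch_centered(batch):
    # Subtracts the batch mean, so each output depends on its neighbours.
    mean_x = sum(v[0] for v in batch) / len(batch)
    return [[v[0] - mean_x, v[1]] for v in batch]


items = [[0.0, 0.0], [2.0, 0.0], [4.0, 0.0]]
drift_independent = max_batch_drift(independent, items)
drift_centered = max_batch_drift(batch_centered, items)
```

A drift of zero means lazily transformed images land in the same place regardless of which batch they arrive in; any non-zero drift means points can shift depending on load order.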
