About the 170,000 (17w) training samples in the paper #65

Open
LeeCASC opened this issue Dec 17, 2024 · 6 comments

@LeeCASC
LeeCASC commented Dec 17, 2024

The paper mentioned the use of the gobjaverse dataset, which has about 170,000 shapes. However, the current gobjaverse index json contains 260,000 shapes. After removing more than 40,000 poor quality shapes, there are still 220,000 shapes. How were the extra 50,000 shapes screened out?

@hitsz-zuoqi
Collaborator

Hello, we have two versions of the data: an initial version with 220,000 samples, and a 170,000-shape subset whose CLIP score is above 28. You could filter the 220,000 shapes by the CLIP score between the text and the rendering, but overall all 220,000 shapes can be used for training.
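
A minimal sketch of the threshold filter described above, assuming the per-shape CLIP scores have already been computed and stored as a mapping from shape path to score. The function name, the mapping layout, and the sample paths are hypothetical; only the threshold of 28 comes from this thread.

```python
def filter_by_clip_score(scores, threshold=28.0):
    """Return the shape paths whose CLIP score is strictly above `threshold`."""
    return sorted(path for path, score in scores.items() if score > threshold)


# Hypothetical scores, for illustration only (not real gobjaverse data).
example_scores = {
    "0/10001": 31.2,
    "0/10002": 27.9,
    "1/20003": 28.0,  # exactly at the threshold, so it is dropped
    "1/20004": 29.5,
}

kept = filter_by_clip_score(example_scores)
print(f"kept {len(kept)} of {len(example_scores)} shapes: {kept}")
```

Whether a score of exactly 28 should be kept is a judgment call; the comparison here is strict because the reply says "above 28".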

@LeeCASC
Author

LeeCASC commented Dec 26, 2024

Thank you for your reply~
Could you please provide a JSON or TXT file listing the file paths of those 170,000 shapes?

@hitsz-zuoqi
Collaborator

wget this link: https://virtualbuy-devo.oss-cn-hangzhou.aliyuncs.com/muyuan/CVPapers/tmp/valid_paths_v4_cap_filter_thres_28_catfilter19w.json?OSSAccessKeyId=LTAI5tAGAZPVv9b26UUBYDuk&Expires=1735886306&Signature=%2FFTkuAb8zWVUU%2FxP%2BFHl0L6UfM8%3D

@LeeCASC
Author

LeeCASC commented Dec 30, 2024

Thank you very much~

LeeCASC closed this as completed Dec 30, 2024

@LeeCASC
Author

LeeCASC commented Jan 8, 2025

Hello, could you provide the download link again? The current link has expired. Could you also provide the split into training, testing, and validation sets?

LeeCASC reopened this Jan 8, 2025

@hitsz-zuoqi
Collaborator

https://virtualbuy-devo.oss-cn-hangzhou.aliyuncs.com/muyuan/CVPapers/tmp/valid_paths_v4_cap_filter_thres_28_catfilter19w.json?OSSAccessKeyId=LTAI5tAGAZPVv9b26UUBYDuk&Expires=1746752381&Signature=KCk0w35rXKmgaB1cO9Tr8nb1H0I%3D
Here it is; the link is valid for 1,000,000 seconds (about 11.6 days).
A permanent copy is also attached:
valid_paths_v4_cap_filter_thres_28_catfilter19w.json
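
Since the earlier signed link expired, it may help to check a link's expiry before trying to download. The sketch below assumes the `Expires` query parameter is a Unix timestamp in seconds, which is how it appears in the URLs in this thread. The helper names and the placeholder URL are hypothetical.

```python
from datetime import datetime, timezone
from urllib.parse import parse_qs, urlparse


def signed_url_expiry(url):
    """Return the time encoded in the URL's `Expires` parameter, in UTC."""
    query = parse_qs(urlparse(url).query)
    # Assumption: `Expires` holds a Unix timestamp in seconds.
    return datetime.fromtimestamp(int(query["Expires"][0]), tz=timezone.utc)


def is_expired(url, now=None):
    """Return True if the signed URL's expiry time has already passed."""
    if now is None:
        now = datetime.now(timezone.utc)
    return now >= signed_url_expiry(url)


# Placeholder URL for illustration; only the Expires value is taken from the thread.
url = "https://example.com/paths.json?Expires=1746752381&Signature=placeholder%3D"

print("expires at:", signed_url_expiry(url).isoformat())
print("expired now:", is_expired(url))
```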
