I wonder if you could share some experience on collecting datasets #26

Open
Pisces032 opened this issue Sep 1, 2024 · 1 comment

Comments

@Pisces032

I'm trying to fine-tune it with PEFT. I have collected some datasets, but they are either too small or depend on too many headers that have to be installed first, and the install commands for different headers differ greatly.
So I wonder if you have any advice on how to find suitable datasets like AnghaBench.
Thank you so much!

@albertan017
Owner

We've only found AnghaBench and ExeBench, which cover nearly all available C libraries. If you have specific requirements, you may need to compile larger projects such as Linux manually. It's time-consuming, but it can help improve the model further, and that's what we're doing now.


2 participants