# Extended IFEval Benchmark with Integrated Prompt Generation
Please make sure that all required Python packages are installed via:

```bash
pip3 install -r requirements.txt
```
Then generate the evaluation prompts. The command below performs 500 generation runs, with at most 3 instructions sampled per prompt:

```bash
python3 generator.py --num_runs 500 --max_instructions 3
```
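The generated prompt set is what the evaluation step below consumes as `--input_data`. Assuming the generator follows the upstream IFEval input schema (an assumption, not verified for this repository), each line of that file would look like:

```bash
# Hypothetical example line, following the upstream IFEval `input_data` schema:
# {"key": 1000, "prompt": "Write a 300+ word summary ...", "instruction_id_list": ["length_constraints:number_words"], "kwargs": [{"relation": "at least", "num_words": 300}]}
```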
To evaluate a model, you need to create a JSONL file in which each line contains two fields: `prompt` and `response` (a sketch follows below).
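As an illustration, here is a minimal sketch of producing the response file from the generated prompts; `query_model` is a hypothetical placeholder for your own model call, and the file paths match the evaluation command below:

```python
import json

def query_model(prompt: str) -> str:
    """Hypothetical placeholder: replace with a call to your model."""
    raise NotImplementedError

# Read the generated prompts and write one {"prompt", "response"} pair per line.
with open("data/input_llama_3_8b.jsonl") as fin, \
        open("data/input_response_llama_3_8b.jsonl", "w") as fout:
    for line in fin:
        prompt = json.loads(line)["prompt"]
        record = {"prompt": prompt, "response": query_model(prompt)}
        fout.write(json.dumps(record) + "\n")
```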
Then call `evaluation_main` from the parent folder of `instruction_following_eval`. For example:
```bash
# Content of `--input_response_data` should be like:
# {"prompt": "Write a 300+ word summary ...", "response": "PUT YOUR MODEL RESPONSE HERE"}
# {"prompt": "I am planning a trip to ...", "response": "PUT YOUR MODEL RESPONSE HERE"}
# ...
python3 -m evaluation_main \
  --input_data=./data/input_llama_3_8b.jsonl \
  --input_response_data=./data/input_response_llama_3_8b.jsonl \
  --output_dir=./data/
```
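The upstream IFEval writes per-prompt results (e.g. `eval_results_strict.jsonl` and `eval_results_loose.jsonl`) to `--output_dir`. Assuming this extended benchmark keeps that convention (an assumption, not verified here), prompt-level strict accuracy can be tallied as follows:

```python
import json

# Assumes the upstream IFEval output convention: one JSON object per line,
# each carrying a boolean "follow_all_instructions" field.
with open("data/eval_results_strict.jsonl") as f:
    results = [json.loads(line) for line in f]

accuracy = sum(r["follow_all_instructions"] for r in results) / len(results)
print(f"strict prompt-level accuracy: {accuracy:.3f}")
```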