Official repository for LLM-PIEval. This release contains full API specifications along with the blackbox benchmark prompts generated for this paper.
See CONTRIBUTING for more information.
This library is licensed under the Creative Commons Attribution-NonCommercial 4.0 International License.
If you use this benchmark or the APIs, consider citing our work:
@misc{ramakrishna2024llm,
title={LLM-PIEval: A benchmark for indirect prompt injection attacks in large language models},
author={Ramakrishna, Anil and Majmudar, Jimit and Gupta, Rahul and Hazarika, Devamanyu},
year={2024},
howpublished={AdvML-Frontiers’24: The 3nd Workshop on New Frontiers in Adversarial Machine Learning@NeurIPS’24,
Vancouver, CA},
url = {https://www.amazon.science/publications/llm-pieval-a-benchmark-for-indirect-prompt-injection-attacks-in-large-language-models},
}