
ragas: pass rubric in via RubricScore.SingleTurnPrompt.instruction #211

Open
wants to merge 2 commits into base: main

Conversation

alimaredia (Contributor) commented:
In ragas, RubricScores.rubrics isn't used anywhere except in the repr, so the rubric was not being passed into the prompt for the judge to evaluate responses against reference answers.

Passing a string rubric into SingleTurnPrompt.instruction lets us include the rubric in the prompt sent to the judge.
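The idea can be sketched as follows. This is a hypothetical illustration, not the actual ragas API: build_judge_instruction and DEFAULT_INSTRUCTION are invented names. The point is that the rubric text has to end up inside the instruction string handed to the judge prompt, since RubricScores.rubrics on its own never reaches the prompt.

```python
# Hypothetical sketch (names invented): fold a score->description rubric into
# the instruction string that would be assigned to SingleTurnPrompt.instruction.

DEFAULT_INSTRUCTION = (
    "Given a user input, a model response, and a reference answer, "
    "score the response against the reference using the rubric below."
)

def build_judge_instruction(rubrics, base=DEFAULT_INSTRUCTION):
    """Render the rubric into a single instruction string for the judge."""
    lines = [base, "", "Rubric:"]
    for score, description in sorted(rubrics.items()):
        lines.append(f"{score}: {description}")
    return "\n".join(lines)

rubric = {
    "score1_description": "The response contradicts the reference answer.",
    "score5_description": "The response fully matches the reference answer.",
}
instruction = build_judge_instruction(rubric)
```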

@alimaredia (Contributor, Author) commented:

@RobotSail Here is what the prompts and internal responses look like without this PR and with this PR:
https://paste.sh/KB9OjHDP#29Fd_qd5QQqCoaV7zMtm6NSG

What should the final prompt look like? There's a lot baked in by ragas that's easy to see in the output when this PR is not applied.

@mergify mergify bot added the ci-failure label Jan 14, 2025
@mergify mergify bot added dependencies Pull requests that update a dependency file ci-failure and removed ci-failure labels Jan 16, 2025
@RobotSail (Member) left a comment:


Thanks for this PR @alimaredia.

I'm a little confused by this PR, because it seems like a lot of things are being changed that may not be intended.

Could you please update this PR to change only the intended behavior (making sure that the old ragas rubric template can still be used)?

Additionally, we should make the old rubric usage toggle-able, since there is reason users may want to use the new rubric.
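The toggle the review asks for could look something like this. This is a minimal sketch with invented names (RubricChoice, use_legacy_rubric, and both template strings), not the actual RagasEvaluator interface:

```python
# Hypothetical sketch: a constructor flag selects between the old and new
# rubric templates, while an explicitly supplied rubric always wins.
from typing import Optional

DEFAULT_RUBRIC = "Score the response against the reference answer from 1 to 5."
LEGACY_RUBRIC = "Legacy ragas rubric template (pre-0.2.11 wording)."

class RubricChoice:
    def __init__(self, use_legacy_rubric: bool = False,
                 rubric: Optional[str] = None):
        if rubric is not None:
            self.rubric = rubric
        elif use_legacy_rubric:
            self.rubric = LEGACY_RUBRIC
        else:
            self.rubric = DEFAULT_RUBRIC
```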

@@ -88,29 +96,25 @@ def __init__(
self.judge_openai_api_key = judge_openai_api_key

@staticmethod
def _validate_dataset(df: DataFrame):
def validate_dataset(df: DataFrame):
@RobotSail (Member) commented:

Why are we exposing this as a public method? What is the use-case for this?

required_keys = {"user_input", "reference"}
missing_keys = required_keys - set(df.columns)
if missing_keys:
required_keys = {"user_input", "reference", "response"}
@RobotSail (Member) commented:

Why are you changing response to be required? This method's only job is to ensure that the given dataset is not missing any of the values which the (InstructLab) evaluation method cannot proceed without.

required_keys = {"user_input", "reference", "response"}

columns_list = set(df.columns)
if not columns_list.issubset(required_keys):
@RobotSail (Member) commented:

This is incorrect. We use a set difference to ensure that the required keys are all contained within the columns provided in the dataset.

In your code, a dataset that passes in {"user_input"} alone would be accepted, because {"user_input"} is mathematically a subset of {"user_input", "reference"}, even though "reference" is missing.
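The distinction can be shown with a simplified example (function names invented for illustration): issubset() on the dataset's columns only catches unknown columns, while a set difference catches missing required ones, which is what validation needs here.

```python
# Simplified illustration of the two validation checks discussed above.
REQUIRED_KEYS = {"user_input", "reference"}

def validate_with_issubset(columns):
    # Buggy check from the diff: {"user_input"} passes even though
    # "reference" is missing, because it is a subset of REQUIRED_KEYS.
    return set(columns).issubset(REQUIRED_KEYS)

def validate_with_difference(columns):
    # Correct check: every required key must appear in the columns.
    missing_keys = REQUIRED_KEYS - set(columns)
    return not missing_keys
```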

)

def run(
self,
dataset: List[Sample] | Path,
student_model: ModelConfig | None = None,
dataset: List[Sample] | DataFrame,
@RobotSail (Member) commented:

How come we are getting rid of the Path type here?

run_config = run_config if run_config else self.run_config
student_openai_client = (
@RobotSail (Member) commented:

Why are we deleting this??

Before ragas v0.2.11 RubricScores.rubrics wasn't
being applied properly. This commit sets that
as the minimum version for this library.

A change in v0.2.11 from previous versions was a
change in the prompt for domain specific knowledge
evaluation with reference. The prompt from previous
versions is now explicitly passed in.

Signed-off-by: Ali Maredia <[email protected]>
Refactor RagasEvaluator Class for use for
`ilab` interface.

Signed-off-by: Ali Maredia <[email protected]>