Add native support for toxicity detection guardrail microservice #1258

daniel-de-leon-user293 · 2025-02-05T18:22:11Z

Description

After the re-architecture, Intel/toxic-prompt-roberta was removed from the toxicity detection microservice. This PR brings back the free, token-less, lightweight model as the default option for users.

The new component name, used by default if no component name is passed, is OPEA_NATIVE_TOXICITY. Users who have a Prediction Guard API key must now set TOXICITY_DETECTION_COMPONENT_NAME to PREDICTIONGUARD_TOXICITY_DETECTION.

The reason for choosing to default to Intel/toxic-promp-roberta is that it reduces friction for new users. Acquiring a Prediction Guard API key adds an extra step for new users who may be trying to use OPEA under the assumption it is free and open-source.

Issues

n/a

Type of change

List the type of change like below. Please delete options that are not relevant.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds new functionality)
Breaking change (fix or feature that would break existing design and interface)
Others (enhancement, documentation, validation, etc.)

Dependencies

None

Tests

Ran README manually and toxicity detection test script test_guardrails_bias_detection_toxicdetection.py

Signed-off-by: Daniel Deleon [email protected]

for more information, see https://pre-commit.ci

Signed-off-by: Daniel Deleon <[email protected]>

comps/guardrails/deployment/docker_compose/compose.yaml

comps/guardrails/src/toxicity_detection/README.md

lvliang-intel · 2025-02-08T05:41:45Z

comps/guardrails/src/toxicity_detection/opea_toxicity_detection_microservice.py

+toxicity_detection_port = int(os.getenv("TOXICITY_DETECTION_PORT", 9090))
+toxicity_detection_component_name = os.getenv("TOXICITY_DETECTION_COMPONENT_NAME", "OPEA_NATIVE_TOXICITY")
+
+print(f"HELLO:-{toxicity_detection_component_name}-")


Don't use print in the code.

daniel-de-leon-user293 and others added 5 commits January 30, 2025 15:29

add opea native support for toxic-prompt-roberta

b8a4f9f

add test script back

075c7bb

Merge branch 'main' into daniel/update-guardrails-docs

a154df2

[pre-commit.ci] auto fixes from pre-commit.com hooks

314b1e6

for more information, see https://pre-commit.ci

Merge branch 'main' into daniel/update-guardrails-docs

34b5278

daniel-de-leon-user293 requested review from lvliang-intel, letonghan, ftian1 and chensuyue as code owners February 5, 2025 18:22

ashahba requested a review from qgao007 February 5, 2025 18:28

daniel-de-leon-user293 added 4 commits February 5, 2025 11:28

add comp name env variable

518e33e

set default port to 9090

3ba48bb

Signed-off-by: Daniel Deleon <[email protected]>

add service to compose

425e6a8

Signed-off-by: Daniel Deleon <[email protected]>

Merge branch 'main' into daniel/update-guardrails-docs

350cdb5

qgao007 reviewed Feb 6, 2025

View reviewed changes

comps/guardrails/deployment/docker_compose/compose.yaml Show resolved Hide resolved

comps/guardrails/src/toxicity_detection/README.md Show resolved Hide resolved

daniel-de-leon-user293 added 2 commits February 6, 2025 14:46

Merge branch 'main' into daniel/update-guardrails-docs

ba1f075

Merge branch 'main' into daniel/update-guardrails-docs

4f75aa2

lvliang-intel reviewed Feb 8, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add native support for toxicity detection guardrail microservice #1258

Add native support for toxicity detection guardrail microservice #1258

daniel-de-leon-user293 commented Feb 5, 2025 •

edited

Loading

lvliang-intel Feb 8, 2025

Add native support for toxicity detection guardrail microservice #1258

Are you sure you want to change the base?

Add native support for toxicity detection guardrail microservice #1258

Conversation

daniel-de-leon-user293 commented Feb 5, 2025 • edited Loading

Description

Issues

Type of change

Dependencies

Tests

lvliang-intel Feb 8, 2025

Choose a reason for hiding this comment

daniel-de-leon-user293 commented Feb 5, 2025 •

edited

Loading