Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Profanity check SGID data #107

Open
gregbunce opened this issue Sep 18, 2023 · 7 comments
Open

Profanity check SGID data #107

gregbunce opened this issue Sep 18, 2023 · 7 comments
Assignees
Labels
enhancement New feature or request

Comments

@gregbunce
Copy link
Contributor

gregbunce commented Sep 18, 2023

it would be helpful to have a function that looks through the data names and scans for derogatory names in the data - think trailheads, trail names, place names, etc.. This could be a good opportunity to leverage AI.

@gregbunce
Copy link
Contributor Author

FYI: we do have a derogatory name in the trailheads data - it's a former name. I'm working on this now to clean it up.

@gregbunce
Copy link
Contributor Author

a possible solution to look at:
https://github.com/surge-ai/profanity

@steveoh
Copy link
Member

steveoh commented Sep 19, 2023

I think we'd probably stick to gcp or maybe aws.

https://cloud.google.com/natural-language/docs/moderating-text

@gregbunce
Copy link
Contributor Author

Moving this FY25 Q1 and hopefully things will settle down a bit by then to make some progress on it.

@gregbunce gregbunce added the enhancement New feature or request label Sep 25, 2024
@steveoh steveoh changed the title add function to look for derogatory names in the data Profanity check SGID data Sep 26, 2024
@gregbunce gregbunce assigned ZachBeck and unassigned gregbunce Oct 29, 2024
@steveoh
Copy link
Member

steveoh commented Nov 13, 2024

I submitted a google request to get the "s" word added. Are there any other words we know about in our data that needs to be replaced?

I tested the word and it's still not being flagged! Since I couldn't find the original request I created a new one (internal ref: 377718296).
Product let me know the way I submitted the feature request should work to add new words. It may still take time to implement but I'll be able to provide you updates.
Please let me know what other words we should be tracking.

@ZachBeck
Copy link
Member

Removed squaw names from NHD Lakes, NHD Streams, and UGRC version of the GNIS.

@steveoh
Copy link
Member

steveoh commented Feb 5, 2025

Good news, engineering rolled out a new version of the API and this now flags the derogatory word. To test you’ll need to include "modelVersion": "MODEL_VERSION_2" on the API call.

Using the examples from Wikipedia, it now flags under the “Derogatory” category.

Image
Let me know if you have any questions or other examples.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
Status: No status
Development

No branches or pull requests

3 participants