Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

relevancy threshold for queries including Berlin #92

Open
duncdrum opened this issue Jul 2, 2024 · 3 comments
Open

relevancy threshold for queries including Berlin #92

duncdrum opened this issue Jul 2, 2024 · 3 comments
Labels
question Further information is requested

Comments

@duncdrum
Copy link
Member

duncdrum commented Jul 2, 2024

Some quick experiments for AllFields default queries.

Query Results Link
"Berlin History" 1.194 results
"Berlin History" NOT "berlin.de" 804 results
Berlin History 9.518.477 results
Berlin History NOT berlin.de 5.325 results
Berlin History NOT spk-berlin.de 100.691 results
@duncdrum duncdrum added the question Further information is requested label Jul 2, 2024
duncdrum added a commit that referenced this issue Jul 15, 2024
this is a regression

see investigate failing multi-lang test #88
see #92
duncdrum added a commit that referenced this issue Jul 15, 2024
this is a regression

see investigate failing multi-lang test #88
see #92
@JBrechmacher
Copy link

Was genau ist die Frage?
Bezieht sich das Issue auf den "place"-Test? Gab es hier Probleme mit den Anpassungen im "mulit-language"-Test?

@duncdrum
Copy link
Member Author

Die Frage ist ob wir 9.5 Mio Treffer als relevant ausgeben wollen, weil berlin.de Teil der HAN URL aller unserer elektronischer Ressourcen ist. Bei query strings mit Berlin könnten wir HAN URLs filtern. Die verschiedenen Queries in der Tabelle zeigen in welcher Größenordnung die tatsächliche relevante Treffermenge liegt.

@JBrechmacher
Copy link

Vielen Dank für die Ausführungen.
Bei dem geschilderten Sachverhalt können wir direkt sagen, dass HAN-URLs keine Relevanz für das Ranking haben und entsprechend herausgefiltert werden sollten.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants