For query classification:
I got 388 categories with the minimum number of queries per category left at its default of 1000, and only 70 (!) with a minimum of 10K queries per category.
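To illustrate how the minimum-queries threshold shrinks the category set, here is a minimal sketch. It only counts which categories meet the threshold. The function name `categories_meeting_minimum` and the toy labels are hypothetical, and what the actual pipeline does with categories below the threshold is not shown here.

```python
from collections import Counter


def categories_meeting_minimum(query_categories, min_queries=1000):
    """Return {category: query_count} for categories with at least min_queries queries.

    query_categories holds one category label per labeled query.
    """
    counts = Counter(query_categories)
    return {cat: n for cat, n in counts.items() if n >= min_queries}


# Toy data (hypothetical labels), one entry per query.
sample = ["cat_a"] * 5 + ["cat_b"] * 2 + ["cat_c"]

kept = categories_meeting_minimum(sample, min_queries=2)
print(sorted(kept))
```

Raising `min_queries` can only keep the same categories or drop some, which matches the drop from 388 to 70 categories reported above.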
For integrating query classification with search:
xbox: the result count dropped sharply, from 3563 to just 69! The results were very positive, as the xbox query was filtered to the category “videogames and consoles”. The results were therefore just xbox consoles and console bundles, with games and peripherals filtered out. Predicted categories and probability scores:
['__label__abcat0701001', '__label__abcat0700000', '__label__abcat0715001', '__label__abcat0715002', '__label__cat02724']
[0.51762199 0.14868429 0.09926192 0.02738438 0.02623567]
Ps4: the same effect as above, filtering down to the consoles category, from 1170 results to just 15. Probability scores 👇🏻
['__label__cat02015', '__label__cat02009', '__label__cat09000', '__label__cat02001', '__label__abcat0900000']
[0.69825941 0.04239719 0.03412125 0.02841064 0.02170236]
iphone: went from 3241 results to just 776, but the results were not iPhone devices themselves; they were accessories for them.
['__label__abcat0811002', '__label__pcmcat209400050001', '__label__abcat0208011', '__label__pcmcat201900050009', '__label__abcat0208007']
[0.58636183 0.23716259 0.02968958 0.02421849 0.01961429]
dress: from 633 results to 69, but again the filter did not pick the correct category. The results were mostly movies in various formats.
['__label__cat02015', '__label__cat02009', '__label__cat09000', '__label__cat02001', '__label__abcat0900000']
[0.6951161 0.04240536 0.03508785 0.0285547 0.02147051]
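The filtering step used in the examples above can be sketched roughly as follows. This is an illustration under assumptions, not the code in this PR: the 0.5 score threshold, the field name `categoryPathIds`, and both helper names are hypothetical. The labels and scores are the xbox output listed above.

```python
LABEL_PREFIX = "__label__"


def confident_categories(labels, scores, min_score=0.5):
    """Strip the label prefix and keep categories scoring at least min_score."""
    return [
        label[len(LABEL_PREFIX):] if label.startswith(LABEL_PREFIX) else label
        for label, score in zip(labels, scores)
        if score >= min_score
    ]


def build_category_filter(category_ids, field="categoryPathIds"):
    """Build a terms-style filter clause; return None when nothing is confident enough."""
    if not category_ids:
        return None
    return {"terms": {field: category_ids}}


# Classifier output for the query "xbox", copied from the results above.
xbox_labels = ['__label__abcat0701001', '__label__abcat0700000',
               '__label__abcat0715001', '__label__abcat0715002',
               '__label__cat02724']
xbox_scores = [0.51762199, 0.14868429, 0.09926192, 0.02738438, 0.02623567]

xbox_filter = build_category_filter(confident_categories(xbox_labels, xbox_scores))
print(xbox_filter)
```

Returning `None` when no category clears the threshold lets the caller fall back to the unfiltered query. A threshold alone would not catch the dress case, though, since its top score (about 0.70) is high even though the category is wrong.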