[ENG-6654] fixfix: correct misunderstanding, handle conflicts #836

aaxelb · 2024-12-03T21:02:29Z

- partly revert [ENG-6654] indexer recover from connection errors #835 (remove ensure_ack, ack_callback) -- messages can only be ack'd thru the channel they were received by, so trying to recover a channel to ack thru does nothing
- handle easily-detectable delete_by_query conflicts in a followup celery task (so the indexer doesn't fall over on it)
- local setup: allow the worker's daemon user to access elastic8
- update consumer prefetch limit to match the indexing chunk size (seems to avoid connection errors, maybe this is why)
- try avoiding connection errors when calling `message.ack()` -- do a bit more book-keeping to know when a bulk elastic action is the last for its index-card, so it can ack immediately then instead of waiting 'til the whole chunk has completed
- test updates to handle changes...
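
The ack book-keeping described above could look roughly like the following. This is a minimal sketch, not the code in this PR: `AckTracker`, `expect`, and `action_done` are hypothetical names, and the `ack` callback stands in for whatever eventually calls `message.ack()` on the channel the message arrived on.

```python
from collections import Counter
from typing import Callable, Hashable, Iterable


class AckTracker:
    """Count pending bulk actions per index-card (hypothetical helper)."""

    def __init__(self, ack: Callable[[Hashable], None]) -> None:
        self._ack = ack  # assumed to wrap message.ack() for the given card
        self._pending: Counter = Counter()

    def expect(self, target_ids: Iterable[Hashable]) -> None:
        """Register one pending bulk action per occurrence of a card id."""
        self._pending.update(target_ids)

    def action_done(self, target_id: Hashable) -> bool:
        """Record a finished action; ack and return True if it was the last."""
        if self._pending[target_id] <= 0:
            raise ValueError(f"no pending actions for {target_id!r}")
        self._pending[target_id] -= 1
        if self._pending[target_id] == 0:
            del self._pending[target_id]
            self._ack(target_id)  # ack now, without waiting for the chunk
            return True
        return False


# Usage: card 1 produces two bulk actions, card 2 produces one.
acked = []
tracker = AckTracker(acked.append)
tracker.expect([1, 1, 2])
tracker.action_done(1)  # card 1 still has an action pending
tracker.action_done(2)  # last action for card 2, acked right away
tracker.action_done(1)  # last action for card 1, acked
```

Counting per card rather than per chunk is what lets an ack go out as soon as a card's last action has been handled.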

coveralls · 2024-12-03T21:17:20Z

coverage: 91.219% (+0.04%) from 91.179%
when pulling 0880650 on aaxelb:fix/indexer-fallback-failure
into c85fb0d on CenterForOpenScience:develop.

- messages can _only_ be ack'd thru the channel they were received by -- trying to recover a channel to ack thru does nothing
- handle easily-detectable delete_by_query conflicts in a followup celery task (so the indexer doesn't fall over on it)
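
One way the conflict followup could be wired, as a hedged sketch rather than the PR's actual code: it assumes the delete_by_query request was sent with conflicts set to proceed, so Elasticsearch counts conflicts in the response's `version_conflicts` field instead of aborting. `followup_if_conflicted` is a hypothetical helper, and `schedule` stands in for a celery task's `apply_async`; the `card_pks`, `timestamp`, and `countdown` arguments mirror the diff under review.

```python
from typing import Any, Callable, Mapping, Sequence


def followup_if_conflicted(
    delete_response: Mapping[str, Any],
    card_pks: Sequence[int],
    timestamp: Any,
    schedule: Callable[..., Any],
    delay_seconds: int = 3,
) -> bool:
    """Schedule a delayed retry when delete_by_query reported conflicts.

    Returns True if a followup was scheduled, False otherwise.
    """
    # The delete_by_query response body carries a "version_conflicts" count.
    if not delete_response.get("version_conflicts", 0):
        return False
    # `schedule` is a stand-in for a celery task's apply_async (assumption).
    schedule(
        kwargs={"card_pks": list(card_pks), "timestamp": timestamp},
        countdown=delay_seconds,
    )
    return True


# Usage with a fake scheduler that only records its calls.
scheduled = []
followup_if_conflicted(
    {"deleted": 4, "version_conflicts": 2},
    card_pks=[11, 12],
    timestamp=1733259749,
    schedule=lambda **call: scheduled.append(call),
)
```

Passing the scheduler in as a callable keeps the sketch runnable without a broker; in the real indexer the task itself would be used.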

mfraezz

One Q, one minor thing to mitigate TODOs, but generally looks good.

Pass complete

share/search/daemon.py

mfraezz · 2024-12-05T17:15:16Z

share/search/index_strategy/trovesearch_denorm.py

+                'card_pks': messages_chunk.target_ids_chunk,
+                'timestamp': messages_chunk.timestamp,
+            },
+            countdown=3,  # TODO: config?


Minor: Should probably be configurable, yes.
Something like
`settings.ELASTICSEARCH['COUNTDOWN_DELAY'] = int(os.environ.get('ELASTICSEARCH_COUNTDOWN_DELAY', 3))`
would probably be reasonable.

aaxelb · 2024-12-05

went with ELASTICSEARCH_POST_INDEX_DELAY
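
For reference, reading that variable could be sketched as below. Only the environment variable name and the default of 3 come from this thread; the `post_index_delay` helper and where the value ends up in settings are assumptions made for illustration.

```python
import os
from typing import Mapping


def post_index_delay(
    environ: Mapping[str, str] = os.environ,
    default: int = 3,
) -> int:
    """Seconds to wait before the followup task runs (hypothetical helper)."""
    raw = environ.get("ELASTICSEARCH_POST_INDEX_DELAY")
    if raw is None or raw.strip() == "":
        return default  # unset or blank: keep the previously hardcoded value
    return int(raw)  # raises ValueError on non-numeric input


# Usage: the result would replace the hardcoded countdown=3.
delay_unset = post_index_delay({})
delay_custom = post_index_delay({"ELASTICSEARCH_POST_INDEX_DELAY": "10"})
```

Taking the environment as a parameter keeps the helper easy to test without touching the real process environment.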

aaxelb force-pushed the fix/indexer-fallback-failure branch 2 times, most recently from 8f54393 to 99f3eb2 on December 3, 2024 21:10

aaxelb added 2 commits December 4, 2024 11:04

fix(local): allow worker to use elastic8 directly

adc9b83

aaxelb force-pushed the fix/indexer-fallback-failure branch from 99f3eb2 to adc9b83 on December 4, 2024 16:06

fix(tests): fake index strategies and eager tasks

bd1a3b7

aaxelb changed the title ~~fixfix: correct misunderstanding, handle conflicts~~ [ENG-6654] fixfix: correct misunderstanding, handle conflicts Dec 4, 2024

aaxelb added 5 commits December 4, 2024 12:31

fix: stop swallowing errors on ack

ee717c5

fix? trovesearch_denorm: ack messages sooner

ca80590

try avoiding connection errors when calling `message.ack()` -- do a bit more book-keeping to know when a bulk elastic action is the last for its index-card, so it can ack immediately then instead of waiting 'til the whole chunk has completed

plac8 flake8

50d34f6

fix? smaller consumer prefetch

ae8e9fc

fix? avoid 'unhandled messages'

f770082

aaxelb marked this pull request as ready for review December 4, 2024 22:07

aaxelb requested a review from mfraezz December 4, 2024 22:08

mfraezz approved these changes Dec 5, 2024

move constant to env ELASTICSEARCH_POST_INDEX_DELAY

0880650

aaxelb merged commit 317b7dc into CenterForOpenScience:develop Dec 5, 2024
3 checks passed

aaxelb deleted the fix/indexer-fallback-failure branch December 5, 2024 18:33
