Search Platform/Weekly Updates/2023-07-28

From Wikitech

Summary

The team is focused on the Search Update Pipeline and Improvements to Multilingual Zero-Result Rate.

A new weekly sync meeting around the deployment of the Search Update Pipeline to Flink on k8s is helping unblock various questions and decisions (Thanks Luke!). Service Ops, Data Platform Engineering, Search Platform and Data Platform SRE are involved.

The integration tests for CirrusSearch are flaky (they have been for a long time) and causing delay in getting work merged. We might need to invest more time into reworking those instead of fixing individual errors as they come (and come again).

The Q&A session around the "unpacking analyzer" work was well received.

What we've accomplished

Search Update Pipeline

Improve multilingual zero-results rate

  • Delays due to instability of integration tests for CirrusSearch
  • Working on acronym/WBH write-up and while working through my explanation I realized I missed an edge case for the acronym_fixer that affects Brahmic scripts (Bengalis, Hindi, Khmer, Thai, etc.). Need to do a few more checks and then I'll have another small patch up, which will delay reindexing a little, alas. https://phabricator.wikimedia.org/T170625

Operations