EBernhardson (EBernhardson)
User

User Details

User Since
Oct 7 2014, 4:49 PM (562 w, 2 d)
Availability
Available
LDAP User
EBernhardson
MediaWiki User
EBernhardson (WMF) [ Global Accounts ]

Recent Activity

Today

EBernhardson added a comment to T397528: 🧬 Propagate provenance of scholarly vs main graph queries to processed table.

Looks to be working as expected, with data starting to arrive on the 15th. Looks like data exists for all rows:

Thu, Jul 17, 1:45 PM · Patch-For-Review, Wikibase Cloud (Kanban Board Q3 2025)
EBernhardson moved T398820: Add RKD to WDQS allowlist from Done to In Progress on the Data-Platform-SRE (2025.07.05 - 2025.07.25) board.
Thu, Jul 17, 1:39 PM · Discovery-Search (2025.07.04 - 2025.07.25), Data-Platform-SRE (2025.07.05 - 2025.07.25), Wikibase Cloud, Wikidata, Wikidata-Query-Service

Yesterday

EBernhardson added a comment to T398820: Add RKD to WDQS allowlist.

Construct an awkward thing that checks all the servers for errors:

# For every WDQS host, print the host name followed by the "not allowed" error line, if the host still returns it.
for host in $(grep -Ev '^(#|$)' /etc/dsh/group/wdqs | sort); do
    echo "$host: $(curl -sk "https://$host/sparql?query=SELECT%20%3Fs%20%3Fp%20%3Fo%20%7B%0A%20%20SERVICE%20%3Chttps%3A%2F%2Frkd.triply.cc%2F_api%2Fdatasets%2Frkd%2FRKD-Knowledge-Graph%2Fservices%2FSPARQL%2Fsparql%3E%20%7B%0A%20%20%20%20%3Fs%20%3Fp%20%3Fo%0A%20%20%7D%0A%7D%0ALIMIT%2010" | grep '^Caused by: java.lang.IllegalArgumentException: Service URI https://rkd.triply.cc/_api/datasets/rkd/RKD-Knowledge-Graph/services/SPARQL/sparql is not allowed$')"
done
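
For readers who want to see what the percent-encoded query string in the loop contains, here is a minimal Python sketch that rebuilds it. The function name sparql_url and the host example.org are illustrative assumptions; only the service URI and the query text come from the comment itself.

```python
from urllib.parse import quote

# Service URI copied from the comment above.
SERVICE_URI = "https://rkd.triply.cc/_api/datasets/rkd/RKD-Knowledge-Graph/services/SPARQL/sparql"

# The federated query that the encoded URL in the shell loop decodes to.
QUERY = (
    "SELECT ?s ?p ?o {\n"
    f"  SERVICE <{SERVICE_URI}> {{\n"
    "    ?s ?p ?o\n"
    "  }\n"
    "}\n"
    "LIMIT 10"
)


def sparql_url(host: str) -> str:
    """Return the per-host URL with every reserved character percent-encoded."""
    return f"https://{host}/sparql?query={quote(QUERY, safe='')}"


# example.org is a placeholder host, not a real WDQS server.
print(sparql_url("example.org"))
```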
Wed, Jul 16, 6:58 PM · Discovery-Search (2025.07.04 - 2025.07.25), Data-Platform-SRE (2025.07.05 - 2025.07.25), Wikibase Cloud, Wikidata, Wikidata-Query-Service

Tue, Jul 15

EBernhardson moved T398820: Add RKD to WDQS allowlist from Needs Review to Done on the Discovery-Search (2025.06.13 - 2025.07.04) board.
Tue, Jul 15, 7:21 PM · Discovery-Search (2025.07.04 - 2025.07.25), Data-Platform-SRE (2025.07.05 - 2025.07.25), Wikibase Cloud, Wikidata, Wikidata-Query-Service
EBernhardson moved T398820: Add RKD to WDQS allowlist from In Progress to Done on the Data-Platform-SRE (2025.07.05 - 2025.07.25) board.

Looks to query successfully

Tue, Jul 15, 7:21 PM · Discovery-Search (2025.07.04 - 2025.07.25), Data-Platform-SRE (2025.07.05 - 2025.07.25), Wikibase Cloud, Wikidata, Wikidata-Query-Service
EBernhardson moved T399617: Archive operations/software/opensearch/plugins (moved to gitlab) from Incoming to Needs Review on the Discovery-Search (2025.07.04 - 2025.07.25) board.
Tue, Jul 15, 7:16 PM · Projects-Cleanup, Discovery-Search (2025.07.04 - 2025.07.25)
EBernhardson updated the task description for T399617: Archive operations/software/opensearch/plugins (moved to gitlab).
Tue, Jul 15, 6:46 PM · Projects-Cleanup, Discovery-Search (2025.07.04 - 2025.07.25)
EBernhardson updated the task description for T399617: Archive operations/software/opensearch/plugins (moved to gitlab).
Tue, Jul 15, 6:42 PM · Projects-Cleanup, Discovery-Search (2025.07.04 - 2025.07.25)
EBernhardson updated the task description for T399617: Archive operations/software/opensearch/plugins (moved to gitlab).
Tue, Jul 15, 6:41 PM · Projects-Cleanup, Discovery-Search (2025.07.04 - 2025.07.25)
EBernhardson updated the task description for T399617: Archive operations/software/opensearch/plugins (moved to gitlab).
Tue, Jul 15, 6:31 PM · Projects-Cleanup, Discovery-Search (2025.07.04 - 2025.07.25)
EBernhardson renamed T399617: Archive operations/software/opensearch/plugins (moved to gitlab) from Migrate operations/software/opensearch/plugins to gitlab to Archive operations/software/opensearch/plugins (moved to gitlab).
Tue, Jul 15, 6:29 PM · Projects-Cleanup, Discovery-Search (2025.07.04 - 2025.07.25)
EBernhardson updated the task description for T399617: Archive operations/software/opensearch/plugins (moved to gitlab).
Tue, Jul 15, 6:23 PM · Projects-Cleanup, Discovery-Search (2025.07.04 - 2025.07.25)
EBernhardson created T399617: Archive operations/software/opensearch/plugins (moved to gitlab).
Tue, Jul 15, 6:23 PM · Projects-Cleanup, Discovery-Search (2025.07.04 - 2025.07.25)

Mon, Jul 14

EBernhardson claimed T398820: Add RKD to WDQS allowlist.
Mon, Jul 14, 5:33 PM · Discovery-Search (2025.07.04 - 2025.07.25), Data-Platform-SRE (2025.07.05 - 2025.07.25), Wikibase Cloud, Wikidata, Wikidata-Query-Service

Fri, Jul 11

EBernhardson added a comment to T395880: Simple search results don't show the best matching alias in match data.

The problem here is that we are primarily highlighting against the text field, which is a wide variety of data stuffed together into a single string. The highlighter doesn't know it should treat this as many different strings and pick between them; it highlights the field as if it were paragraphs of content. We do some post-processing on the PHP side to turn that highlighted text field into something more presentable, but trying to solve this problem there is always going to be hacky.

Fri, Jul 11, 5:32 PM · Discovery-Search (2025.07.04 - 2025.07.25), Wikibase Product Platform Team WPP
EBernhardson added a comment to T317599: Allow ^ and $ in intitle regex search.

Hello @EBernhardson,

For Tech News - What wording would you suggest as the content, and when should it be included? Thanks!

Fri, Jul 11, 4:38 PM · Discovery-Search (2025.07.04 - 2025.07.25), User-notice, CirrusSearch

Thu, Jul 10

EBernhardson moved T399162: Regression: Cirrus exact string regexp search for insource:/"u.a."/ has stopped working from Incoming to Needs Review on the Discovery-Search (2025.07.04 - 2025.07.25) board.
Thu, Jul 10, 7:26 PM · Patch-For-Review, Discovery-Search (2025.07.04 - 2025.07.25), Regression, CirrusSearch
EBernhardson added a comment to T399162: Regression: Cirrus exact string regexp search for insource:/"u.a."/ has stopped working.

There are two things going on here. The first is that quotes in the regexp have special meaning: they don't match the quote character, rather they delimit a part of the string that has to match literally. So the search query insource:/"u.a."/ searches for the literal string u.a.; it does not search for the quotation characters and it does not treat the . as a match-all. The second is that the recent changes to the regex engine didn't take this into account, and are rewriting the . into a semantically equivalent form.
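
The quoting rule described here can be illustrated with a small Python sketch. It is not the CirrusSearch or plugin implementation; the helper quoted_literals_to_regex is hypothetical and assumes balanced quotes with no escaped quote characters.

```python
import re


def quoted_literals_to_regex(pattern: str) -> str:
    """Translate "..." segments into escaped literals; leave the rest as regex."""
    parts = pattern.split('"')
    # After splitting on quotes, odd-numbered parts were inside a quoted segment.
    return "".join(re.escape(p) if i % 2 else p for i, p in enumerate(parts))


rx = re.compile(quoted_literals_to_regex('"u.a."'))
print(bool(rx.search("vgl. u.a. hier")))  # the literal text u.a. is present
print(bool(rx.search("usa.")))            # '.' inside quotes is not a wildcard
```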

Thu, Jul 10, 5:21 PM · Patch-For-Review, Discovery-Search (2025.07.04 - 2025.07.25), Regression, CirrusSearch
EBernhardson claimed T399162: Regression: Cirrus exact string regexp search for insource:/"u.a."/ has stopped working.

This is most likely related to the plugin deployment for T317599.

Thu, Jul 10, 4:39 PM · Patch-For-Review, Discovery-Search (2025.07.04 - 2025.07.25), Regression, CirrusSearch

Wed, Jul 9

EBernhardson updated subscribers of T391175: Global Search displays most search results twice.

@MusikAnimal

Wed, Jul 9, 8:06 PM · Data-Platform-SRE, Tool-global-search
EBernhardson added a comment to T391175: Global Search displays most search results twice.

merge request: https://github.com/wikimedia/tools-global-search/pull/118

Wed, Jul 9, 8:05 PM · Data-Platform-SRE, Tool-global-search
EBernhardson moved T395109: UpdateSuggesterIndex should fail early if the main indices do not exist from Needs Review to To be Deployed on the Discovery-Search (2025.06.13 - 2025.07.04) board.
Wed, Jul 9, 7:34 PM · MW-1.45-notes (1.45.0-wmf.10; 2025-07-15), Discovery-Search (2025.06.13 - 2025.07.04), CirrusSearch
EBernhardson added a comment to T391175: Global Search displays most search results twice.

Most plausibly the global-search side is picking up indices that are not live on the production side. IIRC this runs queries against the * index; it may have to be a little more selective and use *_content,*_general,*_file
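
As a rough illustration of how the narrower patterns differ from *, the sketch below filters a list of names with shell-style wildcards. The index names are made up for the example, and fnmatch only approximates how the search cluster resolves wildcard index expressions.

```python
from fnmatch import fnmatch

# Patterns taken from the comment above.
PATTERNS = ["*_content", "*_general", "*_file"]


def select_indices(names):
    """Keep only the names that match one of the narrower patterns."""
    return [n for n in names if any(fnmatch(n, p) for p in PATTERNS)]


# Hypothetical names, for illustration only.
names = [
    "enwiki_content",
    "enwiki_general",
    "commonswiki_file",
    "enwiki_titlesuggest",
    "enwiki_content_1700000000",
]
print(select_indices(names))
```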

Wed, Jul 9, 1:32 PM · Data-Platform-SRE, Tool-global-search

Tue, Jul 8

EBernhardson claimed T395109: UpdateSuggesterIndex should fail early if the main indices do not exist.
Tue, Jul 8, 6:40 PM · MW-1.45-notes (1.45.0-wmf.10; 2025-07-15), Discovery-Search (2025.06.13 - 2025.07.04), CirrusSearch
EBernhardson moved T398133: Special:Search should suggest creating page in the namespace in which was searched from Needs Review to To be Deployed on the Discovery-Search (2025.06.13 - 2025.07.04) board.
Tue, Jul 8, 6:27 PM · MW-1.45-notes (1.45.0-wmf.9; 2025-07-08), Discovery-Search (2025.06.13 - 2025.07.04), MediaWiki-Search
EBernhardson added a comment to T396002: 🧬 Investigate a way to propagate the provenance of queryservice-ui source data into the processed query table.

Updated bits were deployed and are now running. I re-ran a few recent hours through and we get the following counts; not sure if these are expected. Essentially it sees a bit under 1k/hr in the couple of hours I re-ran.

Tue, Jul 8, 6:25 PM · Wikibase Cloud (Kanban Board Q2 2025)
EBernhardson moved T394720: SearchSatisfaction schema uses deprecated performance.timing from Needs Review to To be Deployed on the Discovery-Search (2025.06.13 - 2025.07.04) board.
Tue, Jul 8, 5:43 PM · MW-1.45-notes (1.45.0-wmf.10; 2025-07-15), Discovery-Search (2025.06.13 - 2025.07.04), MediaWiki-extensions-WikimediaEvents
EBernhardson added a comment to T397227: Build and deploy OpenSearch plugins package for updated regex search.

Looks like a couple hosts in codfw still need a restart:

Tue, Jul 8, 3:44 PM · Data-Platform-SRE (2025.07.05 - 2025.07.25)
EBernhardson moved T397732: Run a test evaluating fuzziness of completion suggester from To be Deployed to Needs Review on the Discovery-Search (2025.06.13 - 2025.07.04) board.
Tue, Jul 8, 3:39 PM · Discovery-Search (2025.07.04 - 2025.07.25), MW-1.45-notes (1.45.0-wmf.9; 2025-07-08)
EBernhardson added a comment to T397732: Run a test evaluating fuzziness of completion suggester.

Started the test; sadly it's not working as expected. It's been some time since we ran an AB test involving autocomplete, and since then the JavaScript layer has changed. The end result is that we are not attaching the testing parameter to the autocomplete API requests, meaning no test treatment is being applied.

Tue, Jul 8, 2:26 PM · Discovery-Search (2025.07.04 - 2025.07.25), MW-1.45-notes (1.45.0-wmf.9; 2025-07-08)

Mon, Jul 7

EBernhardson claimed T394720: SearchSatisfaction schema uses deprecated performance.timing.
Mon, Jul 7, 7:42 PM · MW-1.45-notes (1.45.0-wmf.10; 2025-07-15), Discovery-Search (2025.06.13 - 2025.07.04), MediaWiki-extensions-WikimediaEvents
EBernhardson edited projects for T393164: Show template transclusion count next to each search result in the template picker dropdown, added: Discovery-Search; removed Discovery-Search (2025.06.13 - 2025.07.04).
Mon, Jul 7, 7:18 PM · Community-Tech, VisualEditor, TemplateData, Template-Discovery-And-Recall
EBernhardson claimed T398133: Special:Search should suggest creating page in the namespace in which was searched.
Mon, Jul 7, 7:17 PM · MW-1.45-notes (1.45.0-wmf.9; 2025-07-08), Discovery-Search (2025.06.13 - 2025.07.04), MediaWiki-Search
EBernhardson added a comment to T397732: Run a test evaluating fuzziness of completion suggester.

In terms of metrics, what we should be looking at:

Mon, Jul 7, 6:30 PM · Discovery-Search (2025.07.04 - 2025.07.25), MW-1.45-notes (1.45.0-wmf.9; 2025-07-08)
EBernhardson moved T397732: Run a test evaluating fuzziness of completion suggester from Needs Review to To be Deployed on the Discovery-Search (2025.06.13 - 2025.07.04) board.
Mon, Jul 7, 6:24 PM · Discovery-Search (2025.07.04 - 2025.07.25), MW-1.45-notes (1.45.0-wmf.9; 2025-07-08)
EBernhardson moved T395465: Investigate EQIAD daily completion suggester rebuild failure from Needs Review to Done on the Discovery-Search (2025.06.13 - 2025.07.04) board.

This looks to have worked as expected. Checked by reviewing max(sum by (pod) (container_memory_usage_bytes{namespace="mw-cron", pod=~"cirrus-build-completion-indices-.*", container="mediawiki-main-app"})) for the last 7 days. This shows that peak memory usage of an individual cirrus-build-completion-indices pod decreased from >2 GB to ~550 MB on July 3rd.

Mon, Jul 7, 6:23 PM · Data-Platform-SRE (2025.07.05 - 2025.07.25), MW-1.45-notes (1.45.0-wmf.8; 2025-07-01), Discovery-Search (2025.06.13 - 2025.07.04), MW-on-K8s, serviceops
EBernhardson added a comment to T398461: Attempt to build multi-arch cirrussearch-opensearch docker image.

We use a custom set of plugins for OpenSearch, including a number that we build ourselves. Those all get packaged up together into a set, which guarantees that development and production have all the same things with the same versions configured and available.

Mon, Jul 7, 1:59 PM · Data-Platform-SRE, Discovery-Search

Tue, Jul 1

EBernhardson added a comment to T398273: Search autocomplete is returning incomplete results (again).

General investigation:

Tue, Jul 1, 8:36 PM · Discovery-Search (2025.06.13 - 2025.07.04), CirrusSearch
EBernhardson added a comment to T398273: Search autocomplete is returning incomplete results (again).

I've also been pondering how we might detect this kind of issue in the future. Perhaps, during the daily completion suggester rebuild, we could increment a counter every time we notice that we are missing some externally populated fields. We could then monitor that count for a week or two to find the typical range of values and alert whenever we get outside those limits.
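
A minimal sketch of the monitoring idea, assuming the typical range is taken as the baseline mean plus or minus a few standard deviations. The function names, the multiplier k, and the daily counts are all made up for illustration; nothing here reflects an existing alert.

```python
from statistics import mean, stdev


def typical_range(baseline, k=3.0):
    """Return (low, high) bounds as mean +/- k standard deviations, floored at 0."""
    m, s = mean(baseline), stdev(baseline)
    return max(0.0, m - k * s), m + k * s


def should_alert(count, bounds):
    """True when a daily count falls outside the learned bounds."""
    low, high = bounds
    return count < low or count > high


# Made-up daily counts of documents missing externally populated fields,
# standing in for two weeks of observation.
baseline = [120, 135, 110, 128, 140, 125, 131, 118, 122, 137, 129, 126, 133, 121]
bounds = typical_range(baseline)
print(bounds, should_alert(125, bounds), should_alert(5000, bounds))
```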

Tue, Jul 1, 6:43 PM · Discovery-Search (2025.06.13 - 2025.07.04), CirrusSearch
EBernhardson claimed T398273: Search autocomplete is returning incomplete results (again).
Tue, Jul 1, 6:31 PM · Discovery-Search (2025.06.13 - 2025.07.04), CirrusSearch
EBernhardson added a comment to T398273: Search autocomplete is returning incomplete results (again).

Checked the data lake: we have been consistently generating popularity_score data for United States (project=en.wikipedia, page_id=3434750). Checking further down the pipeline, I can also find the same page in the bulk update files that we push into Elasticsearch. Essentially I'm reasonably certain the updates are still flowing.

Tue, Jul 1, 6:22 PM · Discovery-Search (2025.06.13 - 2025.07.04), CirrusSearch
EBernhardson added a comment to T398273: Search autocomplete is returning incomplete results (again).

We can look at the scoring information to get an idea of what is going on: https://en.wikipedia.org/wiki/Special:ApiSandbox#action=query&format=json&prop=cirruscompsuggestbuilddoc&titles=United%20States%7CUnited%20Kingdom&formatversion=2

Tue, Jul 1, 5:41 PM · Discovery-Search (2025.06.13 - 2025.07.04), CirrusSearch

Mon, Jun 30

EBernhardson claimed T396529: Reindex All Wikis to enable lots of things.
Mon, Jun 30, 3:15 PM · Discovery-Search (2025.07.04 - 2025.07.25), MW-1.45-notes (1.45.0-wmf.10; 2025-07-15)
EBernhardson moved T396529: Reindex All Wikis to enable lots of things from Incoming to In Progress on the Discovery-Search (2025.06.13 - 2025.07.04) board.
Mon, Jun 30, 3:14 PM · Discovery-Search (2025.07.04 - 2025.07.25), MW-1.45-notes (1.45.0-wmf.10; 2025-07-15)

Thu, Jun 26

EBernhardson claimed T395465: Investigate EQIAD daily completion suggester rebuild failure.

While the above is a reduced form of our maintenance script and does trigger an OOM, after more investigation I'm not certain that is the cause of our memory usage.

Thu, Jun 26, 6:36 PM · Data-Platform-SRE (2025.07.05 - 2025.07.25), MW-1.45-notes (1.45.0-wmf.8; 2025-07-01), Discovery-Search (2025.06.13 - 2025.07.04), MW-on-K8s, serviceops
EBernhardson created P78712 OOM reproduction.
Thu, Jun 26, 2:06 PM

Wed, Jun 25

EBernhardson added a comment to T395465: Investigate EQIAD daily completion suggester rebuild failure.

Relatively minimal reproduction of the OOM we trigger. It fails at around 2.3M cached entries. At a very general level the problem is that this MediaWiki code assumes a web request that ends in a few seconds at most, not a maintenance script that visits millions of pages in a single execution.
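
To make the failure mode concrete: a cache that is harmless within a short request grows without limit in a long-running script unless it is bounded. The sketch below shows one common mitigation, a size-capped LRU cache. It is an illustration only, not the MediaWiki code in question and not necessarily the fix that was applied; BoundedCache and its limit are hypothetical.

```python
from collections import OrderedDict


class BoundedCache:
    """LRU cache that evicts the least recently used entry once max_entries is exceeded."""

    def __init__(self, max_entries):
        self.max_entries = max_entries
        self._data = OrderedDict()

    def get(self, key, default=None):
        if key in self._data:
            self._data.move_to_end(key)  # mark as most recently used
            return self._data[key]
        return default

    def set(self, key, value):
        self._data[key] = value
        self._data.move_to_end(key)
        if len(self._data) > self.max_entries:
            self._data.popitem(last=False)  # drop the oldest entry

    def __len__(self):
        return len(self._data)


cache = BoundedCache(max_entries=1000)
for page_id in range(100_000):  # stand-in for a script visiting very many pages
    cache.set(page_id, "per-page data")
print(len(cache))  # stays at the cap instead of growing with the page count
```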

Wed, Jun 25, 9:29 PM · Data-Platform-SRE (2025.07.05 - 2025.07.25), MW-1.45-notes (1.45.0-wmf.8; 2025-07-01), Discovery-Search (2025.06.13 - 2025.07.04), MW-on-K8s, serviceops
EBernhardson created P78694 (An Untitled Masterwork).
Wed, Jun 25, 3:46 PM

Tue, Jun 24

EBernhardson moved T388538: Migrate discovery-search jobs to mw-cron from In Progress to Reported on the Discovery-Search (2025.06.13 - 2025.07.04) board.

Moving back to Reported, as we are tracking resolution of the memory issue in T395465.

Tue, Jun 24, 7:11 PM · Discovery-Search (2025.06.13 - 2025.07.04), Wikidata, Patch-For-Review, serviceops
EBernhardson moved T143553: Switching search traffic between datacenters should be faster from In Progress to Done on the Discovery-Search (2025.06.13 - 2025.07.04) board.
Tue, Jun 24, 7:09 PM · Discovery-Search (2025.06.13 - 2025.07.04), Data-Platform-SRE (2025.06.13 - 2025.07.04), Patch-For-Review, Sustainability (Incident Followup), Discovery-ARCHIVED, Elasticsearch
EBernhardson added a comment to T394295: Unexpected changes in search latency.

There isn't any particular Phabricator task for tracking where search traffic currently flows to. I suppose it's tracked in config-master discovery, but that's not human readable. The shift in traffic above was part of T370147. There will potentially be a similar shift when we move to OpenSearch 2, but that's probably 6+ months away. In general it's not particularly common that we have to disable one of the two search clusters, but it happens from time to time.

Tue, Jun 24, 5:40 PM · User-Michael, Discovery-Search (2025.06.13 - 2025.07.04), Growth-Team (Maintenance), Performance Issue, GrowthExperiments-NewcomerTasks
EBernhardson moved T397732: Run a test evaluating fuzziness of completion suggester from Incoming to Needs Review on the Discovery-Search (2025.06.13 - 2025.07.04) board.
Tue, Jun 24, 4:25 PM · Discovery-Search (2025.07.04 - 2025.07.25), MW-1.45-notes (1.45.0-wmf.9; 2025-07-08)
EBernhardson created T397732: Run a test evaluating fuzziness of completion suggester.
Tue, Jun 24, 3:37 PM · Discovery-Search (2025.07.04 - 2025.07.25), MW-1.45-notes (1.45.0-wmf.9; 2025-07-08)

Mon, Jun 23

EBernhardson moved T143553: Switching search traffic between datacenters should be faster from Blocked/Waiting to Done on the Data-Platform-SRE (2025.06.13 - 2025.07.04) board.

Traffic test complete, moved as expected. Commands have been documented at https://wikitech.wikimedia.org/wiki/Search/Elasticsearch_Administration#Multi-DC_%2F_Multi-Cluster_Operations

Mon, Jun 23, 7:14 PM · Discovery-Search (2025.06.13 - 2025.07.04), Data-Platform-SRE (2025.06.13 - 2025.07.04), Patch-For-Review, Sustainability (Incident Followup), Discovery-ARCHIVED, Elasticsearch
EBernhardson added a comment to T143553: Switching search traffic between datacenters should be faster.

In theory we should be able to depool codfw like so, causing all traffic to move to eqiad:

Mon, Jun 23, 4:57 PM · Discovery-Search (2025.06.13 - 2025.07.04), Data-Platform-SRE (2025.06.13 - 2025.07.04), Patch-For-Review, Sustainability (Incident Followup), Discovery-ARCHIVED, Elasticsearch
EBernhardson moved T394295: Unexpected changes in search latency from Incoming to Blocked / Waiting on the Discovery-Search (2025.06.13 - 2025.07.04) board.
Mon, Jun 23, 3:55 PM · User-Michael, Discovery-Search (2025.06.13 - 2025.07.04), Growth-Team (Maintenance), Performance Issue, GrowthExperiments-NewcomerTasks
EBernhardson added a comment to T394295: Unexpected changes in search latency.

Search traffic moved between datacenters due to operational concerns; this causes latency effects as noted above. The latency difference is ~30ms per round trip to the search servers if the requests have to go cross-datacenter. Typically there are 2 round trips, but that can vary for a number of reasons.
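
As a quick worked example of the figures above, the added latency for a typical request is the per-round-trip penalty times the number of round trips. The constant and function names below are illustrative.

```python
CROSS_DC_RTT_MS = 30  # approximate per-round-trip penalty quoted above


def added_latency_ms(round_trips=2):
    """Estimate extra latency when search requests cross datacenters."""
    return round_trips * CROSS_DC_RTT_MS


print(added_latency_ms())   # the typical two-round-trip case
print(added_latency_ms(3))  # a request that needs one more round trip
```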

Mon, Jun 23, 3:54 PM · User-Michael, Discovery-Search (2025.06.13 - 2025.07.04), Growth-Team (Maintenance), Performance Issue, GrowthExperiments-NewcomerTasks
EBernhardson moved T143553: Switching search traffic between datacenters should be faster from Blocked / Waiting to In Progress on the Discovery-Search (2025.06.13 - 2025.07.04) board.
Mon, Jun 23, 3:16 PM · Discovery-Search (2025.06.13 - 2025.07.04), Data-Platform-SRE (2025.06.13 - 2025.07.04), Patch-For-Review, Sustainability (Incident Followup), Discovery-ARCHIVED, Elasticsearch

Fri, Jun 20

EBernhardson moved T390612: Search requests failing due to connection closure from Blocked / Waiting to Done on the Discovery-Search (2025.06.13 - 2025.07.04) board.
Fri, Jun 20, 7:48 PM · Discovery-Search (2025.06.13 - 2025.07.04), Data-Platform-SRE (2025.06.13 - 2025.07.04)
EBernhardson added a comment to T143553: Switching search traffic between datacenters should be faster.

This is mostly done, read traffic is all flowing through the dns-disc endpoints. Traffic can now move with the same tooling as everything else. Not quite done yet, as T397377 was opened for a change we noticed in the dashboards.

Fri, Jun 20, 7:48 PM · Discovery-Search (2025.06.13 - 2025.07.04), Data-Platform-SRE (2025.06.13 - 2025.07.04), Patch-For-Review, Sustainability (Incident Followup), Discovery-ARCHIVED, Elasticsearch
EBernhardson moved T393872: Make weighted tags no longer be WMF-specific from Needs Review to Done on the Discovery-Search (2025.06.13 - 2025.07.04) board.
Fri, Jun 20, 7:45 PM · MW-1.45-notes (1.45.0-wmf.7; 2025-06-24), Discovery-Search (2025.06.13 - 2025.07.04), Patch-For-Review, CirrusSearch
EBernhardson added a comment to T390612: Search requests failing due to connection closure.

There are a few errors remaining

Fri, Jun 20, 7:37 PM · Discovery-Search (2025.06.13 - 2025.07.04), Data-Platform-SRE (2025.06.13 - 2025.07.04)
EBernhardson moved T390612: Search requests failing due to connection closure from Blocked/Waiting to Done on the Data-Platform-SRE (2025.06.13 - 2025.07.04) board.

As part of T143553 we enabled retry_on: gateway-error in the envoy configuration for the new dns-discovery endpoint, which is used for read traffic. Essentially this transparently retries specific 5xx errors that are gateway related at the envoy level. We switched over to the new endpoints a little after 2025-06-18 21:00 UTC. Per logstash, the "upstream connect error" errors were firing 500-1000 times per 12 hours; post deployment that has dropped to 30-100, roughly a 10x reduction in errors.
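
The behavior can be sketched in a few lines of Python: retry only when the upstream answers with a gateway-style status (502, 503, or 504, which is what Envoy's gateway-error policy covers) and pass everything else through. This is a toy model of the idea, not Envoy's implementation; call_with_retry, its max_retries parameter, and the simulated responses are assumptions for illustration.

```python
GATEWAY_ERRORS = {502, 503, 504}  # statuses covered by a gateway-error retry policy


def call_with_retry(send, max_retries=1):
    """Call send() and retry only while it returns a gateway-error status."""
    status = send()
    for _ in range(max_retries):
        if status not in GATEWAY_ERRORS:
            break
        status = send()
    return status


# Simulated upstream: one transient 503, then success.
responses = iter([503, 200])
print(call_with_retry(lambda: next(responses)))
```

A plain 500 is returned to the caller unchanged in this sketch, since it is not a gateway error.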

Fri, Jun 20, 7:24 PM · Discovery-Search (2025.06.13 - 2025.07.04), Data-Platform-SRE (2025.06.13 - 2025.07.04)

Jun 17 2025

EBernhardson moved T396779: Evaluate impact of an increase in phrase suggester size from Needs Review to Done on the Discovery-Search (2025.06.13 - 2025.07.04) board.
Jun 17 2025, 5:42 PM · Discovery-Search (2025.06.13 - 2025.07.04), CirrusSearch
EBernhardson moved T397083: Add a second suggest field to the CirrusSearch mapping from Incoming to To be Deployed on the Discovery-Search (2025.06.13 - 2025.07.04) board.

This will require a full-cluster reindex operation, which is actually pending for a number of reasons (various language analysis chain updates, regex trigram indexing changes, etc.)

Jun 17 2025, 4:43 PM · Discovery-Search (2025.07.04 - 2025.07.25), MW-1.45-notes (1.45.0-wmf.7; 2025-06-24), CirrusSearch
EBernhardson added a subtask for T317599: Allow ^ and $ in intitle regex search: T397227: Build and deploy OpenSearch plugins package for updated regex search.
Jun 17 2025, 3:42 PM · Discovery-Search (2025.07.04 - 2025.07.25), User-notice, CirrusSearch
EBernhardson added a parent task for T397227: Build and deploy OpenSearch plugins package for updated regex search: T317599: Allow ^ and $ in intitle regex search.
Jun 17 2025, 3:41 PM · Data-Platform-SRE (2025.07.05 - 2025.07.25)
EBernhardson created T397227: Build and deploy OpenSearch plugins package for updated regex search.
Jun 17 2025, 3:41 PM · Data-Platform-SRE (2025.07.05 - 2025.07.25)
EBernhardson moved T262612: Run an A/B test using suggestions generated using glent Method 1 from Needs Review to Done on the Discovery-Search (2025.06.13 - 2025.07.04) board.

With that patch deployed the test is concluded and the updated configuration utilizing glent m1 is in place.

Jun 17 2025, 3:36 PM · Discovery-Search (2025.06.13 - 2025.07.04), MW-1.45-notes (1.45.0-wmf.2; 2025-05-20), Patch-For-Review

Jun 16 2025

EBernhardson created T397083: Add a second suggest field to the CirrusSearch mapping.
Jun 16 2025, 5:44 PM · Discovery-Search (2025.07.04 - 2025.07.25), MW-1.45-notes (1.45.0-wmf.7; 2025-06-24), CirrusSearch
EBernhardson added a comment to T390858: Improve CirrusSearch DYM suggestions using the phrase suggester on more content.

Per the findings in T396779, I think we can greatly simplify this. The initial premise was that the phrase suggester indices would be too large, but current analysis says we have plenty of headroom. An alternate implementation:

Jun 16 2025, 4:37 PM · Epic, Discovery-Search, CirrusSearch
EBernhardson added a comment to T262612: Run an A/B test using suggestions generated using glent Method 1.

Patch above undeploys the AB test and, per the report recommendations, deploys glent method 1 to all users on the English, French, and German Wikipedias.

Jun 16 2025, 4:18 PM · Discovery-Search (2025.06.13 - 2025.07.04), MW-1.45-notes (1.45.0-wmf.2; 2025-05-20), Patch-For-Review
EBernhardson added a comment to T393872: Make weighted tags no longer be WMF-specific.

I think the only patch still open is: https://gerrit.wikimedia.org/r/c/mediawiki/extensions/CirrusSearch/+/1144486

Jun 16 2025, 4:17 PM · MW-1.45-notes (1.45.0-wmf.7; 2025-06-24), Discovery-Search (2025.06.13 - 2025.07.04), Patch-For-Review, CirrusSearch

Jun 13 2025

EBernhardson moved T396779: Evaluate impact of an increase in phrase suggester size from Incoming to Needs Review on the Discovery-Search (2025.05.24 - 2025.06.13) board.

Two primary sets of tests were conducted: an index size analysis and a memory utilization stress test.

Jun 13 2025, 5:01 PM · Discovery-Search (2025.06.13 - 2025.07.04), CirrusSearch

Jun 12 2025

EBernhardson edited projects for T396779: Evaluate impact of an increase in phrase suggester size, added: Discovery-Search (2025.05.24 - 2025.06.13); removed Discovery-Search.
Jun 12 2025, 4:34 PM · Discovery-Search (2025.06.13 - 2025.07.04), CirrusSearch
EBernhardson created T396779: Evaluate impact of an increase in phrase suggester size.
Jun 12 2025, 4:32 PM · Discovery-Search (2025.06.13 - 2025.07.04), CirrusSearch

Jun 11 2025

EBernhardson moved T262612: Run an A/B test using suggestions generated using glent Method 1 from Blocked / Waiting to Needs Review on the Discovery-Search (2025.05.24 - 2025.06.13) board.

Expecting this to be the final report: https://people.wikimedia.org/~ebernhardson/T262612-dym-ab-analysis.html

Jun 11 2025, 9:14 PM · Discovery-Search (2025.06.13 - 2025.07.04), MW-1.45-notes (1.45.0-wmf.2; 2025-05-20), Patch-For-Review
EBernhardson updated the name of F62295622: article clickthrough rate on glent autorewritten queries.png from "image.png" to "article clickthrough rate on glent autorewritten queries.png".
Jun 11 2025, 7:47 PM

Jun 10 2025

EBernhardson added a comment to T396501: Decide on/run a benchmark for DPE SRE-owned OpenSearch clusters.

Plausibly relevant information from setting up codfw: T117714

Jun 10 2025, 5:32 PM · Data-Platform-SRE (2025.07.05 - 2025.07.25)

Jun 9 2025

EBernhardson removed a project from T395571: Verify/fix Logstash pipeline for Search Platform-owned OpenSearch clusters: Discovery-Search (2025.05.24 - 2025.06.13).
Jun 9 2025, 3:37 PM · Data-Platform-SRE (2025.07.05 - 2025.07.25)
EBernhardson added a comment to T395465: Investigate EQIAD daily completion suggester rebuild failure.
 :) (ebernhardson@deploy1003)-~$ kube_env mw-cron eqiad
 :) (ebernhardson@deploy1003)-~$ kubectl get pods | grep OOM
cirrus-build-completion-indices-codfw-s3-29155830-52977           0/3     OOMKilled   0          37h
cirrus-build-completion-indices-codfw-s3-29157270-wnzhd           0/3     OOMKilled   0          13h
cirrus-build-completion-indices-eqiad-s3-29155830-72d5c           0/3     OOMKilled   0          37h
cirrus-build-completion-indices-eqiad-s3-29157270-mn6bp           0/3     OOMKilled   0          13h
Jun 9 2025, 3:36 PM · Data-Platform-SRE (2025.07.05 - 2025.07.25), MW-1.45-notes (1.45.0-wmf.8; 2025-07-01), Discovery-Search (2025.06.13 - 2025.07.04), MW-on-K8s, serviceops
EBernhardson moved T396252: CirrusSearchIndexTooOld from needs triage to Next Projects on the Discovery-Search board.
Jun 9 2025, 3:31 PM · Discovery-Search (2025.06.13 - 2025.07.04)
EBernhardson triaged T395348: An option to load more files for deepcategory searches with partial results (larger depth) as Low priority.
Jun 9 2025, 3:31 PM · CirrusSearch, Discovery-Search, Commons
EBernhardson moved T395906: Set up Cirrus Search to migrate to Domain Events from Incoming to Blocked / Waiting on the Discovery-Search (2025.05.24 - 2025.06.13) board.
Jun 9 2025, 3:30 PM · Discovery-Search (2025.07.04 - 2025.07.25), MW-Interfaces-Team, CirrusSearch
EBernhardson edited projects for T395906: Set up Cirrus Search to migrate to Domain Events, added: Discovery-Search (2025.05.24 - 2025.06.13); removed Discovery-Search.
Jun 9 2025, 3:29 PM · Discovery-Search (2025.07.04 - 2025.07.25), MW-Interfaces-Team, CirrusSearch
EBernhardson moved T394791: [SPIKE] Investigate CirrusSearch extension for Domain Event migrations from Incoming to Blocked / Waiting on the Discovery-Search (2025.05.24 - 2025.06.13) board.
Jun 9 2025, 3:29 PM · Discovery-Search (2025.07.04 - 2025.07.25), MW-Interfaces-Team, CirrusSearch, OKR-Work, MediaWiki-DomainEvents
EBernhardson edited projects for T394791: [SPIKE] Investigate CirrusSearch extension for Domain Event migrations, added: Discovery-Search (2025.05.24 - 2025.06.13); removed Discovery-Search.
Jun 9 2025, 3:28 PM · Discovery-Search (2025.07.04 - 2025.07.25), MW-Interfaces-Team, CirrusSearch, OKR-Work, MediaWiki-DomainEvents
EBernhardson moved T395341: Deepcategory search on Commons often does not show any results instead of results up to the configured limits from needs triage to Feature Requests on the Discovery-Search board.

Possible solutions:

  • Some sort of cheapcat keyword that has a reduced depth
  • Some sort of parameter / named parameter passing for the keyword to allow users to change the depth
  • Some sort of cheapcat that runs the query but without a sort (won't time out, but doesn't get the n-closest nodes)
Jun 9 2025, 3:27 PM · CirrusSearch, Discovery-Search, Commons
EBernhardson moved T388538: Migrate discovery-search jobs to mw-cron from Reported to In Progress on the Discovery-Search (2025.05.24 - 2025.06.13) board.

It looks to still be having issues, in particular the s3 job has been OOMKilled a few times recently and isn't completing a full build.

Jun 9 2025, 2:58 PM · Discovery-Search (2025.06.13 - 2025.07.04), Wikidata, Patch-For-Review, serviceops

Jun 6 2025

EBernhardson added a comment to T391383: Metrics for federated querying.

Hi @Gehel !
Thanks again for all the clarifications :)
We have new questions, could you please have a look when you have an opportunity.

  1. Correct me if I'm wrong - after the graph split, there are 2 Query Service instances running - one for Wikidata main and another for Wikidata scholarly articles. Does the data for both end up in the same dataset of query logs on hive?
Jun 6 2025, 2:10 PM · Wikidata, Wikidata-Query-Service

Jun 5 2025

EBernhardson moved T392525: Investigate creation of a Sankey diagram for user interaction with search from Needs Review to Done on the Discovery-Search (2025.05.24 - 2025.06.13) board.
Jun 5 2025, 8:23 PM · Discovery-Search (2025.05.24 - 2025.06.13)

Jun 3 2025

EBernhardson moved T262612: Run an A/B test using suggestions generated using glent Method 1 from To be Deployed to Blocked / Waiting on the Discovery-Search (2025.05.24 - 2025.06.13) board.
Jun 3 2025, 7:22 PM · Discovery-Search (2025.06.13 - 2025.07.04), MW-1.45-notes (1.45.0-wmf.2; 2025-05-20), Patch-For-Review
EBernhardson moved T391738: outreach.wikimedia.org is not allowlisted for an mwapi Wikidata query from To be Deployed to Done on the Discovery-Search (2025.05.24 - 2025.06.13) board.
Jun 3 2025, 7:21 PM · Data-Platform-SRE (2025.05.24 - 2025.06.13), Discovery-Search (2025.05.24 - 2025.06.13), Wikidata
EBernhardson moved T391738: outreach.wikimedia.org is not allowlisted for an mwapi Wikidata query from Needs Review to Done on the Data-Platform-SRE (2025.05.24 - 2025.06.13) board.

This is deployed now. Tests show the example query no longer errors out, although it doesn't give any results either. As far as I can tell, not getting results is appropriate: the pages returned by the generator don't have a wikibase_item page prop and thus don't get represented here.

Jun 3 2025, 7:21 PM · Data-Platform-SRE (2025.05.24 - 2025.06.13), Discovery-Search (2025.05.24 - 2025.06.13), Wikidata
EBernhardson added a comment to T262612: Run an A/B test using suggestions generated using glent Method 1.

T395677 has been fixed, but it essentially means we were not serving suggestions to a subset of queries. The errors stop by 2025-06-03 08:00 UTC; we will need to ensure we do not use any data collected prior to that in data analysis.

Jun 3 2025, 3:16 PM · Discovery-Search (2025.06.13 - 2025.07.04), MW-1.45-notes (1.45.0-wmf.2; 2025-05-20), Patch-For-Review
EBernhardson moved T395677: Search backend error: illegal_argument_exception: field value function must not produce negative scores from Incoming to Done on the Discovery-Search (2025.05.24 - 2025.06.13) board.

Glent update built and shipped overnight, the errors look to have gone away now too.

Jun 3 2025, 2:09 PM · Discovery-Search (2025.05.24 - 2025.06.13), CirrusSearch

Jun 2 2025

EBernhardson moved T391792: Align search platform DAGs to DPE best practices from Needs Review to Done on the Discovery-Search (2025.05.24 - 2025.06.13) board.

I think we can call this complete. Nothing in the search/dags folder directly creates a DAG anymore; everything goes through create_easy_dag, which I believe suggests everything has been migrated.

Jun 2 2025, 7:48 PM · Patch-For-Review, Discovery-Search (2025.05.24 - 2025.06.13), CirrusSearch
EBernhardson claimed T395677: Search backend error: illegal_argument_exception: field value function must not produce negative scores.
Jun 2 2025, 5:11 PM · Discovery-Search (2025.05.24 - 2025.06.13), CirrusSearch
EBernhardson added a comment to T395677: Search backend error: illegal_argument_exception: field value function must not produce negative scores.

This is indeed coming from glent; poking through the index updates we ship, we can clearly see there are negative numbers in there. I think they come from a place where we do score - edit_distance; we should be able to simply shift the scores with a constant value. I suspect we have to apply that constant globally if we don't want to change the ranking: current query patterns simply return the suggestion score, and shifting only m1run would prefer it over others.
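
The reasoning about applying the constant globally can be checked with a toy example. The suggestion names, scores, and the shift value below are invented for illustration; only the idea (score minus edit distance can go negative, and a uniform shift preserves order) comes from the comment.

```python
def rank(scores):
    """Return suggestion keys ordered by descending score."""
    return sorted(scores, key=scores.get, reverse=True)


# Made-up scores keyed by "method:suggestion"; the m1run entry has gone
# negative, as can happen after computing score - edit_distance.
scores = {"m0:foo": 1.5, "m1run:bar": -0.5, "m0:baz": 0.2}
SHIFT = 10.0  # hypothetical constant, large enough to make every score non-negative

# Shifting every score by the same constant keeps the ordering intact.
shifted_all = {k: v + SHIFT for k, v in scores.items()}

# Shifting only the m1run scores pushes them ahead of everything else.
shifted_m1_only = {
    k: v + SHIFT if k.startswith("m1run") else v for k, v in scores.items()
}

print(rank(scores))
print(rank(shifted_all))
print(rank(shifted_m1_only))
```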

Jun 2 2025, 2:06 PM · Discovery-Search (2025.05.24 - 2025.06.13), CirrusSearch

May 29 2025

EBernhardson moved T262612: Run an A/B test using suggestions generated using glent Method 1 from In Progress to To be Deployed on the Discovery-Search (2025.05.24 - 2025.06.13) board.
May 29 2025, 4:50 PM · Discovery-Search (2025.06.13 - 2025.07.04), MW-1.45-notes (1.45.0-wmf.2; 2025-05-20), Patch-For-Review