☂ Migrate Wikitech to Kubernetes
Closed, Resolved; Public

Description

This is an ☂ task for Wikitech's migration to Kubernetes.

Wikitech has been a long-standing snowflake, to the point where it is becoming unmanageable. The high-level plan is to "weed out" everything that makes Wikitech different from any other wiki, and then serve it from MW-on-K8s.

Accounts and Developer Services

  • Remove the dependency between Striker, Horizon, and Wikitech
  • Migrate Wikitech accounts off LDAP to the Single User Login (SUL)
  • Remove the OpenStack extension (SSH key management)

Infrastructure

  • Migrate Wikitech's jobqueue
  • mediawiki-config changes
  • apache changes
  • Route wikitech to MW-on-K8s

Going Live Tue 1st Oct 2024

Event Timeline

Any ETA on when this could happen?

We will soon be in next quarter's OKR/planning season; we'll post an update then if we manage to schedule it for next quarter. So that we have all the data we need to make the decision: is this a blocker for Data Persistence in any way?

It is not a blocker for us per se, but we got hit again: https://phabricator.wikimedia.org/T237773#8930311

Cool, noted. Thanks for the info.

To expand upon what Manuel said: we have to manually special-case wikitech in a lot of places, which is a lot of work, and when we forget to, it usually ends up as an incident, outage, or major issue that makes wikitech unusable. This has happened multiple times.

On top of that, since the wikitech appserver is in the public network, we have to open holes in our firewall and add extra grants, and that makes me uneasy.

jijiki renamed this task from Migrate Wikitech to Kubernetes to ☂ Migrate Wikitech to Kubernetes. Jul 30 2024, 10:00 AM
jijiki changed the task status from Open to In Progress.
jijiki raised the priority of this task from Medium to High.
jijiki updated the task description.
jijiki updated the task description.
jijiki lowered the priority of this task from High to Medium. Oct 1 2024, 12:45 PM

@Ladsgroup and I flipped the relevant switches today; for the time being, all seems well.

Change #1077040 had a related patch set uploaded (by Majavah; author: Majavah):

[operations/puppet@production] hieradata: Stop monitoring Wikitech on cloudweb hosts

https://gerrit.wikimedia.org/r/1077040

Change #1077041 had a related patch set uploaded (by Majavah; author: Majavah):

[operations/puppet@production] openstack: Stop running Wikitech jobs on cloudweb hosts

https://gerrit.wikimedia.org/r/1077041

Change #1077040 merged by Effie Mouzeli:

[operations/puppet@production] hieradata: Stop monitoring Wikitech on cloudweb hosts

https://gerrit.wikimedia.org/r/1077040

Change #1077041 merged by Effie Mouzeli:

[operations/puppet@production] openstack: Stop running Wikitech jobs on cloudweb hosts

https://gerrit.wikimedia.org/r/1077041

Change #1077349 had a related patch set uploaded (by Zabe; author: Zabe):

[operations/mediawiki-config@master] reverse-proxy: Drop all public ips except cloudweb2002-dev.codfw.wmnet

https://gerrit.wikimedia.org/r/1077349

Change #1077403 had a related patch set uploaded (by Majavah; author: Majavah):

[operations/puppet@production] snapshots: Dump wikitech (labswiki) like any other wiki

https://gerrit.wikimedia.org/r/1077403

Change #1077403 merged by Ladsgroup:

[operations/puppet@production] snapshots: Dump wikitech (labswiki) like any other wiki

https://gerrit.wikimedia.org/r/1077403

Change #1077349 merged by jenkins-bot:

[operations/mediawiki-config@master] reverse-proxy: Drop all public ips except cloudweb2002-dev.codfw.wmnet

https://gerrit.wikimedia.org/r/1077349

Mentioned in SAL (#wikimedia-operations) [2024-10-02T16:09:34Z] <taavi@deploy2002> Started scap sync-world: Backport for [[gerrit:1077349|reverse-proxy: Drop all public ips except cloudweb2002-dev.codfw.wmnet (T292707)]]

Mentioned in SAL (#wikimedia-operations) [2024-10-02T16:11:46Z] <taavi@deploy2002> zabe, taavi: Backport for [[gerrit:1077349|reverse-proxy: Drop all public ips except cloudweb2002-dev.codfw.wmnet (T292707)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug)

Mentioned in SAL (#wikimedia-operations) [2024-10-02T16:16:35Z] <taavi@deploy2002> Finished scap sync-world: Backport for [[gerrit:1077349|reverse-proxy: Drop all public ips except cloudweb2002-dev.codfw.wmnet (T292707)]] (duration: 07m 01s)

I think we can close this, kudos to everyone who worked on making this happen!