Page MenuHomePhabricator

Cloud-ServicesUmbrella
ActivePublic

Details

Description

Cloud-Services is the umbrella project for tasks related to the products managed by the Wikimedia Cloud Services team. A general overview of services offered can be found at wikitech:Help:Cloud Services Introduction

Subprojects:

The team itself has a separate project at cloud-services-team that is used to track the backlog and current priorities of the team.

Recent Activity

Mon, Jul 14

Dzahn created T399492: openstack-browser.toolforge.org - internal server error for certain URL.

The Cloud-Services project tag is not intended to have any tasks. Please check the list on https://phabricator.wikimedia.org/project/profile/832/ and replace it with a more specific project tag to this task. Thanks!

Mon, Jul 14, 7:16 PM · Tool-openstack-browser

Fri, Jul 11

joanna_borun created T399313: Add tracing to understand Toolforge and CloudVPS usage and dependencies.

The Cloud-Services project tag is not intended to have any tasks. Please check the list on https://phabricator.wikimedia.org/project/profile/832/ and replace it with a more specific project tag to this task. Thanks!

Fri, Jul 11, 3:10 PM · Cloud-Services

Wed, Jun 18

hashar created T397351: Object storage web service CSP does not allow inline images.

The Cloud-Services project tag is not intended to have any tasks. Please check the list on https://phabricator.wikimedia.org/project/profile/832/ and replace it with a more specific project tag to this task. Thanks!

Wed, Jun 18, 5:22 PM · Cloud-VPS, cloud-services-team, ContentSecurityPolicy, Continuous-Integration-Infrastructure (Zuul upgrade)
Mhurd created T397272: Horizon proxy tab Edit buttons not working .

The Cloud-Services project tag is not intended to have any tasks. Please check the list on https://phabricator.wikimedia.org/project/profile/832/ and replace it with a more specific project tag to this task. Thanks!

Wed, Jun 18, 3:11 AM · cloud-services-team (FY2024/2025-Q3-Q4), Horizon

Jun 12 2025

taavi edited Description on Cloud-Services.
Jun 12 2025, 11:00 AM
Jelto created T396739: Volume is stuck to deleted instance in devtools project.

The Cloud-Services project tag is not intended to have any tasks. Please check the list on https://phabricator.wikimedia.org/project/profile/832/ and replace it with a more specific project tag to this task. Thanks!

Jun 12 2025, 10:58 AM · cloud-services-team (FY2024/2025-Q3-Q4), Cloud-VPS, collaboration-services, GitLab (Infrastructure)

Jun 4 2025

Andrew created T396038: toolsbeta paging.

The Cloud-Services project tag is not intended to have any tasks. Please check the list on https://phabricator.wikimedia.org/project/profile/832/ and replace it with a more specific project tag to this task. Thanks!

Jun 4 2025, 2:55 PM · Toolforge, cloud-services-team

Jun 3 2025

Gehel edited projects for T387419: Create wiki replicas views for globaljsonlinks tables, added: Cloud-Services; removed Data-Services.

The Cloud-Services project tag is not intended to have any tasks. Please check the list on https://phabricator.wikimedia.org/project/profile/832/ and replace it with a more specific project tag to this task. Thanks!

Jun 3 2025, 8:46 AM · Data-Platform-SRE (2025.07.05 - 2025.07.25), Data-Services, cloud-services-team, Data-Persistence, Data-Engineering

May 29 2025

dancy created T395633: Is it a bug to have a hostname in profile::resolving::nameservers?.

The Cloud-Services project tag is not intended to have any tasks. Please check the list on https://phabricator.wikimedia.org/project/profile/832/ and replace it with a more specific project tag to this task. Thanks!

May 29 2025, 11:01 PM · cloud-services-team

May 17 2025

Peachey88 edited projects for T394577: Is Using sentry for error monitoring against wikimedia cloud privacy policy?, added: Cloud-Services; removed Tool-campwiz-nxt.

The Cloud-Services project tag is not intended to have any tasks. Please check the list on https://phabricator.wikimedia.org/project/profile/832/ and replace it with a more specific project tag to this task. Thanks!

May 17 2025, 11:20 AM · cloud-services-team, Toolforge

May 6 2025

Huji created T393505: Unable to create python virtualenv on Toolforge.

The Cloud-Services project tag is not intended to have any tasks. Please check the list on https://phabricator.wikimedia.org/project/profile/832/ and replace it with a more specific project tag to this task. Thanks!

May 6 2025, 6:57 PM · cloud-services-team, Toolforge

Apr 30 2025

GPSLeo created T393024: lua entry thread aborted: runtime error: /etc/nginx/lua/domainproxy.lua:32: bad request.

The Cloud-Services project tag is not intended to have any tasks. Please check the list on https://phabricator.wikimedia.org/project/profile/832/ and replace it with a more specific project tag to this task. Thanks!

Apr 30 2025, 3:46 PM · Cloud-VPS, cloud-services-team

Apr 28 2025

Maintenance_bot removed a project from T162955: rebuild tools-grid-master as a large instance: Patch-For-Review.
Apr 28 2025, 4:30 PM · cloud-services-team (Kanban), SRE, Cloud-Services
gerritbot added a comment to T162955: rebuild tools-grid-master as a large instance.

Change #352301 abandoned by Hashar:

[operations/puppet@production] gridengine: Follow up - delete old maintenance scripts and tracker/collector puppet code

Reason:

gridengine is no more

https://gerrit.wikimedia.org/r/352301

Apr 28 2025, 4:22 PM · cloud-services-team (Kanban), SRE, Cloud-Services
gerritbot added a comment to T162955: rebuild tools-grid-master as a large instance.

Change #352294 abandoned by Hashar:

[operations/puppet@production] gridengine: Cleanup old scripts, tracker and collector

Reason:

gridengine is no more

https://gerrit.wikimedia.org/r/352294

Apr 28 2025, 4:22 PM · cloud-services-team (Kanban), SRE, Cloud-Services
gerritbot added a comment to T162955: rebuild tools-grid-master as a large instance.

Change #352281 abandoned by Hashar:

[operations/puppet@production] gridengine: Cleanup mergeconf script and references

Reason:

gridengine is no more

https://gerrit.wikimedia.org/r/352281

Apr 28 2025, 4:22 PM · cloud-services-team (Kanban), SRE, Cloud-Services
gerritbot added a comment to T162955: rebuild tools-grid-master as a large instance.

Change #351379 abandoned by Hashar:

[operations/puppet@production] sge: Fix global config handling

Reason:

gridengine is no more

https://gerrit.wikimedia.org/r/351379

The Cloud-Services project tag is not intended to have any tasks. Please check the list on https://phabricator.wikimedia.org/project/profile/832/ and replace it with a more specific project tag to this task. Thanks!

Apr 28 2025, 4:22 PM · cloud-services-team (Kanban), SRE, Cloud-Services

Apr 23 2025

MoritzMuehlenhoff created T392478: Move cloudweb to Ganeti VMs and repurpose the servers as wikikube nodes.

The Cloud-Services project tag is not intended to have any tasks. Please check the list on https://phabricator.wikimedia.org/project/profile/832/ and replace it with a more specific project tag to this task. Thanks!

Apr 23 2025, 12:11 PM · Horizon, Striker, cloud-services-team, serviceops, SRE

Apr 2 2025

hashar added a comment to T390824: On WMCS linux-perf must be installed from backports to be in sync with linux-image package.

On integration-agent-docker-1044:

reboot   system boot  6.1.0-0.deb11.7- Tue Jul  4 14:30:48 2023 - Tue Jul  4 14:32:14 2023  (00:01)
hashar   pts/0        172.16.3.145     Tue Jul  4 14:20:41 2023 - Tue Jul  4 14:30:41 2023  (00:10)
root     ttyS0                         Tue Jul  4 14:18:52 2023 - down                      (00:11)
root     ttyS0                         Tue Jul  4 14:16:31 2023 - Tue Jul  4 14:18:52 2023  (00:02)
reboot   system boot  6.1.0-0.deb11.7- Tue Jul  4 14:16:15 2023 - Tue Jul  4 14:30:42 2023  (00:14)
root     ttyS0                         Thu Jun  8 15:28:43 2023 - down                      (00:00)
reboot   system boot  5.10.0-23-cloud- Thu Jun  8 15:22:11 2023 - Thu Jun  8 15:29:14 2023  (00:07)
Apr 2 2025, 8:50 AM · Essential-Work, Release-Engineering-Team (Radar), Cloud-VPS, cloud-services-team
hashar added a comment to T390824: On WMCS linux-perf must be installed from backports to be in sync with linux-image package.

The instance that got recently created run on 5.10.0 while the older are on 6.1.0:

$ sudo cumin --force 'name:docker' 'uname -r'
26 hosts will be targeted:
integration-agent-docker-[1040-1057,1059-1065].integration.eqiad1.wikimedia.cloud,integration-agent-puppet-docker-1003.integration.eqiad1.wikimedia.cloud
FORCE mode enabled, continuing without confirmation
===== NODE GROUP =====                                                                                                                                                                        
(1) integration-agent-docker-1044.integration.eqiad1.wikimedia.cloud                                                                                                                          
----- OUTPUT of 'uname -r' -----                                                                                                                                                              
6.1.0-0.deb11.21-cloud-amd64                                                                                                                                                                  
===== NODE GROUP =====                                                                                                                                                                        
(6) integration-agent-docker-[1060-1065].integration.eqiad1.wikimedia.cloud                                                                                                                   
----- OUTPUT of 'uname -r' -----                                                                                                                                                              
5.10.0-34-cloud-amd64                                                                                                                                                                         
===== NODE GROUP =====                                                                                                                                                                        
(2) integration-agent-docker-1059.integration.eqiad1.wikimedia.cloud,integration-agent-puppet-docker-1003.integration.eqiad1.wikimedia.cloud                                                  
----- OUTPUT of 'uname -r' -----                                                                                                                                                              
5.10.0-33-cloud-amd64                                                                                                                                                                         
===== NODE GROUP =====                                                                                                                                                                        
(17) integration-agent-docker-[1040-1043,1045-1057].integration.eqiad1.wikimedia.cloud                                                                                                        
----- OUTPUT of 'uname -r' -----                                                                                                                                                              
6.1.0-0.deb11.7-cloud-amd64                                                                                                                                                                   
================
Apr 2 2025, 8:43 AM · Essential-Work, Release-Engineering-Team (Radar), Cloud-VPS, cloud-services-team
hashar added a parent task for T390824: On WMCS linux-perf must be installed from backports to be in sync with linux-image package: T390125: Find and document how to debug a NodeJS process on CI/Docker.
Apr 2 2025, 8:38 AM · Essential-Work, Release-Engineering-Team (Radar), Cloud-VPS, cloud-services-team
hashar created T390824: On WMCS linux-perf must be installed from backports to be in sync with linux-image package.

The Cloud-Services project tag is not intended to have any tasks. Please check the list on https://phabricator.wikimedia.org/project/profile/832/ and replace it with a more specific project tag to this task. Thanks!

Apr 2 2025, 8:36 AM · Essential-Work, Release-Engineering-Team (Radar), Cloud-VPS, cloud-services-team

Mar 21 2025

Gehel edited projects for T388270: Cloudelastic alerts should route to data platform alerts, not wmcs, added: Data-Platform-SRE (2025.03.22 - 2025.04.11); removed Data-Platform-SRE (2025.03.01 - 2025.03.21).
Mar 21 2025, 9:56 AM · Data-Services, Data-Platform-SRE (2025.03.22 - 2025.04.11), SRE Observability (FY2024/2025-Q3), cloud-services-team, Elasticsearch

Mar 13 2025

Maintenance_bot removed a project from T388270: Cloudelastic alerts should route to data platform alerts, not wmcs: Patch-For-Review.
Mar 13 2025, 3:30 PM · Data-Services, Data-Platform-SRE (2025.03.22 - 2025.04.11), SRE Observability (FY2024/2025-Q3), cloud-services-team, Elasticsearch
gerritbot added a comment to T388270: Cloudelastic alerts should route to data platform alerts, not wmcs.

Change #1126486 merged by Filippo Giunchedi:

[operations/puppet@production] icinga: route relforge icinga alerts to data-platform

https://gerrit.wikimedia.org/r/1126486

Mar 13 2025, 3:16 PM · Data-Services, Data-Platform-SRE (2025.03.22 - 2025.04.11), SRE Observability (FY2024/2025-Q3), cloud-services-team, Elasticsearch

Mar 12 2025

lmata edited projects for T388270: Cloudelastic alerts should route to data platform alerts, not wmcs, added: SRE Observability (FY2024/2025-Q3); removed SRE Observability.
Mar 12 2025, 3:13 PM · Data-Services, Data-Platform-SRE (2025.03.22 - 2025.04.11), SRE Observability (FY2024/2025-Q3), cloud-services-team, Elasticsearch

Mar 11 2025

gerritbot added a project to T388270: Cloudelastic alerts should route to data platform alerts, not wmcs: Patch-For-Review.
Mar 11 2025, 7:48 AM · Data-Services, Data-Platform-SRE (2025.03.22 - 2025.04.11), SRE Observability (FY2024/2025-Q3), cloud-services-team, Elasticsearch
gerritbot added a comment to T388270: Cloudelastic alerts should route to data platform alerts, not wmcs.

Change #1126486 had a related patch set uploaded (by Filippo Giunchedi; author: Filippo Giunchedi):

[operations/puppet@production] icinga: route relforge icinga alerts to data-platform

https://gerrit.wikimedia.org/r/1126486

Mar 11 2025, 7:48 AM · Data-Services, Data-Platform-SRE (2025.03.22 - 2025.04.11), SRE Observability (FY2024/2025-Q3), cloud-services-team, Elasticsearch

Mar 10 2025

Maintenance_bot removed a project from T388270: Cloudelastic alerts should route to data platform alerts, not wmcs: Patch-For-Review.
Mar 10 2025, 3:30 PM · Data-Services, Data-Platform-SRE (2025.03.22 - 2025.04.11), SRE Observability (FY2024/2025-Q3), cloud-services-team, Elasticsearch
gerritbot added a comment to T388270: Cloudelastic alerts should route to data platform alerts, not wmcs.

Change #1126067 merged by Bking:

[operations/puppet@production] icinga: route cloudelastic alerts to Data Platform SRE

https://gerrit.wikimedia.org/r/1126067

Mar 10 2025, 3:28 PM · Data-Services, Data-Platform-SRE (2025.03.22 - 2025.04.11), SRE Observability (FY2024/2025-Q3), cloud-services-team, Elasticsearch
Gehel moved T388270: Cloudelastic alerts should route to data platform alerts, not wmcs from Backlog - project to Needs Review on the Data-Platform-SRE (2025.03.01 - 2025.03.21) board.
Mar 10 2025, 3:24 PM · Data-Services, Data-Platform-SRE (2025.03.22 - 2025.04.11), SRE Observability (FY2024/2025-Q3), cloud-services-team, Elasticsearch
Gehel edited projects for T388270: Cloudelastic alerts should route to data platform alerts, not wmcs, added: Data-Platform-SRE (2025.03.01 - 2025.03.21); removed Discovery-Search.
Mar 10 2025, 3:24 PM · Data-Services, Data-Platform-SRE (2025.03.22 - 2025.04.11), SRE Observability (FY2024/2025-Q3), cloud-services-team, Elasticsearch
gerritbot added a project to T388270: Cloudelastic alerts should route to data platform alerts, not wmcs: Patch-For-Review.
Mar 10 2025, 2:42 PM · Data-Services, Data-Platform-SRE (2025.03.22 - 2025.04.11), SRE Observability (FY2024/2025-Q3), cloud-services-team, Elasticsearch
gerritbot added a comment to T388270: Cloudelastic alerts should route to data platform alerts, not wmcs.

Change #1126067 had a related patch set uploaded (by Bking; author: Bking):

[operations/puppet@production] icinga: route cloudelastic alerts to Data Platform SRE

https://gerrit.wikimedia.org/r/1126067

Mar 10 2025, 2:42 PM · Data-Services, Data-Platform-SRE (2025.03.22 - 2025.04.11), SRE Observability (FY2024/2025-Q3), cloud-services-team, Elasticsearch
fgiunchedi added a comment to T388270: Cloudelastic alerts should route to data platform alerts, not wmcs.

Thanks for reaching out @RKemper, you are indeed correct that icinga alerts showing up on alerts.w.o go through a "translation" to route teams and alerts. Namely icinga-exporter (something we wrote) takes care of that. You can see its configuration at hieradata/common/profile/prometheus/icinga_exporter.yaml. It should be sufficient to add the expected data-platform team matching before wmcs in there (IIRC rules are "stop at first match"). HTH!

Mar 10 2025, 12:48 PM · Data-Services, Data-Platform-SRE (2025.03.22 - 2025.04.11), SRE Observability (FY2024/2025-Q3), cloud-services-team, Elasticsearch

Mar 7 2025

RKemper added a project to T388270: Cloudelastic alerts should route to data platform alerts, not wmcs: SRE Observability.

The alerts I'm seeing (unassigned shard check) are defined here:

Mar 7 2025, 10:39 PM · Data-Services, Data-Platform-SRE (2025.03.22 - 2025.04.11), SRE Observability (FY2024/2025-Q3), cloud-services-team, Elasticsearch
RKemper renamed T388270: Cloudelastic alerts should route to data platform alerts, not wmcs from Update alerting to correspond with the new cloudsearch cluster to Cloudelastic alerts should route to data platform alerts, not wmcs.
Mar 7 2025, 10:37 PM · Data-Services, Data-Platform-SRE (2025.03.22 - 2025.04.11), SRE Observability (FY2024/2025-Q3), cloud-services-team, Elasticsearch
Andrew created T388270: Cloudelastic alerts should route to data platform alerts, not wmcs.

The Cloud-Services project tag is not intended to have any tasks. Please check the list on https://phabricator.wikimedia.org/project/profile/832/ and replace it with a more specific project tag to this task. Thanks!

Mar 7 2025, 6:45 PM · Data-Services, Data-Platform-SRE (2025.03.22 - 2025.04.11), SRE Observability (FY2024/2025-Q3), cloud-services-team, Elasticsearch

Feb 26 2025

Maintenance_bot updated the task description for T387305: Migrate labweb-ssl LB VIPs to IPIP encapsulation.
Feb 26 2025, 11:52 AM · cloud-services-team, Cloud-VPS
Maintenance_bot created T387305: Migrate labweb-ssl LB VIPs to IPIP encapsulation.

The Cloud-Services project tag is not intended to have any tasks. Please check the list on https://phabricator.wikimedia.org/project/profile/832/ and replace it with a more specific project tag to this task. Thanks!

Feb 26 2025, 11:31 AM · cloud-services-team, Cloud-VPS

Feb 23 2025

Jeff_G added a comment to T387103: X's Tools cannot be reached.
In T387103#10573896, @Herald wrote:

The Cloud-Services project tag is not intended to have any tasks. Please check the list on https://phabricator.wikimedia.org/project/profile/832/ and replace it with a more specific project tag to this task. Thanks!

Feb 23 2025, 11:51 PM · XTools
Jeff_G created T387103: X's Tools cannot be reached.

The Cloud-Services project tag is not intended to have any tasks. Please check the list on https://phabricator.wikimedia.org/project/profile/832/ and replace it with a more specific project tag to this task. Thanks!

Feb 23 2025, 11:46 PM · XTools

Feb 13 2025

EBomani created T386416: metricsinfra: send alerts for the catalyst project to catalyst@w.o email.

The Cloud-Services project tag is not intended to have any tasks. Please check the list on https://phabricator.wikimedia.org/project/profile/832/ and replace it with a more specific project tag to this task. Thanks!

Feb 13 2025, 9:54 PM · Catalyst (ilopona), VPS-Projects, Cloud-VPS, cloud-services-team

Feb 12 2025

Gehel edited projects for T109715: Replicate production elasticsearch indices to labs, added: Discovery-Search (Current work); removed Discovery-Search.

The Cloud-Services project tag is not intended to have any tasks. Please check the list on https://phabricator.wikimedia.org/project/profile/832/ and replace it with a more specific project tag to this task. Thanks!

Feb 12 2025, 8:31 AM · Discovery-Search (Current work), Cloud-Services, Elasticsearch, Discovery-ARCHIVED

Feb 11 2025

KCVelaga_WMF created T386120: Support with steps to access Toolforge user data.

The Cloud-Services project tag is not intended to have any tasks. Please check the list on https://phabricator.wikimedia.org/project/profile/832/ and replace it with a more specific project tag to this task. Thanks!

Feb 11 2025, 4:57 PM · cloud-services-team, Toolforge, User-bd808
Maintenance_bot removed a project from T131906: Tool Labs elasticsearch cluster broken by production Puppet changes: Patch-For-Review.
Feb 11 2025, 4:32 PM · Discovery-Search (Current work), User-bd808, Discovery-ARCHIVED, Elasticsearch, Cloud-Services
Gehel edited projects for T131906: Tool Labs elasticsearch cluster broken by production Puppet changes, added: Discovery-Search (Current work); removed Discovery-Search, Toolforge.

The Cloud-Services project tag is not intended to have any tasks. Please check the list on https://phabricator.wikimedia.org/project/profile/832/ and replace it with a more specific project tag to this task. Thanks!

Feb 11 2025, 3:33 PM · Discovery-Search (Current work), User-bd808, Discovery-ARCHIVED, Elasticsearch, Cloud-Services
Gehel edited projects for T239135: Create partitioned CirrusSearchElasticaWrite topic, added: Discovery-Search (Current work); removed Discovery-Search.

The Cloud-Services project tag is not intended to have any tasks. Please check the list on https://phabricator.wikimedia.org/project/profile/832/ and replace it with a more specific project tag to this task. Thanks!

Feb 11 2025, 3:28 PM · Discovery-Search (Current work), Analytics, Cloud-Services, Elasticsearch, Discovery-ARCHIVED

Feb 7 2025

Susannaanas created T385871: Wikidocumentaries is down.

The Cloud-Services project tag is not intended to have any tasks. Please check the list on https://phabricator.wikimedia.org/project/profile/832/ and replace it with a more specific project tag to this task. Thanks!

Feb 7 2025, 11:34 AM · Wikidocumentaries

Jan 30 2025

Yug updated the task description for T385064: Assess opportunity to migrate from WMFR-OVH server to WMF Toolforge or WMF Cloud VPS.
Jan 30 2025, 12:00 AM · Lingua-Libre