Page MenuHomePhabricator

Data-ServicesComponent
ActivePublic

Milestones

Members

  • This project does not have any members.
  • View All

Details

Description

Data services available for use on Cloud-Services.

  • Wiki Replicas - redacted copies of the Wikimedia wiki's production databases
  • Wikimedia Dumps - public hosting of full text dumps from Wikimedia wikis and other datasets in Cloud (see also: Dumps-Generation for generation of dumps themselves)
  • Shared Storage - NFS storage for cross-VM and cross-project use

Recent Activity

Yesterday

Marostegui added a comment to T401295: Decide how to use the new clouddb hosts (clouddb102[2-5]).

So my proposal would be to do this considering 8 existing hosts + 4 expansion hosts already racked.
Considerations

  • Leave s1 alone
  • Add x1, x4 and of course maintain x3
  • Two sections per host (excluding s1) now that we have enough hosts
Wed, Oct 22, 1:14 PM · Data-Platform-SRE, Data-Services, Data-Persistence, cloud-services-team

Tue, Oct 21

Gehel moved T395881: Set up x1 replication to Wiki Replicas from Backlog - project to Backlog - operations on the Data-Platform-SRE (2025.10.17 - 2025.11.07) board.
Tue, Oct 21, 1:05 PM · Data-Platform-SRE (2025.10.17 - 2025.11.07), Essential-Work, Data-Engineering, Data-Services, Data-Persistence, cloud-services-team, Privacy Engineering
Gehel triaged T395881: Set up x1 replication to Wiki Replicas as Medium priority.
Tue, Oct 21, 1:05 PM · Data-Platform-SRE (2025.10.17 - 2025.11.07), Essential-Work, Data-Engineering, Data-Services, Data-Persistence, cloud-services-team, Privacy Engineering

Fri, Oct 17

Gehel moved T402145: Create view for betafeatures_user_counts table in wiki replicas from Backlog - project to Backlog - operations on the Data-Platform-SRE (2025.10.17 - 2025.11.07) board.
Fri, Oct 17, 8:52 AM · Data-Platform-SRE (2025.10.17 - 2025.11.07), Essential-Work, Data-Engineering, BetaFeatures, Data-Services, cloud-services-team
Gehel edited projects for T407485: Set up x1 replication to an-redacteddb1001, added: Data-Platform-SRE (2025.10.17 - 2025.11.07); removed Data-Platform-SRE (2025.09.26 - 2025.10.17).
Fri, Oct 17, 8:52 AM · Essential-Work, Data-Platform-SRE (2025.10.17 - 2025.11.07), Data-Engineering, Data-Services, Data-Persistence, cloud-services-team, Privacy Engineering
Gehel edited projects for T382607: Decision request - Who runs wikireplicas cookbooks, added: Data-Platform-SRE (2025.10.17 - 2025.11.07); removed Data-Platform-SRE (2025.09.26 - 2025.10.17).
Fri, Oct 17, 8:48 AM · Data-Platform-SRE (2025.10.17 - 2025.11.07), Essential-Work, cloud-services-team (FY2025/26-Q1), Data-Services, Data-Persistence, Cloud Services Proposals
Gehel edited projects for T387419: Create wiki replicas views for globaljsonlinks tables, added: Data-Platform-SRE (2025.10.17 - 2025.11.07); removed Data-Platform-SRE (2025.09.26 - 2025.10.17).
Fri, Oct 17, 8:48 AM · Data-Platform-SRE (2025.10.17 - 2025.11.07), Essential-Work, Data-Engineering-Radar, Data-Services, cloud-services-team, Data-Persistence, Data-Engineering
Gehel moved T407486: Set up x1 replication to an-redacteddb1001 from Done to Reported on the Data-Platform-SRE (2025.09.26 - 2025.10.17) board.
Fri, Oct 17, 8:30 AM · Data-Platform-SRE (2025.09.26 - 2025.10.17), Data-Engineering, Data-Services, Data-Persistence, cloud-services-team, Privacy Engineering
Gehel triaged T402145: Create view for betafeatures_user_counts table in wiki replicas as Medium priority.
Fri, Oct 17, 8:28 AM · Data-Platform-SRE (2025.10.17 - 2025.11.07), Essential-Work, Data-Engineering, BetaFeatures, Data-Services, cloud-services-team
Gehel edited projects for T402145: Create view for betafeatures_user_counts table in wiki replicas, added: Data-Platform-SRE (2025.09.26 - 2025.10.17); removed Data-Platform-SRE.
Fri, Oct 17, 8:28 AM · Data-Platform-SRE (2025.10.17 - 2025.11.07), Essential-Work, Data-Engineering, BetaFeatures, Data-Services, cloud-services-team
Gehel moved T407486: Set up x1 replication to an-redacteddb1001 from Backlog - operations to Done on the Data-Platform-SRE (2025.09.26 - 2025.10.17) board.
Fri, Oct 17, 8:25 AM · Data-Platform-SRE (2025.09.26 - 2025.10.17), Data-Engineering, Data-Services, Data-Persistence, cloud-services-team, Privacy Engineering
Marostegui added a comment to T395881: Set up x1 replication to Wiki Replicas.

I created T407485 to track the work required to add this section to an-redacteddb1001 and set up the initial replication.

I'm conscious that @Marostegui has said this, back in June.

This definitely needs a review from security before we can proceed.
This will also likely need specific triggers/views.

But first, we need to get an indication on:

  • tables that can be replicated entirely as they are
  • tables that cannot be replicated at all
  • tables that need some redactions
  • tables that require some specific views

When you say:

tables that cannot be replicated at all

...do you mean that certain table might be entirely filtered out of the MariaDB replication so that it never leaves the upstream x1 replica?

Fri, Oct 17, 5:51 AM · Data-Platform-SRE (2025.10.17 - 2025.11.07), Essential-Work, Data-Engineering, Data-Services, Data-Persistence, cloud-services-team, Privacy Engineering

Thu, Oct 16

Ahoelzl added a comment to T402145: Create view for betafeatures_user_counts table in wiki replicas.

Thanks @Gehel for asking for clarification. I agree with Joseph and approve the change, please move forward. Thanks!

Thu, Oct 16, 6:24 PM · Data-Platform-SRE (2025.10.17 - 2025.11.07), Essential-Work, Data-Engineering, BetaFeatures, Data-Services, cloud-services-team
Ladsgroup added a comment to T395881: Set up x1 replication to Wiki Replicas.

It's worth noting that because of the table catalog (T363581), now all tables that should be fully filtered or are fully public can be easily determined and already will be filtered when someone sets up the replication (We did catalog x1 tables too: T399302). That leaves only updating maintain-views for partially public tables.

Thu, Oct 16, 12:07 PM · Data-Platform-SRE (2025.10.17 - 2025.11.07), Essential-Work, Data-Engineering, Data-Services, Data-Persistence, cloud-services-team, Privacy Engineering
BTullis added a comment to T395881: Set up x1 replication to Wiki Replicas.

I created T407485 to track the work required to add this section to an-redacteddb1001 and set up the initial replication.

Thu, Oct 16, 11:50 AM · Data-Platform-SRE (2025.10.17 - 2025.11.07), Essential-Work, Data-Engineering, Data-Services, Data-Persistence, cloud-services-team, Privacy Engineering
BTullis merged T407486: Set up x1 replication to an-redacteddb1001 into T407485: Set up x1 replication to an-redacteddb1001.
Thu, Oct 16, 11:39 AM · Essential-Work, Data-Platform-SRE (2025.10.17 - 2025.11.07), Data-Engineering, Data-Services, Data-Persistence, cloud-services-team, Privacy Engineering
BTullis merged task T407486: Set up x1 replication to an-redacteddb1001 into T407485: Set up x1 replication to an-redacteddb1001.
Thu, Oct 16, 11:39 AM · Data-Platform-SRE (2025.09.26 - 2025.10.17), Data-Engineering, Data-Services, Data-Persistence, cloud-services-team, Privacy Engineering
BTullis moved T407486: Set up x1 replication to an-redacteddb1001 from Backlog - project to Backlog - operations on the Data-Platform-SRE (2025.09.26 - 2025.10.17) board.
Thu, Oct 16, 11:38 AM · Data-Platform-SRE (2025.09.26 - 2025.10.17), Data-Engineering, Data-Services, Data-Persistence, cloud-services-team, Privacy Engineering
BTullis created T407486: Set up x1 replication to an-redacteddb1001.
Thu, Oct 16, 11:38 AM · Data-Platform-SRE (2025.09.26 - 2025.10.17), Data-Engineering, Data-Services, Data-Persistence, cloud-services-team, Privacy Engineering
BTullis created T407485: Set up x1 replication to an-redacteddb1001.
Thu, Oct 16, 11:38 AM · Essential-Work, Data-Platform-SRE (2025.10.17 - 2025.11.07), Data-Engineering, Data-Services, Data-Persistence, cloud-services-team, Privacy Engineering
fnegri added a comment to T351457: [toolsdb] Replication stopped because of invalid event.

This happened again today. START REPLICA; was enough to resume replication.

Thu, Oct 16, 10:18 AM · cloud-services-team (FY2023/2024-Q1-Q2), Data-Services
JAllemandou added a comment to T402145: Create view for betafeatures_user_counts table in wiki replicas.

I have reviewed the table, it contains only a features list with how many users use it. It is safe to replicate.

Thu, Oct 16, 8:34 AM · Data-Platform-SRE (2025.10.17 - 2025.11.07), Essential-Work, Data-Engineering, BetaFeatures, Data-Services, cloud-services-team
Gehel added a comment to T402145: Create view for betafeatures_user_counts table in wiki replicas.

I pinged Data-Engineering on slack to get validation.

Thu, Oct 16, 8:16 AM · Data-Platform-SRE (2025.10.17 - 2025.11.07), Essential-Work, Data-Engineering, BetaFeatures, Data-Services, cloud-services-team

Tue, Oct 14

taavi added a comment to T392794: maintain-dbusers: Use cloud-private to talk to NFS servers instead of proxies.

yes.

Tue, Oct 14, 12:53 PM · cloud-services-team, Data-Services
fgiunchedi added a comment to T392794: maintain-dbusers: Use cloud-private to talk to NFS servers instead of proxies.

AIUI we would need to:

Tue, Oct 14, 12:48 PM · cloud-services-team, Data-Services

Thu, Oct 9

Ladsgroup moved T406562: BlockedExternalDomain hits are not available in the toolforge replicas from Triage to Done on the DBA board.
Thu, Oct 9, 11:29 AM · DBA, Data-Services, cloud-services-team
Ladsgroup closed T406562: BlockedExternalDomain hits are not available in the toolforge replicas as Resolved.
Thu, Oct 9, 11:29 AM · DBA, Data-Services, cloud-services-team
ops-monitoring-bot added a comment to T406562: BlockedExternalDomain hits are not available in the toolforge replicas.

Cookbook cookbooks.sre.wikireplicas.update-views started by ladsgroup completed:

  • an-redacteddb1001.eqiad.wmnet (PASS)
    • Ran Puppet agent
    • Ran 'maintain-views --replace --auto-depool --all-databases --table logging'
Thu, Oct 9, 11:28 AM · DBA, Data-Services, cloud-services-team
ops-monitoring-bot added a comment to T406562: BlockedExternalDomain hits are not available in the toolforge replicas.

Cookbook cookbooks.sre.wikireplicas.update-views run by ladsgroup: Started updating wiki replica views

Thu, Oct 9, 11:21 AM · DBA, Data-Services, cloud-services-team
ops-monitoring-bot added a comment to T406562: BlockedExternalDomain hits are not available in the toolforge replicas.

Cookbook cookbooks.sre.wikireplicas.update-views started by ladsgroup executed with errors:

  • an-redacteddb1001.eqiad.wmnet (PASS)
    • Ran Puppet agent
    • Ran 'maintain-views --replace --auto-depool --all-databases --table logging'
Thu, Oct 9, 11:21 AM · DBA, Data-Services, cloud-services-team
ops-monitoring-bot added a comment to T406562: BlockedExternalDomain hits are not available in the toolforge replicas.

Cookbook cookbooks.sre.wikireplicas.update-views run by ladsgroup: Started updating wiki replica views

Thu, Oct 9, 11:14 AM · DBA, Data-Services, cloud-services-team
gerritbot added a comment to T406562: BlockedExternalDomain hits are not available in the toolforge replicas.

Change #1194294 merged by Ladsgroup:

[operations/puppet@production] maintain-views: Add abusefilterblockeddomainhit to allowed log types

https://gerrit.wikimedia.org/r/1194294

Thu, Oct 9, 11:12 AM · DBA, Data-Services, cloud-services-team

Wed, Oct 8

Ottomata added a project to T395881: Set up x1 replication to Wiki Replicas: Data-Platform-SRE.

Tagging Data-Platform-SRE to get their input.

Wed, Oct 8, 5:27 PM · Data-Platform-SRE (2025.10.17 - 2025.11.07), Essential-Work, Data-Engineering, Data-Services, Data-Persistence, cloud-services-team, Privacy Engineering

Tue, Oct 7

gerritbot added a project to T406562: BlockedExternalDomain hits are not available in the toolforge replicas: Patch-For-Review.
Tue, Oct 7, 8:43 PM · DBA, Data-Services, cloud-services-team
gerritbot added a comment to T406562: BlockedExternalDomain hits are not available in the toolforge replicas.

Change #1194294 had a related patch set uploaded (by Ladsgroup; author: Amir Sarabadani):

[operations/puppet@production] maintain-views: Add abusefilterblockeddomainhit to allowed log types

https://gerrit.wikimedia.org/r/1194294

Tue, Oct 7, 8:43 PM · DBA, Data-Services, cloud-services-team
JJMC89 moved T406562: BlockedExternalDomain hits are not available in the toolforge replicas from Backlog to Wiki replicas on the Data-Services board.
Tue, Oct 7, 2:58 PM · DBA, Data-Services, cloud-services-team
taavi edited projects for T406562: BlockedExternalDomain hits are not available in the toolforge replicas, added: Data-Services; removed Toolforge.
Tue, Oct 7, 10:43 AM · DBA, Data-Services, cloud-services-team
Gehel added a project to T382607: Decision request - Who runs wikireplicas cookbooks: Essential-Work.
Tue, Oct 7, 8:52 AM · Data-Platform-SRE (2025.10.17 - 2025.11.07), Essential-Work, cloud-services-team (FY2025/26-Q1), Data-Services, Data-Persistence, Cloud Services Proposals
Gehel added a comment to T402145: Create view for betafeatures_user_counts table in wiki replicas.

We're waiting on Data-Engineering to validate.

Tue, Oct 7, 8:42 AM · Data-Platform-SRE (2025.10.17 - 2025.11.07), Essential-Work, Data-Engineering, BetaFeatures, Data-Services, cloud-services-team

Mon, Sep 29

FCeratto-WMF closed T404566: Prepare and check storage layer for tokwiki, a subtask of T404570: [wikireplicas] Create views for new wiki tokwiki, as Resolved.
Mon, Sep 29, 6:33 PM · Data-Services, cloud-services-team
FCeratto-WMF closed T404699: Prepare and check storage layer for mswikiquote, a subtask of T404703: [wikireplicas] Create views for new wiki mswikiquote, as Resolved.
Mon, Sep 29, 6:32 PM · Data-Services, cloud-services-team

Fri, Sep 26

Gehel removed a project from T404473: Unexpected error "Subquery returns more than 1 row" on wiki replicas: Data-Platform-SRE.
Fri, Sep 26, 2:02 PM · Multiblocks, Community-Tech, Data-Services, cloud-services-team
Gehel edited projects for T387419: Create wiki replicas views for globaljsonlinks tables, added: Data-Platform-SRE (2025.09.26 - 2025.10.17); removed Data-Platform-SRE (2025.09.05 - 2025.09.26).
Fri, Sep 26, 1:50 PM · Data-Platform-SRE (2025.10.17 - 2025.11.07), Essential-Work, Data-Engineering-Radar, Data-Services, cloud-services-team, Data-Persistence, Data-Engineering
Gehel edited projects for T382607: Decision request - Who runs wikireplicas cookbooks, added: Data-Platform-SRE (2025.09.26 - 2025.10.17); removed Data-Platform-SRE (2025.09.05 - 2025.09.26).
Fri, Sep 26, 1:50 PM · Data-Platform-SRE (2025.10.17 - 2025.11.07), Essential-Work, cloud-services-team (FY2025/26-Q1), Data-Services, Data-Persistence, Cloud Services Proposals
Maintenance_bot removed a project from T405728: JobUnavailable Reduced availability for job maintain_dbusers_eqiad in cloud@eqiad: Patch-For-Review.
Fri, Sep 26, 1:32 PM · Data-Services, cloud-services-team
taavi closed T405728: JobUnavailable Reduced availability for job maintain_dbusers_eqiad in cloud@eqiad as Resolved.
Fri, Sep 26, 1:00 PM · Data-Services, cloud-services-team
gerritbot added a comment to T405728: JobUnavailable Reduced availability for job maintain_dbusers_eqiad in cloud@eqiad.

Change #1191675 merged by Majavah:

[operations/puppet@production] P:wmcs: maintain-dbusers: Remove accounts names from Prometheus labels

https://gerrit.wikimedia.org/r/1191675

Fri, Sep 26, 12:55 PM · Data-Services, cloud-services-team
gerritbot added a project to T405728: JobUnavailable Reduced availability for job maintain_dbusers_eqiad in cloud@eqiad: Patch-For-Review.
Fri, Sep 26, 11:55 AM · Data-Services, cloud-services-team
gerritbot added a comment to T405728: JobUnavailable Reduced availability for job maintain_dbusers_eqiad in cloud@eqiad.

Change #1191675 had a related patch set uploaded (by Majavah; author: Majavah):

[operations/puppet@production] P:wmcs: maintain-dbusers: Remove accounts names from Prometheus labels

https://gerrit.wikimedia.org/r/1191675

Fri, Sep 26, 11:55 AM · Data-Services, cloud-services-team
taavi claimed T405728: JobUnavailable Reduced availability for job maintain_dbusers_eqiad in cloud@eqiad.
Fri, Sep 26, 11:53 AM · Data-Services, cloud-services-team