2025-08-21
15:07 <joal@deploy1003> Started deploy [analytics/refinery@9fc3b38]: Regular analytics weekly train [analytics/refinery@9fc3b380] [production]
15:06 <joal@deploy1003> Finished deploy [analytics/refinery@9fc3b38] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@9fc3b380] (duration: 00m 55s) [production]
15:05 <joal@deploy1003> Started deploy [analytics/refinery@9fc3b38] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@9fc3b380] [production]
15:00 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1238 (T399249)', diff saved to https://phabricator.wikimedia.org/P81662 and previous config saved to /var/cache/conftool/dbconfig/20250821-150021-fceratto.json [production]
14:52 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
14:51 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
14:50 <fceratto@cumin1002> START - Cookbook sre.mysql.sanitize-wiki Managing sanitization for wikis amwikimedia, cnwikimedia, donatewiki, gewikimedia, grwikimedia, hiwikimedia, idwikimedia, maiwikimedia, ngwikimedia, nostalgiawiki, punjabiwikimedia, romdwikimedia, rswikimedia, votewiki, wbwikimedia in section s5 [production]
14:50 <fceratto@cumin1002> END (FAIL) - Cookbook sre.mysql.sanitize-wiki (exit_code=99) Checking sanitization for wikis amwikimedia, cnwikimedia, donatewiki, gewikimedia, grwikimedia, hiwikimedia, idwikimedia, maiwikimedia, ngwikimedia, nostalgiawiki, punjabiwikimedia, romdwikimedia, rswikimedia, votewiki, wbwikimedia in section s5 [production]
14:48 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1240.eqiad.wmnet with reason: Maintenance [production]
14:47 <stevemunene@deploy1003> helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'sync'. [production]
14:44 <fceratto@cumin1002> START - Cookbook sre.mysql.sanitize-wiki Checking sanitization for wikis amwikimedia, cnwikimedia, donatewiki, gewikimedia, grwikimedia, hiwikimedia, idwikimedia, maiwikimedia, ngwikimedia, nostalgiawiki, punjabiwikimedia, romdwikimedia, rswikimedia, votewiki, wbwikimedia in section s5 [production]
14:34 <ladsgroup@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on es2040.codfw.wmnet with reason: 10GB-fication [production]
14:33 <bking@cumin1002> END (PASS) - Cookbook sre.elasticsearch.ban (exit_code=0) Unbanning all hosts in search_eqiad [production]
14:33 <bking@cumin1002> START - Cookbook sre.elasticsearch.ban Unbanning all hosts in search_eqiad [production]
14:30 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depool es2040 T399927', diff saved to https://phabricator.wikimedia.org/P81661 and previous config saved to /var/cache/conftool/dbconfig/20250821-143039-ladsgroup.json [production]
14:17 <zabe@deploy1003> helmfile [eqiad] DONE helmfile.d/services/mw-experimental: apply [production]
14:16 <zabe@deploy1003> helmfile [eqiad] START helmfile.d/services/mw-experimental: apply [production]
13:52 <kartik@deploy1003> Finished scap sync-world: Backport for [[gerrit:1180838|CX3 Build 1.0.0+20250821 (T387427)]] (duration: 19m 38s) [production]
13:47 <kartik@deploy1003> kartik: Continuing with sync [production]
13:47 <stevemunene@deploy1003> helmfile [dse-k8s-codfw] START helmfile.d/admin 'sync'. [production]
13:46 <stevemunene@deploy1003> helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'sync'. [production]
13:36 <kartik@deploy1003> kartik: Backport for [[gerrit:1180838|CX3 Build 1.0.0+20250821 (T387427)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
13:32 <kartik@deploy1003> Started scap sync-world: Backport for [[gerrit:1180838|CX3 Build 1.0.0+20250821 (T387427)]] [production]
13:30 <urbanecm@deploy1003> Finished scap sync-world: Backport for [[gerrit:1180858|Set wgCampaignEventsCountrySchemaMigrationStage to MIGRATION_WRITE_NEW (T397476)]] (duration: 14m 01s) [production]
13:28 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1239.eqiad.wmnet with reason: Maintenance [production]
13:28 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1235 (T399249)', diff saved to https://phabricator.wikimedia.org/P81660 and previous config saved to /var/cache/conftool/dbconfig/20250821-132839-fceratto.json [production]
13:27 <stevemunene@deploy1003> helmfile [dse-k8s-codfw] START helmfile.d/admin 'sync'. [production]
13:27 <stevemunene@deploy1003> helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'sync'. [production]
13:27 <stevemunene@deploy1003> helmfile [dse-k8s-codfw] START helmfile.d/admin 'sync'. [production]
13:26 <stevemunene@deploy1003> helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'sync'. [production]
13:25 <stevemunene@deploy1003> helmfile [dse-k8s-codfw] START helmfile.d/admin 'sync'. [production]
13:25 <urbanecm@deploy1003> urbanecm, daimona: Continuing with sync [production]
13:22 <stevemunene@deploy1003> helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'sync'. [production]
13:21 <urbanecm@deploy1003> urbanecm, daimona: Backport for [[gerrit:1180858|Set wgCampaignEventsCountrySchemaMigrationStage to MIGRATION_WRITE_NEW (T397476)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
13:21 <oblivian@cumin1003> END (PASS) - Cookbook sre.deploy.hiddenparma (exit_code=0) Hiddenparma deployment to the alerting hosts with reason: "Bugfix - oblivian@cumin1003" [production]
13:21 <oblivian@cumin1003> END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) hiddenparma to alert[1002,2002].wikimedia.org with reason: Bugfix - oblivian@cumin1003 [production]
13:20 <oblivian@cumin1003> START - Cookbook sre.deploy.python-code hiddenparma to alert[1002,2002].wikimedia.org with reason: Bugfix - oblivian@cumin1003 [production]
13:20 <oblivian@cumin1003> START - Cookbook sre.deploy.hiddenparma Hiddenparma deployment to the alerting hosts with reason: "Bugfix - oblivian@cumin1003" [production]
13:19 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2212.codfw.wmnet with reason: Maintenance [production]
13:16 <urbanecm@deploy1003> Started scap sync-world: Backport for [[gerrit:1180858|Set wgCampaignEventsCountrySchemaMigrationStage to MIGRATION_WRITE_NEW (T397476)]] [production]
13:16 <urbanecm@deploy1003> Finished scap sync-world: Backport for [[gerrit:1180701|bewwiktionary: set sitename, project namespace & timezone (T402134)]], [[gerrit:1180700|bewwiktionary: add logos (T402134)]] (duration: 11m 34s) [production]
13:13 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1235', diff saved to https://phabricator.wikimedia.org/P81659 and previous config saved to /var/cache/conftool/dbconfig/20250821-131333-fceratto.json [production]
13:10 <ayounsi@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host testvm7001.magru.wmnet with OS bookworm [production]
13:09 <urbanecm@deploy1003> urbanecm, anzx: Continuing with sync [production]
13:07 <urbanecm@deploy1003> urbanecm, anzx: Backport for [[gerrit:1180701|bewwiktionary: set sitename, project namespace & timezone (T402134)]], [[gerrit:1180700|bewwiktionary: add logos (T402134)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
13:04 <urbanecm@deploy1003> Started scap sync-world: Backport for [[gerrit:1180701|bewwiktionary: set sitename, project namespace & timezone (T402134)]], [[gerrit:1180700|bewwiktionary: add logos (T402134)]] [production]
12:58 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1235', diff saved to https://phabricator.wikimedia.org/P81658 and previous config saved to /var/cache/conftool/dbconfig/20250821-125825-fceratto.json [production]
12:53 <ayounsi@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on testvm7001.magru.wmnet with reason: host reimage [production]
12:48 <ayounsi@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on testvm7001.magru.wmnet with reason: host reimage [production]
12:43 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1235 (T399249)', diff saved to https://phabricator.wikimedia.org/P81657 and previous config saved to /var/cache/conftool/dbconfig/20250821-124318-fceratto.json [production]