2022-05-13
10:41 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 5 days, 0:00:00 on ganeti4002.ulsfo.wmnet with reason: Remove from cluster for eventual reimage [production]
10:18 <vgutierrez> disable puppet on gerrit1001 to fix /etc/ssh/ssh_config [production]
08:39 <elukey@deploy1002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-editquality-goodfaith' for release 'main' . [production]
08:03 <jynus> moving s2 database from db2101 to db2097 T299920 [production]
07:59 <moritzm> draining ganeti4002 T307997 [production]
07:52 <XioNoX> add init7 transit in drmrs [production]
07:39 <root@cumin2002> END (FAIL) - Cookbook sre.ganeti.addnode (exit_code=99) for new host ganeti4001.ulsfo.wmnet to ganeti01.svc.ulsfo.wmnet [production]
07:39 <root@cumin2002> START - Cookbook sre.ganeti.addnode for new host ganeti4001.ulsfo.wmnet to ganeti01.svc.ulsfo.wmnet [production]
07:27 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti4001.ulsfo.wmnet [production]
07:20 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti4001.ulsfo.wmnet [production]
07:18 <Amir1> start of mwscript extensions/Echo/maintenance/removeOrphanedEvents.php --wiki=wikidatawiki --force (T308084) [production]
02:14 <ejegg> updated payments-wiki from 8f46af9d to 590fac28 [production]
2022-05-12
21:56 <razzi@deploy1002> Finished deploy [analytics/turnilo/deploy@a2bdc3e]: (no justification provided) (duration: 02m 08s) [production]
21:53 <razzi@deploy1002> Started deploy [analytics/turnilo/deploy@a2bdc3e]: (no justification provided) [production]
21:43 <robh> cp306[23] returned to service, cp306[45] coming down for firmware update via T243167 [production]
21:15 <robh> cp306[01] returned to service, cp306[23] coming down for firmware update via T243167 [production]
20:59 <brennen> utc late backport & config window closed [production]
20:50 <robh> resuming last 6 esams cp host firmware updates via T243167. cp306[01] going offline [production]
20:50 <Krinkle> krinkle@mwmaint1002$ mwscript refreshLinks.php --wiki commonswiki --category 'Media_needing_categories_requiring_human_attention' (approximately 2000 tiny pages) [production]
20:44 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
20:43 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
20:43 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
20:40 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
20:39 <brennen@deploy1002> Finished scap: Backport for [[gerrit:791430]] viwiki: Enable "upload_by_url" for sysop (duration: 01m 36s) [production]
20:37 <brennen@deploy1002> Started scap: Backport for [[gerrit:791430]] viwiki: Enable "upload_by_url" for sysop [production]
20:35 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
20:34 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
20:34 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
20:33 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
20:32 <brennen@deploy1002> Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:791424|ruwiktionary: Add localized mobile wordmark (T308233)]] (duration: 00m 50s) [production]
20:31 <brennen@deploy1002> Synchronized static/images/mobile/copyright/wiktionary-wordmark-ru.svg: Config: [[gerrit:791424|ruwiktionary: Add localized mobile wordmark (T308233)]] (duration: 00m 49s) [production]
20:25 <brennen@deploy1002> Finished scap: Backport for [[gerrit:785229]] Enable "upload_by_url" feature on zhwiki (duration: 01m 46s) [production]
20:23 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
20:23 <brennen@deploy1002> Started scap: Backport for [[gerrit:785229]] Enable "upload_by_url" feature on zhwiki [production]
20:22 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
20:22 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
20:21 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
20:17 <brennen@deploy1002> backport aborted: (duration: 02m 05s) [production]
20:17 <brennen@deploy1002> prep aborted: (duration: 00m 01s) [production]
19:57 <hashar> Restarting Gerrit [production]
19:53 <mutante> gitlab2001 - systemctl start backup-restore - systemd[1]: Started GitLab Backup Restore. after gerrit:791410 for T308089 [production]
18:57 <jelto> restart gitlab2001 [production]
18:30 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
18:26 <krinkle@deploy1002> Synchronized w/static.php: Ic0a5eae4f721a16403071d1b2136cf23d78e4fa9 (duration: 00m 49s) [production]
18:26 <robh@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ganeti4001.ulsfo.wmnet with OS bullseye [production]
18:26 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
18:26 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
18:25 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
18:11 <robh@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ganeti4001.ulsfo.wmnet with reason: host reimage [production]
18:08 <robh@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on ganeti4001.ulsfo.wmnet with reason: host reimage [production]