5801-5850 of 10000 results (77ms)
2022-09-06 ยง
21:39 <mutante> phabricator - passive hosts in codfw switched to readonly DB access (m3-slave, not m3-master) T315713 [production]
21:30 <root@cumin1001> END (ERROR) - Cookbook sre.network.prepare-upgrade (exit_code=97) [production]
21:13 <milimetric@deploy1002> Started deploy [analytics/refinery@b14c9f4]: Hotfix for requestctl field [production]
20:57 <milimetric@deploy1002> Finished deploy [analytics/refinery@8a5ce13] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@8a5ce13] (duration: 08m 54s) [production]
20:50 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
20:49 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
20:49 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
20:48 <milimetric@deploy1002> Started deploy [analytics/refinery@8a5ce13] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@8a5ce13] [production]
20:48 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
20:48 <cjming> end of UTC late backport window [production]
20:47 <cjming@deploy1002> Finished scap: Backport for [[gerrit:830213|Add localized wordmark for Bengali Wiktionary (T316953)]] (duration: 05m 24s) [production]
20:45 <milimetric@deploy1002> Finished deploy [analytics/refinery@8a5ce13]: Regular analytics weekly train [analytics/refinery@8a5ce13] (duration: 00m 16s) [production]
20:44 <milimetric@deploy1002> Started deploy [analytics/refinery@8a5ce13]: Regular analytics weekly train [analytics/refinery@8a5ce13] [production]
20:44 <milimetric@deploy1002> deploy aborted: Regular analytics weekly train [analytics/refinery@8a5ce13] (duration: 00m 00s) [production]
20:44 <milimetric@deploy1002> Started deploy [analytics/refinery@8a5ce13]: Regular analytics weekly train [analytics/refinery@8a5ce13] [production]
20:44 <milimetric@deploy1002> Finished deploy [analytics/refinery@8a5ce13] (thin): Regular analytics weekly train THIN [analytics/refinery@8a5ce13] (duration: 00m 08s) [production]
20:44 <milimetric@deploy1002> Started deploy [analytics/refinery@8a5ce13] (thin): Regular analytics weekly train THIN [analytics/refinery@8a5ce13] [production]
20:42 <cjming@deploy1002> cjming and mdsshakil: Backport for [[gerrit:830213|Add localized wordmark for Bengali Wiktionary (T316953)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet [production]
20:41 <cjming@deploy1002> Started scap: Backport for [[gerrit:830213|Add localized wordmark for Bengali Wiktionary (T316953)]] [production]
20:38 <milimetric@deploy1002> Finished deploy [analytics/refinery@8a5ce13]: Regular analytics weekly train [analytics/refinery@8a5ce13] (duration: 03m 15s) [production]
20:35 <milimetric@deploy1002> Started deploy [analytics/refinery@8a5ce13]: Regular analytics weekly train [analytics/refinery@8a5ce13] [production]
20:33 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
20:32 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db2117 (T314041)', diff saved to https://phabricator.wikimedia.org/P33978 and previous config saved to /var/cache/conftool/dbconfig/20220906-203258-ladsgroup.json [production]
20:32 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2117.codfw.wmnet with reason: Maintenance [production]
20:32 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2117.codfw.wmnet with reason: Maintenance [production]
20:32 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2124 (T314041)', diff saved to https://phabricator.wikimedia.org/P33977 and previous config saved to /var/cache/conftool/dbconfig/20220906-203236-ladsgroup.json [production]
20:29 <cjming@deploy1002> Finished scap: Backport for [[gerrit:830214|Ensure namespace filters is passed as a list]] (duration: 06m 35s) [production]
20:29 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
20:29 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
20:28 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
20:28 <milimetric@deploy1002> Finished deploy [analytics/refinery@8a5ce13]: Regular analytics weekly train [analytics/refinery@8a5ce13] (duration: 63m 48s) [production]
20:27 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1139.eqiad.wmnet with reason: Maintenance [production]
20:26 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1139.eqiad.wmnet with reason: Maintenance [production]
20:26 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1135 (T312863)', diff saved to https://phabricator.wikimedia.org/P33976 and previous config saved to /var/cache/conftool/dbconfig/20220906-202654-ladsgroup.json [production]
20:23 <cjming@deploy1002> cjming and ebernhardson: Backport for [[gerrit:830214|Ensure namespace filters is passed as a list]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet [production]
20:23 <cjming@deploy1002> Started scap: Backport for [[gerrit:830214|Ensure namespace filters is passed as a list]] [production]
20:16 <bd808> Forcing puppet runs on cloudweb100[34] to deploy new version of Striker (T296893) [production]
20:13 <bd808> Running database migrations for Striker (T296893) [production]
20:11 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1135', diff saved to https://phabricator.wikimedia.org/P33975 and previous config saved to /var/cache/conftool/dbconfig/20220906-201148-ladsgroup.json [production]
20:03 <inflatador> 'bking@cumin1001 disabling puppet on elastic codfw hosts T313431' [production]
19:56 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1135', diff saved to https://phabricator.wikimedia.org/P33974 and previous config saved to /var/cache/conftool/dbconfig/20220906-195642-ladsgroup.json [production]
19:41 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1135 (T312863)', diff saved to https://phabricator.wikimedia.org/P33973 and previous config saved to /var/cache/conftool/dbconfig/20220906-194135-ladsgroup.json [production]
19:24 <milimetric@deploy1002> Started deploy [analytics/refinery@8a5ce13]: Regular analytics weekly train [analytics/refinery@8a5ce13] [production]
18:45 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db2124 (T314041)', diff saved to https://phabricator.wikimedia.org/P33972 and previous config saved to /var/cache/conftool/dbconfig/20220906-184515-ladsgroup.json [production]
18:45 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2124.codfw.wmnet with reason: Maintenance [production]
18:44 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2124.codfw.wmnet with reason: Maintenance [production]
18:25 <cwhite> reduce codfw replicas 2 to 1 for logstash-(webrequest|k8s) partitions. Make space for failed logstash2027 - T316996 [production]
17:50 <root@cumin1001> START - Cookbook sre.network.prepare-upgrade [production]
17:48 <root@cumin1001> START - Cookbook sre.network.prepare-upgrade [production]
17:23 <moritzm> installing dpkg bugfix updates from bullseye point release [production]