2025-11-07
08:28 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on maps-test2001.codfw.wmnet with reason: host reimage [production]
08:27 <fceratto@cumin1003> END (PASS) - Cookbook sre.mysql.depool (exit_code=0) db2164 - Upgrading db2164.codfw.wmnet [production]
08:27 <fceratto@cumin1003> START - Cookbook sre.mysql.depool db2164 - Upgrading db2164.codfw.wmnet [production]
08:27 <fceratto@cumin1003> START - Cookbook sre.mysql.major-upgrade [production]
08:19 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2146 (T407997)', diff saved to https://phabricator.wikimedia.org/P85058 and previous config saved to /var/cache/conftool/dbconfig/20251107-081934-marostegui.json [production]
08:06 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host maps-test2001.codfw.wmnet with OS bookworm [production]
08:00 <jynus@cumin1003> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts dbprov2003.codfw.wmnet [production]
08:00 <jynus@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
08:00 <jynus@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: dbprov2003.codfw.wmnet decommissioned, removing all IPs except the asset tag one - jynus@cumin1003" [production]
07:59 <jynus@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: dbprov2003.codfw.wmnet decommissioned, removing all IPs except the asset tag one - jynus@cumin1003" [production]
07:58 <marostegui@cumin1003> dbctl commit (dc=all): 'Depooling db2146 (T407997)', diff saved to https://phabricator.wikimedia.org/P85057 and previous config saved to /var/cache/conftool/dbconfig/20251107-075857-marostegui.json [production]
07:58 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2146.codfw.wmnet with reason: Maintenance [production]
07:58 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2145 (T407997)', diff saved to https://phabricator.wikimedia.org/P85056 and previous config saved to /var/cache/conftool/dbconfig/20251107-075833-marostegui.json [production]
07:50 <jynus@cumin1003> START - Cookbook sre.dns.netbox [production]
07:45 <jynus@cumin1003> START - Cookbook sre.hosts.decommission for hosts dbprov2003.codfw.wmnet [production]
07:43 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2145', diff saved to https://phabricator.wikimedia.org/P85055 and previous config saved to /var/cache/conftool/dbconfig/20251107-074326-marostegui.json [production]
07:28 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2145', diff saved to https://phabricator.wikimedia.org/P85054 and previous config saved to /var/cache/conftool/dbconfig/20251107-072818-marostegui.json [production]
07:27 <moritzm> fix failed logrotation on install1005 [production]
07:13 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2145 (T407997)', diff saved to https://phabricator.wikimedia.org/P85053 and previous config saved to /var/cache/conftool/dbconfig/20251107-071310-marostegui.json [production]
06:52 <oblivian@cumin1003> END (PASS) - Cookbook sre.deploy.hiddenparma (exit_code=0) Hiddenparma deployment to the alerting hosts with reason: "Rate-limit by wmfuniq fix, conftool 6 - oblivian@cumin1003" [production]
06:52 <oblivian@cumin1003> END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) hiddenparma to alert[1002,2002].wikimedia.org with reason: Rate-limit by wmfuniq fix, conftool 6 - oblivian@cumin1003 [production]
06:52 <marostegui@cumin1003> dbctl commit (dc=all): 'Depooling db2145 (T407997)', diff saved to https://phabricator.wikimedia.org/P85052 and previous config saved to /var/cache/conftool/dbconfig/20251107-065226-marostegui.json [production]
06:52 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2145.codfw.wmnet with reason: Maintenance [production]
06:51 <oblivian@cumin1003> START - Cookbook sre.deploy.python-code hiddenparma to alert[1002,2002].wikimedia.org with reason: Rate-limit by wmfuniq fix, conftool 6 - oblivian@cumin1003 [production]
06:51 <oblivian@cumin1003> START - Cookbook sre.deploy.hiddenparma Hiddenparma deployment to the alerting hosts with reason: "Rate-limit by wmfuniq fix, conftool 6 - oblivian@cumin1003" [production]
06:35 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2141.codfw.wmnet with reason: Maintenance [production]
06:14 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1184.eqiad.wmnet with reason: Maintenance [production]
03:06 <ryankemper@cumin1002> END (PASS) - Cookbook sre.elasticsearch.rolling-operation (exit_code=0) Operation.REBOOT (2 nodes at a time) for ElasticSearch cluster search_eqiad: eqiad cluster reboot (apply updates) - ryankemper@cumin1002 - T390860 [production]
03:05 <tstarling@deploy2002> Finished scap sync-world: Backport for [[gerrit:1202368|Add English translations to namespaces that lack them (T407127)]], [[gerrit:1202369|Set robot noindex policy for draft namespaces that lacked it (T407127)]] (duration: 09m 58s) [production]
02:58 <tstarling@deploy2002> tstarling: Continuing with sync [production]
02:57 <tstarling@deploy2002> tstarling: Backport for [[gerrit:1202368|Add English translations to namespaces that lack them (T407127)]], [[gerrit:1202369|Set robot noindex policy for draft namespaces that lacked it (T407127)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
02:55 <tstarling@deploy2002> Started scap sync-world: Backport for [[gerrit:1202368|Add English translations to namespaces that lack them (T407127)]], [[gerrit:1202369|Set robot noindex policy for draft namespaces that lacked it (T407127)]] [production]
01:14 <mwpresync@deploy2002> Finished scap build-images: Publishing wmf/next image (duration: 13m 34s) [production]
01:00 <mwpresync@deploy2002> Started scap build-images: Publishing wmf/next image [production]
2025-11-06
23:56 <zabe@deploy2002> Finished scap sync-world: Backport for [[gerrit:1202659|Update for new WikimediaMaintenance script locations]] (duration: 07m 15s) [production]
23:51 <zabe@deploy2002> zabe: Continuing with sync [production]
23:51 <zabe@deploy2002> zabe: Backport for [[gerrit:1202659|Update for new WikimediaMaintenance script locations]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
23:48 <zabe@deploy2002> Started scap sync-world: Backport for [[gerrit:1202659|Update for new WikimediaMaintenance script locations]] [production]
23:44 <cjming@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic-next: apply [production]
23:43 <cjming@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic-next: apply [production]
23:12 <cjming> end of UTC late backport window [production]
23:11 <cjming@deploy2002> Finished scap sync-world: Backport for [[gerrit:1201826|Use wikimedia.org as the "server" for the wiki-agnostic RESTbase specs]], [[gerrit:1202766|Use prefixed 'sub' field in OAuth 2 access tokens on beta cluster (T399199)]], [[gerrit:1202807|Re-run xLab MW Module Loaded experiment v2 (T401705)]] (duration: 08m 34s) [production]
23:06 <cjming@deploy2002> cjming, tgr, aaron: Continuing with sync [production]
23:04 <cjming@deploy2002> cjming, tgr, aaron: Backport for [[gerrit:1201826|Use wikimedia.org as the "server" for the wiki-agnostic RESTbase specs]], [[gerrit:1202766|Use prefixed 'sub' field in OAuth 2 access tokens on beta cluster (T399199)]], [[gerrit:1202807|Re-run xLab MW Module Loaded experiment v2 (T401705)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there [production]
23:02 <cjming@deploy2002> Started scap sync-world: Backport for [[gerrit:1201826|Use wikimedia.org as the "server" for the wiki-agnostic RESTbase specs]], [[gerrit:1202766|Use prefixed 'sub' field in OAuth 2 access tokens on beta cluster (T399199)]], [[gerrit:1202807|Re-run xLab MW Module Loaded experiment v2 (T401705)]] [production]
22:49 <catrope@deploy2002> Finished scap sync-world: Backport for [[gerrit:1202780|AccountRecovery: Use canonical URL in confirmation email]], [[gerrit:1202346|Enable Special:AccountRecovery everywhere (T399742)]] (duration: 10m 24s) [production]
22:46 <ryankemper@cumin1002> START - Cookbook sre.elasticsearch.rolling-operation Operation.REBOOT (2 nodes at a time) for ElasticSearch cluster search_eqiad: eqiad cluster reboot (apply updates) - ryankemper@cumin1002 - T390860 [production]
22:42 <catrope@deploy2002> catrope: Continuing with sync [production]
22:40 <catrope@deploy2002> catrope: Backport for [[gerrit:1202780|AccountRecovery: Use canonical URL in confirmation email]], [[gerrit:1202346|Enable Special:AccountRecovery everywhere (T399742)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
22:40 <ryankemper@cumin1002> END (ERROR) - Cookbook sre.elasticsearch.rolling-operation (exit_code=97) Operation.RESTART (2 nodes at a time) for ElasticSearch cluster search_eqiad: eqiad cluster reboot (apply updates) - ryankemper@cumin1002 - T390860 [production]