2025-11-07
08:56 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host maps-test2001.codfw.wmnet with OS bookworm [production]
08:49 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2146', diff saved to https://phabricator.wikimedia.org/P85061 and previous config saved to /var/cache/conftool/dbconfig/20251107-084949-marostegui.json [production]
08:44 <jynus@cumin1003> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts dbprov1003.eqiad.wmnet [production]
08:44 <jynus@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
08:44 <jynus@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: dbprov1003.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - jynus@cumin1003" [production]
08:40 <jynus@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: dbprov1003.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - jynus@cumin1003" [production]
08:34 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2146', diff saved to https://phabricator.wikimedia.org/P85060 and previous config saved to /var/cache/conftool/dbconfig/20251107-083442-marostegui.json [production]
08:34 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on maps-test2001.codfw.wmnet with reason: host reimage [production]
08:32 <jynus@cumin1003> START - Cookbook sre.dns.netbox [production]
08:31 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
08:29 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
08:28 <jynus@cumin1003> START - Cookbook sre.hosts.decommission for hosts dbprov1003.eqiad.wmnet [production]
08:28 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on maps-test2001.codfw.wmnet with reason: host reimage [production]
08:27 <fceratto@cumin1003> END (PASS) - Cookbook sre.mysql.depool (exit_code=0) db2164 - Upgrading db2164.codfw.wmnet [production]
08:27 <fceratto@cumin1003> START - Cookbook sre.mysql.depool db2164 - Upgrading db2164.codfw.wmnet [production]
08:27 <fceratto@cumin1003> START - Cookbook sre.mysql.major-upgrade [production]
08:19 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2146 (T407997)', diff saved to https://phabricator.wikimedia.org/P85058 and previous config saved to /var/cache/conftool/dbconfig/20251107-081934-marostegui.json [production]
08:06 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host maps-test2001.codfw.wmnet with OS bookworm [production]
08:00 <jynus@cumin1003> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts dbprov2003.codfw.wmnet [production]
08:00 <jynus@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
08:00 <jynus@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: dbprov2003.codfw.wmnet decommissioned, removing all IPs except the asset tag one - jynus@cumin1003" [production]
07:59 <jynus@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: dbprov2003.codfw.wmnet decommissioned, removing all IPs except the asset tag one - jynus@cumin1003" [production]
07:58 <marostegui@cumin1003> dbctl commit (dc=all): 'Depooling db2146 (T407997)', diff saved to https://phabricator.wikimedia.org/P85057 and previous config saved to /var/cache/conftool/dbconfig/20251107-075857-marostegui.json [production]
07:58 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2146.codfw.wmnet with reason: Maintenance [production]
07:58 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2145 (T407997)', diff saved to https://phabricator.wikimedia.org/P85056 and previous config saved to /var/cache/conftool/dbconfig/20251107-075833-marostegui.json [production]
07:50 <jynus@cumin1003> START - Cookbook sre.dns.netbox [production]
07:45 <jynus@cumin1003> START - Cookbook sre.hosts.decommission for hosts dbprov2003.codfw.wmnet [production]
07:43 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2145', diff saved to https://phabricator.wikimedia.org/P85055 and previous config saved to /var/cache/conftool/dbconfig/20251107-074326-marostegui.json [production]
07:28 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2145', diff saved to https://phabricator.wikimedia.org/P85054 and previous config saved to /var/cache/conftool/dbconfig/20251107-072818-marostegui.json [production]
07:27 <moritzm> fix failed logrotation on install1005 [production]
07:13 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2145 (T407997)', diff saved to https://phabricator.wikimedia.org/P85053 and previous config saved to /var/cache/conftool/dbconfig/20251107-071310-marostegui.json [production]
06:52 <oblivian@cumin1003> END (PASS) - Cookbook sre.deploy.hiddenparma (exit_code=0) Hiddenparma deployment to the alerting hosts with reason: "Rate-limit by wmfuniq fix, conftool 6 - oblivian@cumin1003" [production]
06:52 <oblivian@cumin1003> END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) hiddenparma to alert[1002,2002].wikimedia.org with reason: Rate-limit by wmfuniq fix, conftool 6 - oblivian@cumin1003 [production]
06:52 <marostegui@cumin1003> dbctl commit (dc=all): 'Depooling db2145 (T407997)', diff saved to https://phabricator.wikimedia.org/P85052 and previous config saved to /var/cache/conftool/dbconfig/20251107-065226-marostegui.json [production]
06:52 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2145.codfw.wmnet with reason: Maintenance [production]
06:51 <oblivian@cumin1003> START - Cookbook sre.deploy.python-code hiddenparma to alert[1002,2002].wikimedia.org with reason: Rate-limit by wmfuniq fix, conftool 6 - oblivian@cumin1003 [production]
06:51 <oblivian@cumin1003> START - Cookbook sre.deploy.hiddenparma Hiddenparma deployment to the alerting hosts with reason: "Rate-limit by wmfuniq fix, conftool 6 - oblivian@cumin1003" [production]
06:35 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2141.codfw.wmnet with reason: Maintenance [production]
06:14 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1184.eqiad.wmnet with reason: Maintenance [production]
03:06 <ryankemper@cumin1002> END (PASS) - Cookbook sre.elasticsearch.rolling-operation (exit_code=0) Operation.REBOOT (2 nodes at a time) for ElasticSearch cluster search_eqiad: eqiad cluster reboot (apply updates) - ryankemper@cumin1002 - T390860 [production]
03:05 <tstarling@deploy2002> Finished scap sync-world: Backport for [[gerrit:1202368|Add English translations to namespaces that lack them (T407127)]], [[gerrit:1202369|Set robot noindex policy for draft namespaces that lacked it (T407127)]] (duration: 09m 58s) [production]
02:58 <tstarling@deploy2002> tstarling: Continuing with sync [production]
02:57 <tstarling@deploy2002> tstarling: Backport for [[gerrit:1202368|Add English translations to namespaces that lack them (T407127)]], [[gerrit:1202369|Set robot noindex policy for draft namespaces that lacked it (T407127)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
02:55 <tstarling@deploy2002> Started scap sync-world: Backport for [[gerrit:1202368|Add English translations to namespaces that lack them (T407127)]], [[gerrit:1202369|Set robot noindex policy for draft namespaces that lacked it (T407127)]] [production]
01:14 <mwpresync@deploy2002> Finished scap build-images: Publishing wmf/next image (duration: 13m 34s) [production]
01:00 <mwpresync@deploy2002> Started scap build-images: Publishing wmf/next image [production]
2025-11-06
23:56 <zabe@deploy2002> Finished scap sync-world: Backport for [[gerrit:1202659|Update for new WikimediaMaintenance script locations]] (duration: 07m 15s) [production]
23:51 <zabe@deploy2002> zabe: Continuing with sync [production]
23:51 <zabe@deploy2002> zabe: Backport for [[gerrit:1202659|Update for new WikimediaMaintenance script locations]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
23:48 <zabe@deploy2002> Started scap sync-world: Backport for [[gerrit:1202659|Update for new WikimediaMaintenance script locations]] [production]