1101-1150 of 10000 results (120ms)
2023-12-05 §
09:06 <ayounsi@cumin1001> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 58952 [production]
09:05 <ayounsi@cumin1001> START - Cookbook sre.network.peering with action 'configure' for AS: 58952 [production]
09:04 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1106.eqiad.wmnet with reason: Maintenance [production]
09:03 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1106.eqiad.wmnet with reason: Maintenance [production]
08:59 <isaranto@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
08:26 <marostegui> Failover m2-master dbproxy1023.eqiad.wmnet -> dbproxy1025.eqiad.wmnet T351864 [production]
06:55 <vgutierrez> rolling restart of text|secondary LVS on eqsin effectively enabling IPIP encapsulation for ncredir@eqsin - T351069 [production]
06:23 <marostegui> Failover m5 from db1119 to db1176 - T352631 [production]
06:17 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db[2135,2160].codfw.wmnet,db[1119,1176,1217].eqiad.wmnet with reason: m5 master switch T352631 [production]
06:17 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on db[2135,2160].codfw.wmnet,db[1119,1176,1217].eqiad.wmnet with reason: m5 master switch T352631 [production]
01:18 <mutante> LDAP - added user xqt to group nda (T348520) [production]
01:12 <ejegg> payments-wiki upgraded from 5284fc99 to 1d24dc90 [production]
00:06 <eevans@cumin1001> END (FAIL) - Cookbook sre.puppet.migrate-host (exit_code=99) for host restbase2028.codfw.wmnet [production]
2023-12-04 §
23:53 <eevans@cumin1001> END (FAIL) - Cookbook sre.puppet.migrate-host (exit_code=99) for host restbase2028.codfw.wmnet [production]
23:52 <eevans@cumin1001> START - Cookbook sre.puppet.migrate-host for host restbase2028.codfw.wmnet [production]
22:53 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2189 (T348183)', diff saved to https://phabricator.wikimedia.org/P54146 and previous config saved to /var/cache/conftool/dbconfig/20231204-225336-arnaudb.json [production]
22:53 <eileen> civicrm upgraded from 83816165 to 297a091d [production]
22:38 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2189', diff saved to https://phabricator.wikimedia.org/P54145 and previous config saved to /var/cache/conftool/dbconfig/20231204-223830-arnaudb.json [production]
22:23 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2189', diff saved to https://phabricator.wikimedia.org/P54144 and previous config saved to /var/cache/conftool/dbconfig/20231204-222323-arnaudb.json [production]
22:08 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2189 (T348183)', diff saved to https://phabricator.wikimedia.org/P54142 and previous config saved to /var/cache/conftool/dbconfig/20231204-220817-arnaudb.json [production]
22:03 <arnaudb@cumin1001> dbctl commit (dc=all): 'Depooling db2189 (T348183)', diff saved to https://phabricator.wikimedia.org/P54141 and previous config saved to /var/cache/conftool/dbconfig/20231204-220345-arnaudb.json [production]
22:03 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2189.codfw.wmnet with reason: Maintenance [production]
22:03 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2189.codfw.wmnet with reason: Maintenance [production]
22:03 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2175 (T348183)', diff saved to https://phabricator.wikimedia.org/P54140 and previous config saved to /var/cache/conftool/dbconfig/20231204-220322-arnaudb.json [production]
21:58 <ebernhardson@deploy2002> Finished scap: Backport for [[gerrit:979693|Always load transcode state from db when opting in to primary db]] (duration: 08m 37s) [production]
21:52 <ebernhardson@deploy2002> ebernhardson and brion: Continuing with sync [production]
21:51 <ebernhardson@deploy2002> ebernhardson and brion: Backport for [[gerrit:979693|Always load transcode state from db when opting in to primary db]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
21:50 <ebernhardson@deploy2002> Started scap: Backport for [[gerrit:979693|Always load transcode state from db when opting in to primary db]] [production]
21:49 <ebernhardson@deploy2002> Finished scap: Backport for [[gerrit:979155|cirrus: Enable event bus bridge on more wikis (T352335)]] (duration: 09m 23s) [production]
21:48 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2175', diff saved to https://phabricator.wikimedia.org/P54138 and previous config saved to /var/cache/conftool/dbconfig/20231204-214816-arnaudb.json [production]
21:47 <ryankemper> T351503 Setting partition count to 5: `ryankemper@kafka-main2001:~$ kafka topics --alter --topic codfw.mediawiki.cirrussearch.page_rerender.v1 --partitions 5` [production]
21:47 <ryankemper> T351503 Setting partition count to 5: `ryankemper@kafka-main2001:~$ kafka topics --alter --topic eqiad.mediawiki.cirrussearch.page_rerender.v1 --partitions 5` [production]
21:42 <ebernhardson@deploy2002> ebernhardson: Continuing with sync [production]
21:41 <ebernhardson@deploy2002> ebernhardson: Backport for [[gerrit:979155|cirrus: Enable event bus bridge on more wikis (T352335)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
21:39 <ebernhardson@deploy2002> Started scap: Backport for [[gerrit:979155|cirrus: Enable event bus bridge on more wikis (T352335)]] [production]
21:33 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2175', diff saved to https://phabricator.wikimedia.org/P54137 and previous config saved to /var/cache/conftool/dbconfig/20231204-213309-arnaudb.json [production]
21:27 <ebernhardson@deploy2002> helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
21:27 <ebernhardson@deploy2002> helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply [production]
21:19 <pt1979@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-be1077.eqiad.wmnet with OS bullseye [production]
21:19 <pt1979@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - pt1979@cumin1001" [production]
21:18 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2175 (T348183)', diff saved to https://phabricator.wikimedia.org/P54136 and previous config saved to /var/cache/conftool/dbconfig/20231204-211803-arnaudb.json [production]
21:14 <pt1979@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - pt1979@cumin1001" [production]
21:13 <arnaudb@cumin1001> dbctl commit (dc=all): 'Depooling db2175 (T348183)', diff saved to https://phabricator.wikimedia.org/P54135 and previous config saved to /var/cache/conftool/dbconfig/20231204-211305-arnaudb.json [production]
21:12 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2175.codfw.wmnet with reason: Maintenance [production]
21:12 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2175.codfw.wmnet with reason: Maintenance [production]
21:12 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2170:3312 (T348183)', diff saved to https://phabricator.wikimedia.org/P54134 and previous config saved to /var/cache/conftool/dbconfig/20231204-211241-arnaudb.json [production]
21:09 <ryankemper> T351503 Setting partition count to 5: `ryankemper@kafka-main1001:~$ kafka topics --alter --topic codfw.mediawiki.cirrussearch.page_rerender.v1 --partitions 5` [production]
21:06 <ryankemper> T351503 Setting partition count to 5: `ryankemper@kafka-main1001:~$ kafka topics --alter --topic eqiad.mediawiki.cirrussearch.page_rerender.v1 --partitions 5` [production]
20:57 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2170:3312', diff saved to https://phabricator.wikimedia.org/P54133 and previous config saved to /var/cache/conftool/dbconfig/20231204-205735-arnaudb.json [production]
20:53 <pt1979@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be1077.eqiad.wmnet with reason: host reimage [production]