production SAL

2024-01-31
15:32	<ayounsi@cumin2002>	START - Cookbook sre.hosts.decommission for hosts testvm2006.codfw.wmnet	[production]
15:29	<hnowlan@puppetmaster1001>	conftool action : set/pooled=yes; selector: name=maps1009.eqiad.wmnet	[production]
15:20	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1156 (T355609)', diff saved to https://phabricator.wikimedia.org/P55968 and previous config saved to /var/cache/conftool/dbconfig/20240131-152042-marostegui.json	[production]
15:18	<jgiannelos@deploy2002>	helmfile [codfw] DONE helmfile.d/services/mobileapps: apply	[production]
15:17	<jgiannelos@deploy2002>	helmfile [codfw] START helmfile.d/services/mobileapps: apply	[production]
15:17	<jgiannelos@deploy2002>	helmfile [eqiad] DONE helmfile.d/services/mobileapps: apply	[production]
15:16	<jgiannelos@deploy2002>	helmfile [eqiad] START helmfile.d/services/mobileapps: apply	[production]
15:16	<hnowlan@deploy2002>	helmfile [eqiad] DONE helmfile.d/services/thumbor: apply	[production]
15:16	<jgiannelos@deploy2002>	helmfile [staging] DONE helmfile.d/services/mobileapps: apply	[production]
15:16	<jgiannelos@deploy2002>	helmfile [staging] START helmfile.d/services/mobileapps: apply	[production]
15:14	<btullis@cumin1002>	END (PASS) - Cookbook sre.misc-clusters.roll-restart-reboot-eventschemas (exit_code=0) rolling reboot on A:schema	[production]
15:14	<jgiannelos@deploy2002>	helmfile [eqiad] START helmfile.d/services/mobileapps: apply	[production]
15:14	<jgiannelos@deploy2002>	helmfile [staging] DONE helmfile.d/services/mobileapps: apply	[production]
15:14	<jgiannelos@deploy2002>	helmfile [staging] START helmfile.d/services/mobileapps: apply	[production]
15:10	<marostegui@cumin1002>	dbctl commit (dc=all): 'Depooling db1156 (T355609)', diff saved to https://phabricator.wikimedia.org/P55967 and previous config saved to /var/cache/conftool/dbconfig/20240131-151016-marostegui.json	[production]
15:10	<marostegui@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on clouddb[1014,1018,1021].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance	[production]
15:10	<marostegui@cumin1002>	START - Cookbook sre.hosts.downtime for 12:00:00 on clouddb[1014,1018,1021].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance	[production]
15:09	<marostegui@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1156.eqiad.wmnet with reason: Maintenance	[production]
15:09	<marostegui@cumin1002>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1156.eqiad.wmnet with reason: Maintenance	[production]
15:09	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1146:3312 (T355609)', diff saved to https://phabricator.wikimedia.org/P55966 and previous config saved to /var/cache/conftool/dbconfig/20240131-150934-marostegui.json	[production]
15:09	<hnowlan@deploy2002>	helmfile [codfw] DONE helmfile.d/admin 'apply'.	[production]
15:08	<hnowlan@deploy2002>	helmfile [codfw] START helmfile.d/admin 'apply'.	[production]
15:08	<hnowlan@deploy2002>	helmfile [eqiad] DONE helmfile.d/admin 'apply'.	[production]
15:07	<hnowlan@deploy2002>	helmfile [eqiad] START helmfile.d/admin 'apply'.	[production]
15:06	<hnowlan@deploy2002>	helmfile [eqiad] START helmfile.d/services/thumbor: apply	[production]
15:05	<filippo@deploy2002>	helmfile [aux-k8s-eqiad] DONE helmfile.d/aus-k8s-eqiad-services/jaeger: apply	[production]
14:58	<btullis@cumin1002>	START - Cookbook sre.misc-clusters.roll-restart-reboot-eventschemas rolling reboot on A:schema	[production]
14:54	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1146:3312', diff saved to https://phabricator.wikimedia.org/P55965 and previous config saved to /var/cache/conftool/dbconfig/20240131-145427-marostegui.json	[production]
14:53	<brouberol>	I'm going to apply kafka log compaction for {eqiad,codfw}.mediawiki.currussearch.page_rerender.v1 on kafka-main-eqiad only (current replica) - T354794	[production]
14:52	<filippo@deploy2002>	helmfile [aux-k8s-eqiad] START helmfile.d/aus-k8s-eqiad-services/jaeger: apply	[production]
14:51	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host lists2001.codfw.wmnet	[production]
14:46	<urbanecm@deploy2002>	Finished scap: Backport for [[gerrit:994176|Add WikimediaCampaignEvents to extension list (T347894)]] (duration: 10m 41s)	[production]
14:45	<jmm@cumin2002>	START - Cookbook sre.hosts.reboot-single for host lists2001.codfw.wmnet	[production]
14:43	<filippo@deploy2002>	helmfile [aux-k8s-eqiad] DONE helmfile.d/aus-k8s-eqiad-services/jaeger: apply	[production]
14:40	<urbanecm@deploy2002>	cmelo and urbanecm: Continuing with sync	[production]
14:39	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1146:3312', diff saved to https://phabricator.wikimedia.org/P55964 and previous config saved to /var/cache/conftool/dbconfig/20240131-143921-marostegui.json	[production]
14:37	<urbanecm@deploy2002>	cmelo and urbanecm: Backport for [[gerrit:994176|Add WikimediaCampaignEvents to extension list (T347894)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug)	[production]
14:36	<urbanecm@deploy2002>	Started scap: Backport for [[gerrit:994176|Add WikimediaCampaignEvents to extension list (T347894)]]	[production]
14:30	<urbanecm@deploy2002>	Finished scap: Backport for [[gerrit:994702|[metawiki] Let admins add/remove the event-organizer group (T356070)]], [[gerrit:994711|index.php: Restore support for forcesafemode option. (T355314)]] (duration: 10m 05s)	[production]
14:28	<filippo@deploy2002>	helmfile [aux-k8s-eqiad] START helmfile.d/aus-k8s-eqiad-services/jaeger: apply	[production]
14:24	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1146:3312 (T355609)', diff saved to https://phabricator.wikimedia.org/P55963 and previous config saved to /var/cache/conftool/dbconfig/20240131-142413-marostegui.json	[production]
14:23	<urbanecm@deploy2002>	daimona and matmarex and urbanecm: Continuing with sync	[production]
14:21	<urbanecm@deploy2002>	daimona and matmarex and urbanecm: Backport for [[gerrit:994702|[metawiki] Let admins add/remove the event-organizer group (T356070)]], [[gerrit:994711|index.php: Restore support for forcesafemode option. (T355314)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug)	[production]
14:21	<eevans@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 30 days, 0:00:00 on restbase2020.codfw.wmnet with reason: Decommissioning — T352469	[production]
14:20	<eevans@cumin1002>	START - Cookbook sre.hosts.downtime for 30 days, 0:00:00 on restbase2020.codfw.wmnet with reason: Decommissioning — T352469	[production]
14:20	<urbanecm@deploy2002>	Started scap: Backport for [[gerrit:994702|[metawiki] Let admins add/remove the event-organizer group (T356070)]], [[gerrit:994711|index.php: Restore support for forcesafemode option. (T355314)]]	[production]
14:19	<urbanecm@deploy2002>	Finished scap: Backport for [[gerrit:994234|decodeURI fragments before sending them to discussiontoolsfindcomment (T356199)]], [[gerrit:994235|decodeURI fragments before sending them to discussiontoolsfindcomment (T356199)]], [[gerrit:994708|Add an exception for ConvenientDiscussions-style permalinks (T349653)]], [[gerrit:994709|Add an exception for ConvenientDiscussions-style permalinks (T349653)	[production]
14:18	<urbanecm>	[urbanecm@mwmaint2002 ~]$ mwscript migrateUserGroup.php --wiki=metawiki campaignevents-beta-tester event-organizer # T356070	[production]
14:13	<marostegui@cumin1002>	dbctl commit (dc=all): 'Depooling db1146:3312 (T355609)', diff saved to https://phabricator.wikimedia.org/P55962 and previous config saved to /var/cache/conftool/dbconfig/20240131-141316-marostegui.json	[production]
14:13	<marostegui@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1146.eqiad.wmnet with reason: Maintenance	[production]