651-700 of 10000 results (77ms)
2024-01-31 ยง
15:41 <ayounsi@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
15:41 <ayounsi@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: testvm2006.codfw.wmnet decommissioned, removing all IPs except the asset tag one - ayounsi@cumin2002" [production]
15:39 <ayounsi@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: testvm2006.codfw.wmnet decommissioned, removing all IPs except the asset tag one - ayounsi@cumin2002" [production]
15:36 <ayounsi@cumin2002> START - Cookbook sre.dns.netbox [production]
15:36 <hnowlan@puppetmaster1001> conftool action : set/pooled=yes:weight=10; selector: name=maps2009.codfw.wmnet [production]
15:35 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1156', diff saved to https://phabricator.wikimedia.org/P55969 and previous config saved to /var/cache/conftool/dbconfig/20240131-153549-marostegui.json [production]
15:34 <hnowlan@puppetmaster1001> conftool action : set/weight=10; selector: name=maps1009.eqiad.wmnet [production]
15:32 <ayounsi@cumin2002> START - Cookbook sre.hosts.decommission for hosts testvm2006.codfw.wmnet [production]
15:29 <hnowlan@puppetmaster1001> conftool action : set/pooled=yes; selector: name=maps1009.eqiad.wmnet [production]
15:20 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1156 (T355609)', diff saved to https://phabricator.wikimedia.org/P55968 and previous config saved to /var/cache/conftool/dbconfig/20240131-152042-marostegui.json [production]
15:18 <jgiannelos@deploy2002> helmfile [codfw] DONE helmfile.d/services/mobileapps: apply [production]
15:17 <jgiannelos@deploy2002> helmfile [codfw] START helmfile.d/services/mobileapps: apply [production]
15:17 <jgiannelos@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mobileapps: apply [production]
15:16 <jgiannelos@deploy2002> helmfile [eqiad] START helmfile.d/services/mobileapps: apply [production]
15:16 <hnowlan@deploy2002> helmfile [eqiad] DONE helmfile.d/services/thumbor: apply [production]
15:16 <jgiannelos@deploy2002> helmfile [staging] DONE helmfile.d/services/mobileapps: apply [production]
15:16 <jgiannelos@deploy2002> helmfile [staging] START helmfile.d/services/mobileapps: apply [production]
15:14 <btullis@cumin1002> END (PASS) - Cookbook sre.misc-clusters.roll-restart-reboot-eventschemas (exit_code=0) rolling reboot on A:schema [production]
15:14 <jgiannelos@deploy2002> helmfile [eqiad] START helmfile.d/services/mobileapps: apply [production]
15:14 <jgiannelos@deploy2002> helmfile [staging] DONE helmfile.d/services/mobileapps: apply [production]
15:14 <jgiannelos@deploy2002> helmfile [staging] START helmfile.d/services/mobileapps: apply [production]
15:10 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1156 (T355609)', diff saved to https://phabricator.wikimedia.org/P55967 and previous config saved to /var/cache/conftool/dbconfig/20240131-151016-marostegui.json [production]
15:10 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on clouddb[1014,1018,1021].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance [production]
15:10 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 12:00:00 on clouddb[1014,1018,1021].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance [production]
15:09 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1156.eqiad.wmnet with reason: Maintenance [production]
15:09 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 6:00:00 on db1156.eqiad.wmnet with reason: Maintenance [production]
15:09 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1146:3312 (T355609)', diff saved to https://phabricator.wikimedia.org/P55966 and previous config saved to /var/cache/conftool/dbconfig/20240131-150934-marostegui.json [production]
15:09 <hnowlan@deploy2002> helmfile [codfw] DONE helmfile.d/admin 'apply'. [production]
15:08 <hnowlan@deploy2002> helmfile [codfw] START helmfile.d/admin 'apply'. [production]
15:08 <hnowlan@deploy2002> helmfile [eqiad] DONE helmfile.d/admin 'apply'. [production]
15:07 <hnowlan@deploy2002> helmfile [eqiad] START helmfile.d/admin 'apply'. [production]
15:06 <hnowlan@deploy2002> helmfile [eqiad] START helmfile.d/services/thumbor: apply [production]
15:05 <filippo@deploy2002> helmfile [aux-k8s-eqiad] DONE helmfile.d/aus-k8s-eqiad-services/jaeger: apply [production]
14:58 <btullis@cumin1002> START - Cookbook sre.misc-clusters.roll-restart-reboot-eventschemas rolling reboot on A:schema [production]
14:54 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1146:3312', diff saved to https://phabricator.wikimedia.org/P55965 and previous config saved to /var/cache/conftool/dbconfig/20240131-145427-marostegui.json [production]
14:53 <brouberol> I'm going to apply kafka log compaction for {eqiad,codfw}.mediawiki.currussearch.page_rerender.v1 on kafka-main-eqiad only (current replica) - T354794 [production]
14:52 <filippo@deploy2002> helmfile [aux-k8s-eqiad] START helmfile.d/aus-k8s-eqiad-services/jaeger: apply [production]
14:51 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host lists2001.codfw.wmnet [production]
14:46 <urbanecm@deploy2002> Finished scap: Backport for [[gerrit:994176|Add WikimediaCampaignEvents to extension list (T347894)]] (duration: 10m 41s) [production]
14:45 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host lists2001.codfw.wmnet [production]
14:43 <filippo@deploy2002> helmfile [aux-k8s-eqiad] DONE helmfile.d/aus-k8s-eqiad-services/jaeger: apply [production]
14:40 <urbanecm@deploy2002> cmelo and urbanecm: Continuing with sync [production]
14:39 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1146:3312', diff saved to https://phabricator.wikimedia.org/P55964 and previous config saved to /var/cache/conftool/dbconfig/20240131-143921-marostegui.json [production]
14:37 <urbanecm@deploy2002> cmelo and urbanecm: Backport for [[gerrit:994176|Add WikimediaCampaignEvents to extension list (T347894)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
14:36 <urbanecm@deploy2002> Started scap: Backport for [[gerrit:994176|Add WikimediaCampaignEvents to extension list (T347894)]] [production]
14:30 <urbanecm@deploy2002> Finished scap: Backport for [[gerrit:994702|[metawiki] Let admins add/remove the event-organizer group (T356070)]], [[gerrit:994711|index.php: Restore support for forcesafemode option. (T355314)]] (duration: 10m 05s) [production]
14:28 <filippo@deploy2002> helmfile [aux-k8s-eqiad] START helmfile.d/aus-k8s-eqiad-services/jaeger: apply [production]
14:24 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1146:3312 (T355609)', diff saved to https://phabricator.wikimedia.org/P55963 and previous config saved to /var/cache/conftool/dbconfig/20240131-142413-marostegui.json [production]
14:23 <urbanecm@deploy2002> daimona and matmarex and urbanecm: Continuing with sync [production]
14:21 <urbanecm@deploy2002> daimona and matmarex and urbanecm: Backport for [[gerrit:994702|[metawiki] Let admins add/remove the event-organizer group (T356070)]], [[gerrit:994711|index.php: Restore support for forcesafemode option. (T355314)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]