51-100 of 10000 results (62ms)
2022-10-03 ยง
16:14 <sukhe> disable Puppet on cp hosts in codfw: rolling out T309651 [production]
15:15 <sukhe> disable Puppet on cp hosts in ulsfo: rolling out T309651 [production]
15:14 <marostegui@cumin1001> dbctl commit (dc=all): 'db2123 (re)pooling @ 100%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35320 and previous config saved to /var/cache/conftool/dbconfig/20221003-151438-root.json [production]
15:06 <papaul> maintenance complete on mr1-esams [production]
14:59 <marostegui@cumin1001> dbctl commit (dc=all): 'db2123 (re)pooling @ 75%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35319 and previous config saved to /var/cache/conftool/dbconfig/20221003-145933-root.json [production]
14:44 <marostegui@cumin1001> dbctl commit (dc=all): 'db2123 (re)pooling @ 50%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35318 and previous config saved to /var/cache/conftool/dbconfig/20221003-144428-root.json [production]
14:35 <sukhe> upgrade A:cp and A:drmrs to ATS 9.1.3-1wm2 from 9.1.3-1wm1: T309651 [production]
14:31 <papaul> on going maintenance on mr1-esams [production]
14:29 <marostegui@cumin1001> dbctl commit (dc=all): 'db2123 (re)pooling @ 25%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35317 and previous config saved to /var/cache/conftool/dbconfig/20221003-142923-root.json [production]
14:14 <marostegui@cumin1001> dbctl commit (dc=all): 'db2123 (re)pooling @ 10%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35316 and previous config saved to /var/cache/conftool/dbconfig/20221003-141417-root.json [production]
14:08 <sukhe> upgrade cp4026, cp4032 to ATS 9.1.3-1wm2 from 9.1.3-1wm1: T309651 [production]
13:59 <marostegui@cumin1001> dbctl commit (dc=all): 'db2123 (re)pooling @ 5%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35315 and previous config saved to /var/cache/conftool/dbconfig/20221003-135912-root.json [production]
13:57 <sukhe> reprepro -C component/trafficserver9 include buster-wikimedia trafficserver_9.1.3-1wm2_amd64.changes: T309651 [production]
13:44 <marostegui@cumin1001> dbctl commit (dc=all): 'db2123 (re)pooling @ 3%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35314 and previous config saved to /var/cache/conftool/dbconfig/20221003-134407-root.json [production]
13:40 <marostegui@cumin1001> dbctl commit (dc=all): 'db2157 (re)pooling @ 100%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35313 and previous config saved to /var/cache/conftool/dbconfig/20221003-134024-root.json [production]
13:29 <marostegui@cumin1001> dbctl commit (dc=all): 'db2123 (re)pooling @ 1%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35312 and previous config saved to /var/cache/conftool/dbconfig/20221003-132902-root.json [production]
13:25 <marostegui@cumin1001> dbctl commit (dc=all): 'db2157 (re)pooling @ 75%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35311 and previous config saved to /var/cache/conftool/dbconfig/20221003-132519-root.json [production]
13:18 <vgutierrez> enforcing origin-form|asterisk-form for request-target on varnish (could trigger spikes of HTTP 400 errors) - T318676 [production]
13:10 <marostegui@cumin1001> dbctl commit (dc=all): 'db2157 (re)pooling @ 50%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35310 and previous config saved to /var/cache/conftool/dbconfig/20221003-131014-root.json [production]
12:55 <marostegui@cumin1001> dbctl commit (dc=all): 'db2157 (re)pooling @ 25%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35308 and previous config saved to /var/cache/conftool/dbconfig/20221003-125509-root.json [production]
12:40 <marostegui@cumin1001> dbctl commit (dc=all): 'db2157 (re)pooling @ 10%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35307 and previous config saved to /var/cache/conftool/dbconfig/20221003-124004-root.json [production]
12:25 <marostegui@cumin1001> dbctl commit (dc=all): 'db2157 (re)pooling @ 5%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35306 and previous config saved to /var/cache/conftool/dbconfig/20221003-122459-root.json [production]
12:09 <marostegui@cumin1001> dbctl commit (dc=all): 'db2157 (re)pooling @ 3%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35305 and previous config saved to /var/cache/conftool/dbconfig/20221003-120954-root.json [production]
12:02 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db2123', diff saved to https://phabricator.wikimedia.org/P35303 and previous config saved to /var/cache/conftool/dbconfig/20221003-120208-root.json [production]
12:01 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2123.codfw.wmnet with reason: Cloning [production]
12:01 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 8:00:00 on db2123.codfw.wmnet with reason: Cloning [production]
12:00 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1116.eqiad.wmnet with reason: Reboot [production]
12:00 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on db1116.eqiad.wmnet with reason: Reboot [production]
11:54 <marostegui@cumin1001> dbctl commit (dc=all): 'db2157 (re)pooling @ 1%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35302 and previous config saved to /var/cache/conftool/dbconfig/20221003-115449-root.json [production]
11:54 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1117.eqiad.wmnet with reason: Reboot [production]
11:54 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on db1117.eqiad.wmnet with reason: Reboot [production]
11:28 <hnowlan@puppetmaster1001> conftool action : set/pooled=true; selector: dnsdisc=sessionstore,name=eqiad [production]
11:28 <hnowlan@deploy1002> helmfile [eqiad] DONE helmfile.d/services/sessionstore: sync [production]
11:27 <hnowlan@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host sessionstore1003.eqiad.wmnet with OS buster [production]
11:27 <hnowlan@deploy1002> helmfile [eqiad] START helmfile.d/services/sessionstore: sync [production]
11:20 <hnowlan@puppetmaster1001> conftool action : set/pooled=false; selector: dnsdisc=sessionstore,name=eqiad [production]
11:08 <hnowlan@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sessionstore1003.eqiad.wmnet with reason: host reimage [production]
11:04 <hnowlan@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on sessionstore1003.eqiad.wmnet with reason: host reimage [production]
10:52 <hnowlan@cumin1001> START - Cookbook sre.hosts.reimage for host sessionstore1003.eqiad.wmnet with OS buster [production]
10:49 <hnowlan@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on sessionstore1003.eqiad.wmnet with reason: Prep for reimage [production]
10:48 <hnowlan@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on sessionstore1003.eqiad.wmnet with reason: Prep for reimage [production]
10:41 <hnowlan@puppetmaster1001> conftool action : set/pooled=true; selector: dnsdisc=sessionstore,name=eqiad [production]
10:41 <hnowlan@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host sessionstore1002.eqiad.wmnet with OS buster [production]
10:40 <hnowlan@deploy1002> helmfile [eqiad] DONE helmfile.d/services/sessionstore: sync [production]
10:40 <hnowlan@deploy1002> helmfile [eqiad] START helmfile.d/services/sessionstore: sync [production]
10:39 <hnowlan> starting cassandra on reimaged sessionstore1002 [production]
10:37 <_joe_> remove stale druid.svc.eqiad.wmnet certificate from the puppetmaster CA; it was expired anyways [production]
10:32 <hnowlan@puppetmaster1001> conftool action : set/pooled=false; selector: dnsdisc=sessionstore,name=eqiad [production]
10:31 <jelto@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:20:00 on gitlab1004.wikimedia.org with reason: upgrade gitlab1004 to new version [production]
10:31 <jelto@cumin1001> START - Cookbook sre.hosts.downtime for 0:20:00 on gitlab1004.wikimedia.org with reason: upgrade gitlab1004 to new version [production]