2022-10-03
ยง
|
16:14 |
<sukhe> |
disable Puppet on cp hosts in codfw: rolling out T309651 |
[production] |
15:15 |
<sukhe> |
disable Puppet on cp hosts in ulsfo: rolling out T309651 |
[production] |
15:14 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2123 (re)pooling @ 100%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35320 and previous config saved to /var/cache/conftool/dbconfig/20221003-151438-root.json |
[production] |
15:06 |
<papaul> |
maintenance complete on mr1-esams |
[production] |
14:59 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2123 (re)pooling @ 75%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35319 and previous config saved to /var/cache/conftool/dbconfig/20221003-145933-root.json |
[production] |
14:44 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2123 (re)pooling @ 50%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35318 and previous config saved to /var/cache/conftool/dbconfig/20221003-144428-root.json |
[production] |
14:35 |
<sukhe> |
upgrade A:cp and A:drmrs to ATS 9.1.3-1wm2 from 9.1.3-1wm1: T309651 |
[production] |
14:31 |
<papaul> |
on going maintenance on mr1-esams |
[production] |
14:29 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2123 (re)pooling @ 25%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35317 and previous config saved to /var/cache/conftool/dbconfig/20221003-142923-root.json |
[production] |
14:14 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2123 (re)pooling @ 10%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35316 and previous config saved to /var/cache/conftool/dbconfig/20221003-141417-root.json |
[production] |
14:08 |
<sukhe> |
upgrade cp4026, cp4032 to ATS 9.1.3-1wm2 from 9.1.3-1wm1: T309651 |
[production] |
13:59 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2123 (re)pooling @ 5%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35315 and previous config saved to /var/cache/conftool/dbconfig/20221003-135912-root.json |
[production] |
13:57 |
<sukhe> |
reprepro -C component/trafficserver9 include buster-wikimedia trafficserver_9.1.3-1wm2_amd64.changes: T309651 |
[production] |
13:44 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2123 (re)pooling @ 3%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35314 and previous config saved to /var/cache/conftool/dbconfig/20221003-134407-root.json |
[production] |
13:40 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2157 (re)pooling @ 100%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35313 and previous config saved to /var/cache/conftool/dbconfig/20221003-134024-root.json |
[production] |
13:29 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2123 (re)pooling @ 1%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35312 and previous config saved to /var/cache/conftool/dbconfig/20221003-132902-root.json |
[production] |
13:25 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2157 (re)pooling @ 75%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35311 and previous config saved to /var/cache/conftool/dbconfig/20221003-132519-root.json |
[production] |
13:18 |
<vgutierrez> |
enforcing origin-form|asterisk-form for request-target on varnish (could trigger spikes of HTTP 400 errors) - T318676 |
[production] |
13:10 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2157 (re)pooling @ 50%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35310 and previous config saved to /var/cache/conftool/dbconfig/20221003-131014-root.json |
[production] |
12:55 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2157 (re)pooling @ 25%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35308 and previous config saved to /var/cache/conftool/dbconfig/20221003-125509-root.json |
[production] |
12:40 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2157 (re)pooling @ 10%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35307 and previous config saved to /var/cache/conftool/dbconfig/20221003-124004-root.json |
[production] |
12:25 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2157 (re)pooling @ 5%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35306 and previous config saved to /var/cache/conftool/dbconfig/20221003-122459-root.json |
[production] |
12:09 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2157 (re)pooling @ 3%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35305 and previous config saved to /var/cache/conftool/dbconfig/20221003-120954-root.json |
[production] |
12:02 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db2123', diff saved to https://phabricator.wikimedia.org/P35303 and previous config saved to /var/cache/conftool/dbconfig/20221003-120208-root.json |
[production] |
12:01 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2123.codfw.wmnet with reason: Cloning |
[production] |
12:01 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db2123.codfw.wmnet with reason: Cloning |
[production] |
12:00 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1116.eqiad.wmnet with reason: Reboot |
[production] |
12:00 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db1116.eqiad.wmnet with reason: Reboot |
[production] |
11:54 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2157 (re)pooling @ 1%: After upgrade', diff saved to https://phabricator.wikimedia.org/P35302 and previous config saved to /var/cache/conftool/dbconfig/20221003-115449-root.json |
[production] |
11:54 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1117.eqiad.wmnet with reason: Reboot |
[production] |
11:54 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db1117.eqiad.wmnet with reason: Reboot |
[production] |
11:28 |
<hnowlan@puppetmaster1001> |
conftool action : set/pooled=true; selector: dnsdisc=sessionstore,name=eqiad |
[production] |
11:28 |
<hnowlan@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/sessionstore: sync |
[production] |
11:27 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host sessionstore1003.eqiad.wmnet with OS buster |
[production] |
11:27 |
<hnowlan@deploy1002> |
helmfile [eqiad] START helmfile.d/services/sessionstore: sync |
[production] |
11:20 |
<hnowlan@puppetmaster1001> |
conftool action : set/pooled=false; selector: dnsdisc=sessionstore,name=eqiad |
[production] |
11:08 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sessionstore1003.eqiad.wmnet with reason: host reimage |
[production] |
11:04 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on sessionstore1003.eqiad.wmnet with reason: host reimage |
[production] |
10:52 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.reimage for host sessionstore1003.eqiad.wmnet with OS buster |
[production] |
10:49 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on sessionstore1003.eqiad.wmnet with reason: Prep for reimage |
[production] |
10:48 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.downtime for 1:00:00 on sessionstore1003.eqiad.wmnet with reason: Prep for reimage |
[production] |
10:41 |
<hnowlan@puppetmaster1001> |
conftool action : set/pooled=true; selector: dnsdisc=sessionstore,name=eqiad |
[production] |
10:41 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host sessionstore1002.eqiad.wmnet with OS buster |
[production] |
10:40 |
<hnowlan@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/sessionstore: sync |
[production] |
10:40 |
<hnowlan@deploy1002> |
helmfile [eqiad] START helmfile.d/services/sessionstore: sync |
[production] |
10:39 |
<hnowlan> |
starting cassandra on reimaged sessionstore1002 |
[production] |
10:37 |
<_joe_> |
remove stale druid.svc.eqiad.wmnet certificate from the puppetmaster CA; it was expired anyways |
[production] |
10:32 |
<hnowlan@puppetmaster1001> |
conftool action : set/pooled=false; selector: dnsdisc=sessionstore,name=eqiad |
[production] |
10:31 |
<jelto@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:20:00 on gitlab1004.wikimedia.org with reason: upgrade gitlab1004 to new version |
[production] |
10:31 |
<jelto@cumin1001> |
START - Cookbook sre.hosts.downtime for 0:20:00 on gitlab1004.wikimedia.org with reason: upgrade gitlab1004 to new version |
[production] |