2021-02-03
§
|
09:26 |
<vgutierrez> |
rolling restart varnish-fe on cp5004-5006 |
[production] |
09:20 |
<_joe_> |
restarting varnish-frontend on cp5001 |
[production] |
09:17 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1174 (re)pooling @ 25%: Slowly pool db1174 into s7', diff saved to https://phabricator.wikimedia.org/P14150 and previous config saved to /var/cache/conftool/dbconfig/20210203-091712-root.json |
[production] |
09:02 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1174 (re)pooling @ 20%: Slowly pool db1174 into s7', diff saved to https://phabricator.wikimedia.org/P14149 and previous config saved to /var/cache/conftool/dbconfig/20210203-090208-root.json |
[production] |
08:47 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1174 (re)pooling @ 15%: Slowly pool db1174 into s7', diff saved to https://phabricator.wikimedia.org/P14148 and previous config saved to /var/cache/conftool/dbconfig/20210203-084705-root.json |
[production] |
08:32 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1174 (re)pooling @ 13%: Slowly pool db1174 into s7', diff saved to https://phabricator.wikimedia.org/P14147 and previous config saved to /var/cache/conftool/dbconfig/20210203-083201-root.json |
[production] |
08:16 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1174 (re)pooling @ 10%: Slowly pool db1174 into s7', diff saved to https://phabricator.wikimedia.org/P14146 and previous config saved to /var/cache/conftool/dbconfig/20210203-081658-root.json |
[production] |
08:01 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1174 (re)pooling @ 8%: Slowly pool db1174 into s7', diff saved to https://phabricator.wikimedia.org/P14145 and previous config saved to /var/cache/conftool/dbconfig/20210203-080154-root.json |
[production] |
07:49 |
<marostegui> |
Stop mysql on db1093 to clone db1173 T258361 |
[production] |
07:47 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1093 to clone db1173 T258361', diff saved to https://phabricator.wikimedia.org/P14143 and previous config saved to /var/cache/conftool/dbconfig/20210203-074749-marostegui.json |
[production] |
07:46 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1174 (re)pooling @ 5%: Slowly pool db1174 into s7', diff saved to https://phabricator.wikimedia.org/P14142 and previous config saved to /var/cache/conftool/dbconfig/20210203-074651-root.json |
[production] |
07:13 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Give some more weight to db1174', diff saved to https://phabricator.wikimedia.org/P14141 and previous config saved to /var/cache/conftool/dbconfig/20210203-071310-marostegui.json |
[production] |
07:08 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1173.eqiad.wmnet with reason: REIMAGE |
[production] |
07:06 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db1173.eqiad.wmnet with reason: REIMAGE |
[production] |
06:41 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1078 - will be decommissioned', diff saved to https://phabricator.wikimedia.org/P14139 and previous config saved to /var/cache/conftool/dbconfig/20210203-064137-marostegui.json |
[production] |
06:38 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Pool db1174 with minimal weight for the first time in s7', diff saved to https://phabricator.wikimedia.org/P14138 and previous config saved to /var/cache/conftool/dbconfig/20210203-063812-marostegui.json |
[production] |
00:16 |
<jhuneidi@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'blubberoid' for release 'production' . |
[production] |
00:13 |
<legoktm@deploy1001> |
Synchronized logos/: Update and recompress logos for nlwiki, eswiki, ptwiki, ruwiki, svwiki, zhwiki (2/2) (duration: 01m 05s) |
[production] |
00:12 |
<legoktm@deploy1001> |
Synchronized static/images/project-logos/: Update and recompress logos for nlwiki, eswiki, ptwiki, ruwiki, svwiki, zhwiki (1/2) (duration: 01m 10s) |
[production] |
00:10 |
<jhuneidi@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'blubberoid' for release 'production' . |
[production] |
2021-02-02
§
|
23:53 |
<mutante> |
mw1300 - scap pull (it crashed earlier put is back after powercycling) |
[production] |
23:52 |
<jhuneidi@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'blubberoid' for release 'staging' . |
[production] |
23:30 |
<mutante> |
powercycling crashed m1300.eqiad.wmnet |
[production] |
21:56 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw1335.eqiad.wmnet |
[production] |
21:56 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw1336.eqiad.wmnet |
[production] |
21:56 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw1335.eqiad.wmnet |
[production] |
21:55 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw1336.eqiad.wmnet |
[production] |
21:09 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1335.eqiad.wmnet with reason: REIMAGE |
[production] |
21:07 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1336.eqiad.wmnet with reason: REIMAGE |
[production] |
21:06 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1335.eqiad.wmnet with reason: REIMAGE |
[production] |
21:05 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1336.eqiad.wmnet with reason: REIMAGE |
[production] |
20:12 |
<cdanis> |
✔️ cdanis@cumin1001.eqiad.wmnet ~ 🕒☕ sudo cumin A:cp 'enable-puppet "cdanis deploying I7003b7b6 and Idd0e124f5 T263496"' # test on cp2027 looks good, perhaps slightly-increased Varnish CPU consumption but hard to be sure |
[production] |
20:00 |
<Lucas_WMDE> |
Morning backport window done |
[production] |
19:58 |
<lucaswerkmeister-wmde@deploy1001> |
Synchronized php-1.36.0-wmf.29/extensions/WikibaseMediaInfo/: Backport: [[gerrit:661092|Pass $databaseName into WikiPageEntityDataLoader (T273622)]] (duration: 01m 07s) |
[production] |
19:57 |
<lucaswerkmeister-wmde@deploy1001> |
Synchronized php-1.36.0-wmf.29/extensions/Wikibase/: Backport: [[gerrit:661091|Add wiki ID to WikiPageEntityDataLoader (T273622)]] (duration: 01m 25s) |
[production] |
19:52 |
<cdanis> |
❌cdanis@cumin1001.eqiad.wmnet ~ 🕒☕ sudo cumin A:cp 'disable-puppet "cdanis deploying I7003b7b6 and Idd0e124f5 T263496"' |
[production] |
19:00 |
<mbsantos@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'proton' for release 'production' . |
[production] |
18:48 |
<mbsantos@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'proton' for release 'production' . |
[production] |
18:43 |
<mbsantos@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'proton' for release 'production' . |
[production] |
18:23 |
<milimetric@deploy1001> |
Finished deploy [analytics/turnilo/deploy@052348b]: (no justification provided) (duration: 00m 03s) |
[production] |
18:23 |
<milimetric@deploy1001> |
Started deploy [analytics/turnilo/deploy@052348b]: (no justification provided) |
[production] |
18:22 |
<milimetric@deploy1001> |
deploy aborted: (no justification provided) (duration: 00m 10s) |
[production] |
18:22 |
<milimetric@deploy1001> |
Started deploy [analytics/turnilo/deploy@052348b]: (no justification provided) |
[production] |
18:17 |
<mbsantos@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'mobileapps' for release 'production' . |
[production] |
18:07 |
<mbsantos@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'mobileapps' for release 'production' . |
[production] |
18:03 |
<mbsantos@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'mobileapps' for release 'staging' . |
[production] |
16:37 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host auth2001.codfw.wmnet |
[production] |
16:33 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host auth1002.eqiad.wmnet |
[production] |
16:30 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host auth1002.eqiad.wmnet |
[production] |
16:30 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host auth2001.codfw.wmnet |
[production] |