2021-12-17
§
|
22:30 |
<bblack@cumin1001> |
END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host prometheus6001.drmrs.wmnet |
[production] |
21:28 |
<bblack@cumin1001> |
START - Cookbook sre.ganeti.makevm for new host prometheus6001.drmrs.wmnet |
[production] |
21:21 |
<bblack@cumin1001> |
END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host prometheus6001.drmrs.wmnet |
[production] |
21:17 |
<bblack@cumin1001> |
START - Cookbook sre.ganeti.makevm for new host prometheus6001.drmrs.wmnet |
[production] |
21:08 |
<legoktm> |
repooling wtp1025 |
[production] |
20:56 |
<mutante> |
puppetmaster - revoking and recreating TLS cert for miscweb one more time because "tendril-static" isn't "static-tendril" ;Pp |
[production] |
20:40 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
20:39 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
20:34 |
<legoktm@deploy1002> |
Synchronized wmf-config/CommonSettings.php: Set $wgMaxImageArea = false; (T291014) (duration: 00m 59s) |
[production] |
19:46 |
<mutante> |
adding dbtree.wikimedia.org and tendril.wikimedia.org to TLS cert for webserver-misc-apps.discovery.wmnet - recreating cert T297605 |
[production] |
19:44 |
<ryankemper> |
T297910 `ryankemper@mwmaint1002:~$ sudo modify-ldap-group wmf` to add `bking` |
[production] |
19:43 |
<ryankemper> |
T297910 `ryankemper@mwmaint1002:~$ sudo modify-ldap-group ops` to add `bking` |
[production] |
19:39 |
<mutante> |
puppetmaster1001 - sudo puppet cert clean webserver-misc-apps.discovery.wmnet - Revoked certificate with serial 8502 |
[production] |
19:26 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . |
[production] |
19:15 |
<rzl> |
rzl@apt1001:~$ sudo -i reprepro -C main include buster-wikimedia /home/rzl/python3-imagecatalog/imagecatalog_0.0.2-1_amd64.changes |
[production] |
19:06 |
<cmjohnson@cumin1001> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host ganeti1025.eqiad.wmnet with OS buster |
[production] |
18:49 |
<cmjohnson@cumin1001> |
START - Cookbook sre.hosts.reimage for host ganeti1025.eqiad.wmnet with OS buster |
[production] |
17:58 |
<bblack@cumin1001> |
END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host install6001.wikimedia.org |
[production] |
17:57 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . |
[production] |
17:43 |
<bblack@cumin1001> |
START - Cookbook sre.ganeti.makevm for new host install6001.wikimedia.org |
[production] |
17:24 |
<milimetric@deploy1002> |
Finished deploy [analytics/refinery@0778d1e] (thin): Proper fix for mediawiki_skin_diff [THIN] (duration: 00m 06s) |
[production] |
17:24 |
<milimetric@deploy1002> |
Started deploy [analytics/refinery@0778d1e] (thin): Proper fix for mediawiki_skin_diff [THIN] |
[production] |
17:21 |
<bblack> |
bast6001: shutdown->start (again) |
[production] |
17:20 |
<milimetric@deploy1002> |
Finished deploy [analytics/refinery@0778d1e]: Proper fix for mediawiki_skin_diff (duration: 20m 45s) |
[production] |
17:07 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . |
[production] |
16:59 |
<milimetric@deploy1002> |
Started deploy [analytics/refinery@0778d1e]: Proper fix for mediawiki_skin_diff |
[production] |
16:59 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . |
[production] |
16:56 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . |
[production] |
16:53 |
<bblack> |
bast6001: shutdown->start |
[production] |
16:44 |
<bblack> |
ganeti6003 - rebooting |
[production] |
16:39 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . |
[production] |
16:33 |
<godog> |
remove /var/log/swift/server.log.1 from thanos-be* - T297959 |
[production] |
16:29 |
<milimetric@deploy1002> |
Finished deploy [analytics/refinery@5c3bce1] (thin): Fix refine sanitize allowlist, remove mediawiki_skin_diff schema for now [THIN] (duration: 00m 07s) |
[production] |
16:29 |
<milimetric@deploy1002> |
Started deploy [analytics/refinery@5c3bce1] (thin): Fix refine sanitize allowlist, remove mediawiki_skin_diff schema for now [THIN] |
[production] |
16:28 |
<bblack> |
reboot bast6001 (downtimed) |
[production] |
16:26 |
<milimetric@deploy1002> |
Finished deploy [analytics/refinery@5c3bce1]: Fix refine sanitize allowlist, remove mediawiki_skin_diff schema for now (duration: 69m 48s) |
[production] |
16:02 |
<godog> |
root@thanos-be2004:/srv/log/swift# rm server.log.1 - T297959 |
[production] |
15:35 |
<elukey@deploy1002> |
helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . |
[production] |
15:35 |
<elukey@deploy1002> |
helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-editquality' for release 'main' . |
[production] |
15:35 |
<elukey@deploy1002> |
helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-draftquality' for release 'main' . |
[production] |
15:34 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-draftquality' for release 'main' . |
[production] |
15:34 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-editquality' for release 'main' . |
[production] |
15:33 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . |
[production] |
15:16 |
<milimetric@deploy1002> |
Started deploy [analytics/refinery@5c3bce1]: Fix refine sanitize allowlist, remove mediawiki_skin_diff schema for now |
[production] |
14:52 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kafka-main2001.codfw.wmnet with OS buster |
[production] |
14:35 |
<milimetric@deploy1002> |
Finished deploy [analytics/refinery@e9f04c3] (hadoop-test): Fix sanitize allowlist problem [TEST] (duration: 69m 41s) |
[production] |
14:22 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.reimage for host kafka-main2001.codfw.wmnet with OS buster |
[production] |
13:25 |
<milimetric@deploy1002> |
Started deploy [analytics/refinery@e9f04c3] (hadoop-test): Fix sanitize allowlist problem [TEST] |
[production] |
13:20 |
<milimetric@deploy1002> |
Finished deploy [analytics/refinery@e9f04c3] (thin): Fix sanitize allowlist problem [THIN] (duration: 00m 07s) |
[production] |
13:20 |
<milimetric@deploy1002> |
Started deploy [analytics/refinery@e9f04c3] (thin): Fix sanitize allowlist problem [THIN] |
[production] |