951-1000 of 10000 results (38ms)
2021-12-20 §
09:35 <moritzm> switch kubetcd2006 to DRBD storage to allow eventual migration for reimage of ganeti2019 [production]
09:28 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on kubetcd2006.codfw.wmnet with reason: switch to drbd storage [production]
09:28 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 1:00:00 on kubetcd2006.codfw.wmnet with reason: switch to drbd storage [production]
09:14 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dbproxy2004.codfw.wmnet with OS bullseye [production]
09:13 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2015.codfw.wmnet [production]
09:06 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti2015.codfw.wmnet [production]
08:50 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ganeti2015.codfw.wmnet with OS buster [production]
08:41 <marostegui@cumin1001> START - Cookbook sre.hosts.reimage for host dbproxy2004.codfw.wmnet with OS bullseye [production]
08:40 <moritzm> updated bullseye installer images for 11.2 point release [production]
08:14 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host ganeti2015.codfw.wmnet with OS buster [production]
07:39 <marostegui@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host dbproxy2004.codfw.wmnet with OS bullseye [production]
07:12 <marostegui@cumin1001> START - Cookbook sre.hosts.reimage for host dbproxy2004.codfw.wmnet with OS bullseye [production]
07:08 <marostegui@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host dbproxy2004.codfw.wmnet with OS bullseye [production]
06:41 <marostegui@cumin1001> START - Cookbook sre.hosts.reimage for host dbproxy2004.codfw.wmnet with OS bullseye [production]
2021-12-19 §
17:10 <Amir1> restart apache2 on lists1001 (T293826) [production]
2021-12-18 §
13:57 <dcausse> restarting blazegraph on wdqs1013 (jvm stuck for 10hours) [production]
2021-12-17 §
23:14 <ryankemper> T297986 Beep boop testing 1 2 3 disregard me [production]
23:13 <dduvall@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'blubberoid' for release 'production' . [production]
23:13 <ryankemper> T297910 foobar testing 1 2 3 [production]
23:12 <dduvall@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'blubberoid' for release 'production' . [production]
23:08 <Reedy> Testing T297987 [production]
23:07 <dduvall@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'blubberoid' for release 'staging' . [production]
22:30 <bblack@cumin1001> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host prometheus6001.drmrs.wmnet [production]
21:28 <bblack@cumin1001> START - Cookbook sre.ganeti.makevm for new host prometheus6001.drmrs.wmnet [production]
21:21 <bblack@cumin1001> END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host prometheus6001.drmrs.wmnet [production]
21:17 <bblack@cumin1001> START - Cookbook sre.ganeti.makevm for new host prometheus6001.drmrs.wmnet [production]
21:08 <legoktm> repooling wtp1025 [production]
20:56 <mutante> puppetmaster - revoking and recreating TLS cert for miscweb one more time because "tendril-static" isn't "static-tendril" ;Pp [production]
20:40 <mwdebug-deploy@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
20:39 <mwdebug-deploy@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
20:34 <legoktm@deploy1002> Synchronized wmf-config/CommonSettings.php: Set $wgMaxImageArea = false; (T291014) (duration: 00m 59s) [production]
19:46 <mutante> adding dbtree.wikimedia.org and tendril.wikimedia.org to TLS cert for webserver-misc-apps.discovery.wmnet - recreating cert T297605 [production]
19:44 <ryankemper> T297910 `ryankemper@mwmaint1002:~$ sudo modify-ldap-group wmf` to add `bking` [production]
19:43 <ryankemper> T297910 `ryankemper@mwmaint1002:~$ sudo modify-ldap-group ops` to add `bking` [production]
19:39 <mutante> puppetmaster1001 - sudo puppet cert clean webserver-misc-apps.discovery.wmnet - Revoked certificate with serial 8502 [production]
19:26 <elukey@deploy1002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . [production]
19:15 <rzl> rzl@apt1001:~$ sudo -i reprepro -C main include buster-wikimedia /home/rzl/python3-imagecatalog/imagecatalog_0.0.2-1_amd64.changes [production]
19:06 <cmjohnson@cumin1001> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host ganeti1025.eqiad.wmnet with OS buster [production]
18:49 <cmjohnson@cumin1001> START - Cookbook sre.hosts.reimage for host ganeti1025.eqiad.wmnet with OS buster [production]
17:58 <bblack@cumin1001> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host install6001.wikimedia.org [production]
17:57 <elukey@deploy1002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . [production]
17:43 <bblack@cumin1001> START - Cookbook sre.ganeti.makevm for new host install6001.wikimedia.org [production]
17:24 <milimetric@deploy1002> Finished deploy [analytics/refinery@0778d1e] (thin): Proper fix for mediawiki_skin_diff [THIN] (duration: 00m 06s) [production]
17:24 <milimetric@deploy1002> Started deploy [analytics/refinery@0778d1e] (thin): Proper fix for mediawiki_skin_diff [THIN] [production]
17:21 <bblack> bast6001: shutdown->start (again) [production]
17:20 <milimetric@deploy1002> Finished deploy [analytics/refinery@0778d1e]: Proper fix for mediawiki_skin_diff (duration: 20m 45s) [production]
17:07 <elukey@deploy1002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . [production]
16:59 <milimetric@deploy1002> Started deploy [analytics/refinery@0778d1e]: Proper fix for mediawiki_skin_diff [production]
16:59 <elukey@deploy1002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . [production]
16:56 <elukey@deploy1002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . [production]