2021-09-13 §
14:13 <jelto@cumin2002> END (PASS) - Cookbook sre.switchdc.services.01-switch-dc (exit_code=0) [production]
14:13 <legoktm> (cotd.) ternal, eventgate-main, wikifeeds, eventstreams-internal, eventgate-analytics-external: codfw => eqiad [production]
14:12 <jelto@cumin2002> Switching services echostore, termbox, cxserver, eventstreams, search, ores, mathoid, schema, push-notifications, thanos-swift, wdqs, sessionstore, restbase, wdqs-internal, apertium, eventgate-analytics, citoid, api-gateway, restbase-async, proton, linkrecommendation, thanos-query, shellbox, kartotherian, mobileapps, recommendation-api, zotero, similar-users, shellbox-constraints, eventgate-logging-ex [production]
14:12 <jelto@cumin2002> START - Cookbook sre.switchdc.services.01-switch-dc [production]
14:11 <jelto@cumin2002> END (PASS) - Cookbook sre.switchdc.services.00-reduce-ttl-and-sleep (exit_code=0) [production]
14:05 <jelto@cumin2002> START - Cookbook sre.switchdc.services.00-reduce-ttl-and-sleep [production]
14:03 <dzahn@cumin1001> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host durum3002.esams.wmnet [production]
13:51 <dzahn@cumin1001> START - Cookbook sre.ganeti.makevm for new host durum3002.esams.wmnet [production]
13:50 <dzahn@cumin1001> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host durum3001.esams.wmnet [production]
13:39 <dzahn@cumin1001> START - Cookbook sre.ganeti.makevm for new host durum3001.esams.wmnet [production]
13:36 <dzahn@cumin1001> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host durum2002.codfw.wmnet [production]
13:21 <dzahn@cumin1001> START - Cookbook sre.ganeti.makevm for new host durum2002.codfw.wmnet [production]
13:20 <dzahn@cumin1001> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host durum2001.codfw.wmnet [production]
13:08 <dzahn@cumin1001> START - Cookbook sre.ganeti.makevm for new host durum2001.codfw.wmnet [production]
12:09 <volans@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
12:03 <volans@cumin1001> START - Cookbook sre.dns.netbox [production]
11:32 <mwdebug-deploy@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
11:27 <mwdebug-deploy@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
11:26 <kostajh> European mid-day backport window deploys done [production]
11:24 <kharlan@deploy1002> Synchronized wmf-config: Config: [[gerrit:713553|WikimediaEvents: Remove UnderstandingFirstDay config]] (duration: 00m 59s) [production]
10:51 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts testvm2002.codfw.wmnet [production]
10:43 <jmm@cumin2002> START - Cookbook sre.hosts.decommission for hosts testvm2002.codfw.wmnet [production]
10:15 <volans@cumin1001> END (FAIL) - Cookbook sre.experimental.reimage (exit_code=93) for host mw1414.eqiad.wmnet [production]
09:33 <volans> restarting tcpircbot-logmsgbot on alert1001, not relaying messages [production]
09:18 <elukey> upgrade rsyslog* on ml-serve* nodes to 8.1901.0-1+wmf2 [production]
09:16 <godog> swift eqiad-prod: add weight to ms-be10[64-67] - T290546 [production]
09:11 <moritzm> reimaging sretest1002 [production]
09:11 <elukey> upload rsyslog* 8.1901.0-1+wmf2 to buster-wikimedia component/rsyslog-k8s - T277739 [production]
08:16 <godog> bump +100G prometheus/ops codfw [production]
2021-09-12 §
18:33 <vgutierrez> restart varnish-fe on cp3061, cp3063 and cp3065 [production]
18:29 <vgutierrez> restart varnish on cp3055 [production]
18:26 <vgutierrez> restart varnish on cp3057 [production]
04:53 <mwdebug-deploy@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
04:52 <mwdebug-deploy@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
2021-09-11 §
19:02 <urbanecm@deploy1002> Synchronized wmf-config/InitialiseSettings.php: 27814b8eaacb5ba2fee1b6167a36ea14356a1ecf: testwiki: Fully remove securepoll-related groups (T290808) (duration: 00m 57s) [production]
18:34 <urbanecm> [urbanecm@mwmaint2002 ~]$ mwscript emptyUserGroup.php --wiki=testwiki {electionadmin,electcomm} # T290808 [production]
18:31 <urbanecm@deploy1002> Synchronized wmf-config/InitialiseSettings.php: 908bbf35235ea4129795dfbf4c0e646440152e18: Revert "test: Add electcomm and electionadmin groups" (T290808) (duration: 00m 58s) [production]
2021-09-10 §
21:28 <legoktm@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'shellbox-syntaxhighlight' for release 'main' . [production]
21:27 <legoktm@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'shellbox-syntaxhighlight' for release 'main' . [production]
21:21 <legoktm@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'shellbox-syntaxhighlight' for release 'main' . [production]
20:46 <jhuneidi@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'blubberoid' for release 'production' . [production]
20:44 <jhuneidi@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'blubberoid' for release 'production' . [production]
20:42 <jhuneidi@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'blubberoid' for release 'staging' . [production]
18:34 <volans@cumin1001> END (FAIL) - Cookbook sre.experimental.reimage (exit_code=99) for host sretest1001.eqiad.wmnet [production]
18:08 <volans@cumin1001> START - Cookbook sre.experimental.reimage for host sretest1001.eqiad.wmnet [production]
17:16 <pt1979@cumin2002> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on puppetmaster2005.codfw.wmnet with reason: REIMAGE [production]
17:14 <pt1979@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on puppetmaster2005.codfw.wmnet with reason: REIMAGE [production]
16:42 <pt1979@cumin2002> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on puppetmaster2004.codfw.wmnet with reason: REIMAGE [production]
16:40 <pt1979@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on puppetmaster2004.codfw.wmnet with reason: REIMAGE [production]
16:14 <volans@cumin1001> END (FAIL) - Cookbook sre.experimental.reimage (exit_code=99) for host sretest1001.eqiad.wmnet [production]