2020-12-03
§
|
15:06 |
<pt1979@cumin2001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
15:06 |
<pt1979@cumin2001> |
START - Cookbook sre.hosts.downtime |
[production] |
14:59 |
<jbond42> |
disable puppet fleet-wide to roll out puppetdb node-ttl change |
[production] |
14:46 |
<effie> |
rolling depool and pool of parsoid servers |
[production] |
14:34 |
<elukey> |
stop zookeeper and etcd on conf1005 as prep-step before rack move |
[production] |
14:00 |
<andrew@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
13:58 |
<andrew@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
13:47 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Fully repool db1089', diff saved to https://phabricator.wikimedia.org/P13523 and previous config saved to /var/cache/conftool/dbconfig/20201203-134724-marostegui.json |
[production] |
13:39 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Slowly repool db1089', diff saved to https://phabricator.wikimedia.org/P13522 and previous config saved to /var/cache/conftool/dbconfig/20201203-133953-marostegui.json |
[production] |
13:30 |
<effie> |
puppet enabled on jobrunners |
[production] |
13:29 |
<hashar> |
Upgraded Jenkins on releases1002 # T269352 |
[production] |
13:24 |
<hashar> |
Upgraded Jenkins on releases2002 (spare server) # T269352 |
[production] |
13:24 |
<andrewbogott> |
removing all osds on cloudcephosd1008 for rebuild, T268746 |
[admin] |
13:09 |
<moritzm> |
uploaded jenkins 2.263.1 to apt.wikimedia.org component/ci |
[production] |
13:01 |
<Operator873> |
restarted CVNBot6-10 & 15 |
[cvn] |
13:00 |
<elukey> |
move db1108 to C3 - T267065 |
[production] |
12:37 |
<moritzm> |
installing jupyter-notebook security updates on Stretch |
[production] |
12:17 |
<elukey> |
move aqs1006 to rack D6 - T267065 |
[production] |
12:10 |
<effie> |
disable puppet on jobrunners and parsoid - T244340 |
[production] |
12:09 |
<Lucas_WMDE> |
EU backport+config window done |
[production] |
12:07 |
<lucaswerkmeister-wmde@deploy1001> |
Synchronized wmf-config/Wikibase.php: Config: [[gerrit:645059|Enable implicit description usage (T267745)]] (duration: 01m 12s) |
[production] |
11:57 |
<Urbanecm> |
Start of mwscript extensions/AbuseFilter/maintenance/updateVarDumps.php --wiki=$wiki --print-orphaned-records-to=/tmp/urbanecm/$wiki-orphaned.log --progress-markers > $wiki.log in a tmux at mwmaint1002 (wiki=enwiki; T246539) |
[production] |
11:52 |
<Urbanecm> |
Start of mwscript extensions/AbuseFilter/maintenance/updateVarDumps.php --wiki=$wiki --print-orphaned-records-to=/tmp/urbanecm/$wiki-orphaned.log --progress-markers > $wiki.log in a tmux at mwmaint1002 (wiki=eswiki; T246539) |
[production] |
11:46 |
<elukey> |
move druid1001 to rack A1 - T267065 |
[production] |
11:31 |
<volans@cumin2001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on cumin2001.codfw.wmnet with reason: volans's test |
[production] |
11:31 |
<volans@cumin2001> |
START - Cookbook sre.hosts.downtime for 0:10:00 on cumin2001.codfw.wmnet with reason: volans's test |
[production] |
09:40 |
<ema> |
A:cp start rolling varnish upgrade to 6.0.7-1wm1 T268736 |
[production] |
09:35 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
09:35 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.downtime |
[production] |
09:34 |
<moritzm> |
gnt-instance reboot ldap-replica2003 to validate new qemu |
[production] |
09:18 |
<arturo> |
restarted kubelet systemd service on tools-k8s-worker-38. Node was NotReady, complaining about 'use of closed network connection' |
[tools] |
09:17 |
<arturo> |
webservice restart |
[tools.openstack-browser] |
09:16 |
<arturo> |
restarted kubelet systemd service on tools-k8s-worker-59. Node was NotReady, complaining about 'use of closed network connection' |
[tools] |
09:14 |
<moritzm> |
installing qemu security updates on Stretch |
[production] |
07:09 |
<elukey> |
manual reset-failed refinery-sqoop-whole-mediawiki.service on an-launcher1002 (job launched manually) |
[analytics] |
06:06 |
<marostegui> |
Create sockpuppet database on m2 T268505 |
[production] |
04:13 |
<ejegg> |
updated fundraising CiviCRM from a2979cbba1 to 913ccdfd2b |
[production] |
03:26 |
<andrew@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
03:24 |
<andrew@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
02:55 |
<andrewbogott> |
removing all osds on cloudcephosd1009 for rebuild, T268746 |
[admin] |
01:54 |
<eileen> |
process-control config revision is f863b32627 |
[production] |
01:21 |
<mutante> |
lists1001 - remove "delete_held_messages" cronjob from root crontab - replaced by systemd timer - systemctl start delete_held_messages.service and confirmed it succeeded |
[production] |
2020-12-02
§
|
23:42 |
<reedy@deploy1001> |
Synchronized php-1.36.0-wmf.20/includes/debug/logger/monolog/LogstashFormatter.php: T269286 (duration: 01m 07s) |
[production] |
22:49 |
<andrew@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
22:47 |
<andrew@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
22:43 |
<twentyafterfour@deploy1001> |
Synchronized php-1.36.0-wmf.20/extensions/CategoryTree/: Deploying backport f6c2d74259b9 to wmf.20, bug: T269235 refs T263186 (duration: 01m 07s) |
[production] |
22:38 |
<twentyafterfour@deploy1001> |
Synchronized php-1.36.0-wmf.20/includes/parser/: Deploying backports for wmf.20 refs T263186 (duration: 01m 08s) |
[production] |
21:37 |
<joal> |
Manually create _SUCCESS flags for banner history monthly jobs to kick off (they'll be deleted by the purge tomorrow morning) |
[analytics] |
21:35 |
<wm-bot> |
<lucaswerkmeister> deployed e5291d5cda (more Esperanto translations) |
[tools.lexeme-forms] |
21:23 |
<otto@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'eventstreams' for release 'canary' . |
[production] |