2019-12-12
ยง
|
14:08 |
<marostegui> |
Upgrade db2085 and db2086 |
[production] |
14:02 |
<jbond42> |
merge puppet-merge refactor |
[production] |
13:38 |
<hashar> |
contint1001 / contint2001 : upgraded Zuul to 2.5.1-wmf11 # T203846 |
[production] |
13:33 |
<hashar> |
Remove label contintLabsSlave from integration-slave-jessie-1002 and integration-slave-jessie-1004 # T225031 |
[releng] |
13:03 |
<Reedy> |
restart to fix double posting to irc |
[tools.wikibugs] |
12:59 |
<elukey> |
roll restart hadoop workers to pick up the new settings (removed prefer ipv4 false after T240255) |
[analytics] |
12:58 |
<elukey@cumin1001> |
START - Cookbook sre.hadoop.roll-restart-workers |
[production] |
12:40 |
<elukey> |
enable timers on an-coord1001 after maintenance |
[analytics] |
12:39 |
<elukey> |
restart hive and oozie on an-coord1001 to pick up ipv6 settings |
[analytics] |
12:39 |
<Urbanecm> |
EU SWAT done |
[production] |
12:38 |
<urbanecm@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: 07652a6: Add 2020: Wikimania namespace (T240339) (duration: 01m 02s) |
[production] |
12:37 |
<moritzm> |
installing NSS security updates on buster |
[production] |
12:34 |
<urbanecm@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: 1c58f09: Enable SandboxLink extension on hywwiki (T239387) (duration: 01m 03s) |
[production] |
11:49 |
<moritzm> |
removing puppetdb2001 from Ganeti |
[production] |
11:46 |
<jmm@cumin2001> |
END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) |
[production] |
11:45 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.decommission |
[production] |
11:41 |
<hashar> |
Removing zuul package from Jessie CI instances # T240551 |
[production] |
11:20 |
<arturo> |
rolling reboot for all grid & k8s worker nodes due to NFS staleness |
[tools] |
11:17 |
<addshore@deploy1001> |
Synchronized php-1.35.0-wmf.10/extensions/Wikibase: BACKPORTS: wikibase tainted refs https://gerrit.wikimedia.org/r/#/q/topic:backports-wd-tainted-1 (duration: 01m 08s) |
[production] |
11:16 |
<hashar> |
Manually rebuilding last build of https://integration.wikimedia.org/ci/job/scap-beta-deb/ to generate a scap package without relying on zuul-cloner | https://gerrit.wikimedia.org/r/556643 | T240551 |
[releng] |
11:14 |
<elukey> |
stop timers on an-coord1001 as prep step for hive/oozie restart |
[analytics] |
09:46 |
<moritzm> |
upgrading recently reimaged stretch hosts back to puppet 5 / facter 3 T239832 |
[production] |
09:37 |
<marostegui> |
Retroactive: deploy schema change on db1102:3314 |
[production] |
09:22 |
<arturo> |
reboot tools-sgeexec-0911 to try fixing weird NFS state |
[tools] |
08:46 |
<arturo> |
doing `run-puppet-agent` in all VMs to see state of NFS |
[tools] |
08:40 |
<eileen> |
process-control config revision is 4d25b656e2 |
[production] |
08:34 |
<godog> |
cleanup puppetmaster1001:/run/confd-template |
[production] |
08:34 |
<arturo> |
reboot tools-worker-1033/1034 and tools-sgebastion-08 to try to correct NFS mount issues |
[tools] |
08:22 |
<wm-bot> |
<jeanfred> Moved logs/crontabl.log following emails from Cron Daemon: /bin/sh: 1: cannot create /data/project/wikiloves/logs/crontab.log: Stale file handle |
[tools.wikiloves] |
07:28 |
<eileen> |
process-control config revision is ad8bec977d |
[production] |
07:10 |
<marostegui> |
Upgrade db1137 |
[production] |
07:09 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Depool db1137 for upgrade (duration: 01m 03s) |
[production] |
06:47 |
<marostegui> |
Upgrade db1117 |
[production] |
06:11 |
<arturo> |
schedule 4h downtime for labstores |
[admin] |
06:11 |
<aborrero@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
06:11 |
<aborrero@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
05:58 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) |
[production] |
05:58 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
05:57 |
<aborrero@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
05:57 |
<aborrero@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
05:57 |
<aborrero@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
05:57 |
<aborrero@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
05:57 |
<arturo> |
schedule 4h downtime for cloudvirts and other openstack components due to upgrade ops |
[admin] |
05:56 |
<aborrero@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
05:56 |
<aborrero@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
05:56 |
<aborrero@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
05:56 |
<aborrero@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
05:47 |
<marostegui> |
Deploy schema change on db1102:3314 |
[production] |
05:47 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repool db1097:3314 after schema change T233135', diff saved to https://phabricator.wikimedia.org/P9861 and previous config saved to /var/cache/conftool/dbconfig/20191212-054708-marostegui.json |
[production] |
03:25 |
<ejegg> |
updated fundraising internal dashboard from 3917f7d9dc to c1ded3c473 |
[production] |