2021-01-04
§
|
18:56 |
<ayounsi@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
18:50 |
<ayounsi@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
18:21 |
<bstorm> |
ran 'sudo systemctl stop getty@ttyS1.service && sudo systemctl disable getty@ttyS1.service' on tools-k8s-etcd-5 I have no idea why that keeps coming back. |
[tools] |
18:14 |
<shdubsh> |
restart elasticsearch on logstash1012 - oom |
[production] |
17:33 |
<thcipriani> |
fixed beta-scap-eqiad by removing local mwdeploy user/group using vipw/vigr and chown -R mwdeploy:mwdeploy /srv/mediawiki for deployment-prep hosts |
[releng] |
16:35 |
<jayme> |
import kubernetes 1.16.15-4 to component/kubernetes-future buster-wikimedia and stretch-wikimedia |
[production] |
16:32 |
<pt1979@cumin2001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
16:25 |
<pt1979@cumin2001> |
START - Cookbook sre.dns.netbox |
[production] |
15:50 |
<bstorm> |
silencing all alerts from deployment-prep for 60 more days |
[metricsinfra] |
15:33 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db2140 ', diff saved to https://phabricator.wikimedia.org/P13644 and previous config saved to /var/cache/conftool/dbconfig/20210104-153339-marostegui.json |
[production] |
15:33 |
<marostegui> |
Depool db2140 T271084 |
[production] |
14:50 |
<ema> |
cp3058: ats-backend-restart T265625 |
[production] |
14:34 |
<marostegui> |
Upgrade and restart mysql on es2020 and es2024 - T271106 |
[production] |
14:31 |
<moritzm> |
installing openssl updates on buster-based DB hosts |
[production] |
14:24 |
<elukey> |
deprecate the analytics-users group |
[analytics] |
14:15 |
<moritzm> |
installing libdatetime-timezone-perl updates |
[production] |
14:13 |
<marostegui> |
Restart mysql on pc2009 |
[production] |
12:52 |
<ema> |
deployment-cache-text06: try out varnish 6.0.1-1wm1 T264398 |
[production] |
12:08 |
<ladsgroup@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: [[gerrit:650988|Grant several OATHAuth-related permissions to wmf-supportsafety at Meta (T180896)]] (duration: 00m 56s) |
[production] |
11:59 |
<volans> |
uploaded python3-wmflib_0.0.6 to apt.wikimedia.org buster-wikimedia |
[production] |
11:36 |
<jdrewniak@deploy1001> |
Synchronized portals: Wikimedia Portals Update: [[gerrit:654201| Bumping portals to master (T128546)]] (duration: 00m 56s) |
[production] |
11:35 |
<jdrewniak@deploy1001> |
Synchronized portals/wikipedia.org/assets: Wikimedia Portals Update: [[gerrit:654201| Bumping portals to master (T128546)]] (duration: 01m 14s) |
[production] |
10:50 |
<XioNoX> |
push pfw policies - T269958 |
[production] |
10:19 |
<_joe_> |
uploading docker-report 0.0.10 to debian buster |
[production] |
09:48 |
<marostegui> |
Deploy schema change on s6 codfw master (lag will appear on codfw) - T270187 |
[production] |
09:02 |
<XioNoX> |
bounce asw-d-codfw:xe-7/0/8 - T271041 |
[production] |
2021-01-03
§
|
16:17 |
<arturo> |
merged change to TLS cert used by slapd/openldap servers https://gerrit.wikimedia.org/r/c/operations/puppet/+/653871 |
[production] |
15:49 |
<vgutierrez> |
reenable puppet on ldap-replica2004.wm.o |
[production] |
15:30 |
<andrewbogott> |
disabling puppet fleet-wide to avert potential disaster from acme-chief cert rotation T271063 |
[production] |
14:42 |
<andrewbogott> |
restarting slapd on serpens and seaborgium |
[production] |
14:11 |
<milimetric> |
reset-failed refinery-sqoop-whole-mediawiki.service |
[analytics] |
14:10 |
<milimetric> |
manual sqoop finished, logs on an-launcher1002 at /var/log/refinery/sqoop-mediawiki.log and /var/log/refinery/sqoop-mediawiki-production.log |
[analytics] |
11:57 |
<wm-bot> |
<lucaswerkmeister> deployed db1e890252 (grab cursor for draggable links) |
[tools.lexeme-forms] |
11:38 |
<elukey> |
powercycle an-worker1114 (kernel errors in the serial console) |
[production] |
09:07 |
<elukey> |
reboot ms-be2050 as attempt to recover/fix its broken networking state (started from Dec 30th) - T271041 |
[production] |
07:06 |
<dcaro> |
Got a network hiccup on cloudnet1004, keeping track here T271058 |
[admin] |
2021-01-01
§
|
18:26 |
<James_F> |
zuul: Try in a second way to only run mwext coverage jobs on master T270976 |
[releng] |
18:13 |
<James_F> |
zuul: [mediawiki/extensions/AbuseFilter] Make sqlite tests voting T251967 |
[releng] |
17:31 |
<Majavah> |
deploy 9876262 to fix bug with ip ranges |
[tools.majavah-bot] |
14:54 |
<milimetric> |
deployed refinery hotfix for sqoop problem, after testing on three small wikis |
[analytics] |
14:49 |
<milimetric@deploy1001> |
Finished deploy [analytics/refinery@f9281dd] (thin): [SAFE, IGNORE] Simple hotfix for a python bug, analytics refinery only, not urgent (duration: 00m 07s) |
[production] |
14:49 |
<milimetric@deploy1001> |
Started deploy [analytics/refinery@f9281dd] (thin): [SAFE, IGNORE] Simple hotfix for a python bug, analytics refinery only, not urgent |
[production] |
14:48 |
<milimetric@deploy1001> |
Finished deploy [analytics/refinery@f9281dd]: [SAFE, IGNORE] Simple hotfix for a python bug, analytics refinery only, not urgent (duration: 10m 00s) |
[production] |
14:38 |
<milimetric@deploy1001> |
Started deploy [analytics/refinery@f9281dd]: [SAFE, IGNORE] Simple hotfix for a python bug, analytics refinery only, not urgent |
[production] |
08:52 |
<legoktm@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Switch fiwiki to their 500k temporary logo! (T270974) (duration: 00m 55s) |
[production] |
08:46 |
<legoktm@deploy1001> |
Synchronized static/images/project-logos/fiwiki-500k-2x.png: Add fiwiki 500k temporary logos (3/3) (duration: 00m 55s) |
[production] |
08:45 |
<legoktm@deploy1001> |
Synchronized static/images/project-logos/fiwiki-500k-1.5x.png: Add fiwiki 500k temporary logos (2/3) (duration: 00m 54s) |
[production] |