3751-3800 of 10000 results (55ms)
2022-01-24 §
06:05 <marostegui@cumin1001> START - Cookbook sre.hosts.reimage for host es1022.eqiad.wmnet with OS bullseye [production]
06:04 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1104', diff saved to https://phabricator.wikimedia.org/P18981 and previous config saved to /var/cache/conftool/dbconfig/20220124-060431-marostegui.json [production]
06:02 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool es1022 T299123', diff saved to https://phabricator.wikimedia.org/P18980 and previous config saved to /var/cache/conftool/dbconfig/20220124-060248-marostegui.json [production]
05:52 <marostegui@cumin1001> START - Cookbook sre.hosts.reimage for host es1029.eqiad.wmnet with OS bullseye [production]
05:49 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1104 (T285149)', diff saved to https://phabricator.wikimedia.org/P18979 and previous config saved to /var/cache/conftool/dbconfig/20220124-054926-marostegui.json [production]
05:43 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool es1029 for reimage T299741', diff saved to https://phabricator.wikimedia.org/P18978 and previous config saved to /var/cache/conftool/dbconfig/20220124-054349-marostegui.json [production]
05:42 <marostegui@cumin1001> dbctl commit (dc=all): 'Depooling db1104 (T285149)', diff saved to https://phabricator.wikimedia.org/P18977 and previous config saved to /var/cache/conftool/dbconfig/20220124-054218-marostegui.json [production]
05:42 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1104.eqiad.wmnet with reason: Maintenance [production]
05:42 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1104.eqiad.wmnet with reason: Maintenance [production]
05:42 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1116.eqiad.wmnet with reason: Maintenance [production]
05:42 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1116.eqiad.wmnet with reason: Maintenance [production]
05:42 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1171.eqiad.wmnet with reason: Maintenance [production]
05:42 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1171.eqiad.wmnet with reason: Maintenance [production]
2022-01-23 §
22:02 <ebysans@deploy1002> Finished deploy [airflow-dags/analytics-test@37937f6]: (no justification provided) (duration: 00m 08s) [production]
22:02 <ebysans@deploy1002> Started deploy [airflow-dags/analytics-test@37937f6]: (no justification provided) [production]
21:27 <ebysans@deploy1002> Finished deploy [airflow-dags/analytics-test@fa62e75]: (no justification provided) (duration: 00m 09s) [production]
21:26 <ebysans@deploy1002> Started deploy [airflow-dags/analytics-test@fa62e75]: (no justification provided) [production]
14:50 <wm-bot> <bd808> Update to php7.4 runtime; downgrade elasticsearch client to ^6.2.0 [tools.sal]
13:58 <taavi> restarted webservice [tools.sal]
10:12 <taavi> revert quota changes requested on T299585 [maps]
2022-01-22 §
22:38 <jhathaway@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mx1001.wikimedia.org with reason: kernel testing [production]
22:38 <jhathaway@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mx1001.wikimedia.org with reason: kernel testing [production]
18:08 <taavi> add maps proxy names requested in T299775 [project-proxy]
17:11 <wm-bot> <lucaswerkmeister> deployed b1cc42ef84 (Odia nouns) [tools.lexeme-forms]
16:52 <wm-bot> <lucaswerkmeister> deployed b62723fb6f (update Odia adverbs) [tools.lexeme-forms]
14:51 <jhathaway@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on mx1001.wikimedia.org with reason: kernel testing [production]
14:51 <jhathaway@cumin1001> START - Cookbook sre.hosts.downtime for 12:00:00 on mx1001.wikimedia.org with reason: kernel testing [production]
13:40 <taavi> apply T299827 on deployment-prep centralauth database [releng]
11:44 <taavi> restart varnish-frontend.service on deployment-cache-upload06 to clear puppet agent failure alerts [releng]
11:32 <taavi> added project-proxy VMs to prometheus targets [metricsinfra]
11:16 <taavi> add wma.wmcloud.org and *.wma.wmcloud.org to wma certificate SANs T299775 [project-proxy]
08:36 <elukey> `apt-get clean` on an-test-coord1001 to free some space [analytics]
08:35 <elukey> `apt-get clean` on an-test-coord1001 to free some space [production]
08:25 <elukey> remove the `--debug=true` etcd daemon arg from ml-etcd2002 (only node having it, probably a manual test done in the past) and cleaned up spammy etcd logs to free space [production]
01:30 <jhathaway@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on mx1001.wikimedia.org with reason: kernel testing [production]
01:30 <jhathaway@cumin1001> START - Cookbook sre.hosts.downtime for 12:00:00 on mx1001.wikimedia.org with reason: kernel testing [production]
01:08 <bd808> Update demo server to efe2dbe [toolhub]
00:27 <dzahn@cumin1001> conftool action : set/pooled=true; selector: dnsdisc=miscweb [production]
2022-01-21 §
22:23 <jhathaway@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on mx1001.wikimedia.org with reason: kernel testing [production]
22:23 <jhathaway@cumin1001> START - Cookbook sre.hosts.downtime for 4:00:00 on mx1001.wikimedia.org with reason: kernel testing [production]
22:11 <mutante> - created new instance gitlab-prod-1001 T297411 [devtools]
22:11 <mutante> - created new instance gitlab-prod-1001T297411 [devtools]
21:57 <mutante> - deleted instances "doc" and "doc1002" to make room for gitlab instance T299561 - T297411 [devtools]
21:43 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: sync on pinkunicorn [production]
21:42 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply on pinkunicorn [production]
21:42 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: sync on pinkunicorn [production]
21:40 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply on pinkunicorn [production]
21:38 <brennen@deploy1002> Synchronized php-1.38.0-wmf.18/extensions/VisualEditor/modules/ve-mw: Backport: [[gerrit:756066|Revert "Re-duplicate deduplicated TemplateStyles" (T287675 T299251 T299767)]] (duration: 00m 49s) [production]
21:21 <topranks> Running homer against cr1-eqiad and cr2-eqiad to remove entries on analytics-in4/6 filters that refer to decommissioned deb mirror host sodium. [production]
19:14 <ayounsi@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]