701-750 of 10000 results (20ms)
2021-02-23 ยง
21:03 <otto@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'production' . [production]
21:03 <otto@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'canary' . [production]
21:00 <ebernhardson@deploy1001> Finished deploy [wikimedia/discovery/analytics@46a8ae1]: ores_bulk_ingest: namespace is not plural (duration: 01m 41s) [production]
21:00 <otto@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'canary' . [production]
20:59 <otto@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'production' . [production]
20:58 <ebernhardson@deploy1001> Started deploy [wikimedia/discovery/analytics@46a8ae1]: ores_bulk_ingest: namespace is not plural [production]
20:56 <dzahn@cumin1001> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host gitlab1002.eqiad.wmnet [production]
20:55 <wm-bot> <root> Stopped webservice. Pod in CrashLoopBackoff and restarting did no seem to help. [tools.wikiloop]
20:52 <jhuneidi@deploy1001> Started scap: testwikis wikis to 1.36.0-wmf.32 refs T274936 [production]
20:47 <wm-bot> <root> Deleted deployment.apps/lilywhite.bot which was spawning pods into CrashLoopBackoff due to missing /data/project/lekhaki/tool-lekhaki/main.js entrypoint file. [tools.lekhaki]
20:44 <ppchelko@deploy1001> Synchronized wmf-config/CommonSettings.php: No-op: math enable talking to mathoid directly in labs, T274436 (duration: 00m 57s) [production]
20:44 <wm-bot> <root> Deleted "test" deployment and related pod stuck in CrashLoopBackoff. [tools.adhs-wde]
20:38 <otto@deploy1001> Synchronized wmf-config/InitialiseSettings.php: Fix typo in visualeditortemplatedialoguse - T275015 (duration: 01m 01s) [production]
20:36 <andrewbogott> adding r/o access to the eqiad1-glance-images ceph pool for the client.eqiad1-compute for T275430 [admin]
20:13 <razzi@cumin1001> END (PASS) - Cookbook sre.kafka.reboot-workers (exit_code=0) for Kafka jumbo cluster: Reboot kafka nodes - razzi@cumin1001 [production]
20:04 <dzahn@cumin1001> START - Cookbook sre.ganeti.makevm for new host gitlab1002.eqiad.wmnet [production]
19:54 <ryankemper@cumin1001> END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) [production]
19:54 <ryankemper@cumin1001> START - Cookbook sre.wdqs.data-transfer [production]
19:49 <ryankemper@cumin1001> END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) [production]
19:49 <ryankemper@cumin1001> START - Cookbook sre.wdqs.data-transfer [production]
19:43 <ryankemper> [WDQS Deploy] Disk space low on `wdqs1009`, rolling back so that can be addressed [production]
19:43 <ryankemper@deploy1001> Finished deploy [wdqs/wdqs@b5fc9d5]: 0.3.64 (duration: 08m 01s) [production]
19:38 <otto@deploy1001> Synchronized wmf-config/InitialiseSettings.php: Declare WMDE Technical Wishes streams and migrate to EventGate on testwiki (duration: 02m 41s) [production]
19:36 <ryankemper> [WDQS Deploy] Tests passing following deploy of `0.3.64` on canary `wdqs1003`; proceeding to rest of fleet [production]
19:35 <ryankemper@deploy1001> Started deploy [wdqs/wdqs@b5fc9d5]: 0.3.64 [production]
19:35 <ryankemper> [WDQS Deploy] Gearing up for deploy of wdqs `0.3.64`. Pre-deploy tests passing on canary `wdqs1003` [production]
19:33 <dzahn@cumin1001> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host gitlab1001.eqiad.wmnet [production]
19:32 <legoktm> re-enabling puppet on registry* [production]
19:31 <elukey> roll out new uid/gid for mapred/druid/analytics/yarn/hdfs for all buster nodes (no op for stretch) [analytics]
19:30 <legoktm> pushed new wikimedia-buster image [production]
19:16 <ebernhardson@deploy1001> Finished deploy [wikimedia/discovery/analytics@3969cae]: new dag ores_bulk_ingest (duration: 01m 32s) [production]
19:15 <ebernhardson@deploy1001> Started deploy [wikimedia/discovery/analytics@3969cae]: new dag ores_bulk_ingest [production]
19:10 <dduvall@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'blubberoid' for release 'production' . [production]
19:08 <dduvall@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'blubberoid' for release 'production' . [production]
19:08 <legoktm> disabling puppet on registry* except registry2001 while rolling out https://gerrit.wikimedia.org/r/664683 [production]
19:04 <dduvall@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'blubberoid' for release 'staging' . [production]
18:41 <dzahn@cumin1001> START - Cookbook sre.ganeti.makevm for new host gitlab1001.eqiad.wmnet [production]
18:20 <James_F> Zuul: [mediawiki/services/function-schemata] Add generic pipeline CI [releng]
18:17 <ebernhardson@deploy1001> Finished deploy [wikimedia/discovery/analytics@c2190da]: environment and venv builder for ores_bulk_ingest (duration: 01m 40s) [production]
18:15 <ebernhardson@deploy1001> Started deploy [wikimedia/discovery/analytics@c2190da]: environment and venv builder for ores_bulk_ingest [production]
18:15 <ebernhardson@deploy1001> deploy aborted: environment and venv builder for ores_bulk_ingest (duration: 00m 16s) [production]
18:15 <ebernhardson@deploy1001> Started deploy [wikimedia/discovery/analytics@c2190da]: environment and venv builder for ores_bulk_ingest [production]
18:12 <pt1979@cumin2001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
18:07 <pt1979@cumin2001> START - Cookbook sre.dns.netbox [production]
17:47 <elukey> change uid/gid for yarn/mapred/analytics/hdfs/druid on stat100x, an-presto100x [analytics]
17:29 <hnowlan@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'production' . [production]
17:29 <hnowlan@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'staging' . [production]
17:22 <longma> wmf/1.36.0-wmf.32 was branched at 03c382f199318f4ecd6a92c0acc280b6543adcc3 for T274936 [production]
17:21 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc1034.eqiad.wmnet [production]
17:18 <hnowlan@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'staging' . [production]