2021-02-23 §
19:35 <ryankemper> [WDQS Deploy] Gearing up for deploy of wdqs `0.3.64`. Pre-deploy tests passing on canary `wdqs1003` [production]
19:33 <dzahn@cumin1001> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host gitlab1001.eqiad.wmnet [production]
19:32 <legoktm> re-enabling puppet on registry* [production]
19:30 <legoktm> pushed new wikimedia-buster image [production]
19:16 <ebernhardson@deploy1001> Finished deploy [wikimedia/discovery/analytics@3969cae]: new dag ores_bulk_ingest (duration: 01m 32s) [production]
19:15 <ebernhardson@deploy1001> Started deploy [wikimedia/discovery/analytics@3969cae]: new dag ores_bulk_ingest [production]
19:10 <dduvall@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'blubberoid' for release 'production' . [production]
19:08 <dduvall@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'blubberoid' for release 'production' . [production]
19:08 <legoktm> disabling puppet on registry* except registry2001 while rolling out https://gerrit.wikimedia.org/r/664683 [production]
19:04 <dduvall@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'blubberoid' for release 'staging' . [production]
18:41 <dzahn@cumin1001> START - Cookbook sre.ganeti.makevm for new host gitlab1001.eqiad.wmnet [production]
18:17 <ebernhardson@deploy1001> Finished deploy [wikimedia/discovery/analytics@c2190da]: environment and venv builder for ores_bulk_ingest (duration: 01m 40s) [production]
18:15 <ebernhardson@deploy1001> Started deploy [wikimedia/discovery/analytics@c2190da]: environment and venv builder for ores_bulk_ingest [production]
18:15 <ebernhardson@deploy1001> deploy aborted: environment and venv builder for ores_bulk_ingest (duration: 00m 16s) [production]
18:15 <ebernhardson@deploy1001> Started deploy [wikimedia/discovery/analytics@c2190da]: environment and venv builder for ores_bulk_ingest [production]
18:12 <pt1979@cumin2001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
18:07 <pt1979@cumin2001> START - Cookbook sre.dns.netbox [production]
17:29 <hnowlan@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'production' . [production]
17:29 <hnowlan@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'staging' . [production]
17:22 <longma> wmf/1.36.0-wmf.32 was branched at 03c382f199318f4ecd6a92c0acc280b6543adcc3 for T274936 [production]
17:21 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc1034.eqiad.wmnet [production]
17:18 <hnowlan@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'staging' . [production]
17:18 <hnowlan@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'production' . [production]
17:17 <jiji@cumin1001> START - Cookbook sre.hosts.reboot-single for host mc1034.eqiad.wmnet [production]
17:16 <effie> upgrade memcached on mc1034, mc2034 - T270315 [production]
17:01 <hnowlan@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'production' . [production]
17:01 <hnowlan@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'staging' . [production]
16:59 <hnowlan@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'production' . [production]
16:59 <hnowlan@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'staging' . [production]
16:55 <hnowlan@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'changeprop' for release 'staging' . [production]
16:55 <hnowlan@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'changeprop' for release 'production' . [production]
16:48 <mholloway-shell@deploy1001> Synchronized wmf-config/InitialiseSettings.php: WikimediaEvents: Enable session tick instrument on all wikis (T274172) (duration: 00m 58s) [production]
16:46 <hnowlan@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'changeprop' for release 'staging' . [production]
16:46 <hnowlan@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'changeprop' for release 'production' . [production]
16:42 <hnowlan@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'changeprop' for release 'staging' . [production]
16:41 <hnowlan@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'changeprop' for release 'production' . [production]
16:25 <razzi@cumin1001> START - Cookbook sre.kafka.reboot-workers for Kafka jumbo cluster: Reboot kafka nodes - razzi@cumin1001 [production]
16:02 <otto@deploy1001> Synchronized wmf-config/InitialiseSettings.php: Declare TranslationRecommendation event streams - T271163 (duration: 00m 58s) [production]
15:52 <jynus> previous message should say 15:38 T267338 [production]
15:51 <jynus> started swift codfw backup stress test at 14:38 with 10 threads T267338 [production]
15:44 <elukey> reboot an-launcher1002 for kernel updates [production]
15:35 <moritzm> restarting PHP/Apache on mw canaries for gnutls update [production]
15:23 <moritzm> installing gnutls28 bugfix updates from Buster 10.8 point release [production]
15:17 <elukey> deploy a new term to the analytics-in4 filter on cr1/cr2-eqiad (see https://gerrit.wikimedia.org/r/c/operations/homer/public/+/665814) [production]
14:55 <otto@deploy1001> Synchronized wmf-config/InitialiseSettings.php: Remove wgEventLoggingSchemas overrides for QuickSurvey and NavigationTiming (duration: 00m 56s) [production]
14:51 <elukey> drop /srv/backup-1007 on stat1008 to free space [production]
14:41 <otto@deploy1001> Synchronized wmf-config/InitialiseSettings.php: Migrate SpecialMuteSubmit to EventGate on all wikis - T268517 (duration: 00m 58s) [production]
14:40 <otto@deploy1001> sync-file aborted: Migrate SpecialMuteSubmit to EventGate on all wikis - T268517 (duration: 00m 05s) [production]
14:17 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-test-worker1002.eqiad.wmnet with reason: REIMAGE [production]
14:15 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on an-test-worker1002.eqiad.wmnet with reason: REIMAGE [production]