651-700 of 10000 results (105ms)
2024-10-31 ยง
14:05 <arnaudb@cumin1002> dbctl commit (dc=all): 'db1234 (re)pooling @ 1%: post T378267 reclone', diff saved to https://phabricator.wikimedia.org/P70754 and previous config saved to /var/cache/conftool/dbconfig/20241031-140459-arnaudb.json [production]
14:03 <arnaudb@cumin1002> dbctl commit (dc=all): 'db1232 (re)pooling @ 25%: post db1234.eqiad.wmnet clone', diff saved to https://phabricator.wikimedia.org/P70753 and previous config saved to /var/cache/conftool/dbconfig/20241031-140345-arnaudb.json [production]
13:50 <urbanecm@deploy2002> Finished scap sync-world: Backport for [[gerrit:1084306|tcywikisource: add logo (T378555)]] (duration: 08m 56s) [production]
13:46 <urbanecm@deploy2002> urbanecm, anzx: Continuing with sync [production]
13:44 <urbanecm@deploy2002> urbanecm, anzx: Backport for [[gerrit:1084306|tcywikisource: add logo (T378555)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
13:41 <urbanecm@deploy2002> Started scap sync-world: Backport for [[gerrit:1084306|tcywikisource: add logo (T378555)]] [production]
13:38 <urbanecm@deploy2002> Finished scap sync-world: Backport for [[gerrit:1085342|Set username in user mock and reset state after test (T378573)]], [[gerrit:1085343|Fix and re-enable selenium test (T378581)]], [[gerrit:1085344|Fix selenium test loading the wrong talk page]], [[gerrit:1085346|HomepageHooks: do not store assigned variant on account creation (T377713)]], [[gerrit:1085347|SpecialHomepage: show community update [production]
13:34 <urbanecm@deploy2002> hnowlan, sgimeno, urbanecm: Continuing with sync [production]
13:30 <urbanecm@deploy2002> hnowlan, sgimeno, urbanecm: Backport for [[gerrit:1085342|Set username in user mock and reset state after test (T378573)]], [[gerrit:1085343|Fix and re-enable selenium test (T378581)]], [[gerrit:1085344|Fix selenium test loading the wrong talk page]], [[gerrit:1085346|HomepageHooks: do not store assigned variant on account creation (T377713)]], [[gerrit:1085347|SpecialHomepage: show community upda [production]
13:28 <urbanecm@deploy2002> Started scap sync-world: Backport for [[gerrit:1085342|Set username in user mock and reset state after test (T378573)]], [[gerrit:1085343|Fix and re-enable selenium test (T378581)]], [[gerrit:1085344|Fix selenium test loading the wrong talk page]], [[gerrit:1085346|HomepageHooks: do not store assigned variant on account creation (T377713)]], [[gerrit:1085347|SpecialHomepage: show community update [production]
13:25 <urbanecm@deploy2002> Finished scap sync-world: Backport for [[gerrit:1084939|tcywikisource: Add namespaces, SITENAME and timezone (T378555)]], [[gerrit:1084940|tcywiktionary: add SITENAME and timezone (T378556)]], [[gerrit:1084307|tcywiktionary: add logo (T378556)]] (duration: 09m 39s) [production]
13:20 <urbanecm@deploy2002> anzx, urbanecm: Continuing with sync [production]
13:19 <hnowlan@deploy1003> helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: sync [production]
13:18 <urbanecm@deploy2002> anzx, urbanecm: Backport for [[gerrit:1084939|tcywikisource: Add namespaces, SITENAME and timezone (T378555)]], [[gerrit:1084940|tcywiktionary: add SITENAME and timezone (T378556)]], [[gerrit:1084307|tcywiktionary: add logo (T378556)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
13:18 <hnowlan@deploy1003> helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: sync [production]
13:15 <urbanecm@deploy2002> Started scap sync-world: Backport for [[gerrit:1084939|tcywikisource: Add namespaces, SITENAME and timezone (T378555)]], [[gerrit:1084940|tcywiktionary: add SITENAME and timezone (T378556)]], [[gerrit:1084307|tcywiktionary: add logo (T378556)]] [production]
13:14 <urbanecm@deploy2002> Finished scap sync-world: Backport for [[gerrit:1084200|TimedMediaHandler: use shellbox globally (T357309)]], [[gerrit:1078700|Remove RunSingleJobStdin script (T369048)]] (duration: 09m 43s) [production]
13:09 <urbanecm@deploy2002> urbanecm, hnowlan: Continuing with sync [production]
13:08 <urbanecm@deploy2002> urbanecm, hnowlan: Backport for [[gerrit:1084200|TimedMediaHandler: use shellbox globally (T357309)]], [[gerrit:1078700|Remove RunSingleJobStdin script (T369048)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
13:04 <urbanecm@deploy2002> Started scap sync-world: Backport for [[gerrit:1084200|TimedMediaHandler: use shellbox globally (T357309)]], [[gerrit:1078700|Remove RunSingleJobStdin script (T369048)]] [production]
12:27 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1237 (T376905)', diff saved to https://phabricator.wikimedia.org/P70752 and previous config saved to /var/cache/conftool/dbconfig/20241031-122719-ladsgroup.json [production]
12:12 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1237', diff saved to https://phabricator.wikimedia.org/P70751 and previous config saved to /var/cache/conftool/dbconfig/20241031-121212-ladsgroup.json [production]
12:06 <elukey@cumin1002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host aux-k8s-worker1002.eqiad.wmnet [production]
12:06 <elukey@cumin1002> START - Cookbook sre.k8s.pool-depool-node pool for host aux-k8s-worker1002.eqiad.wmnet [production]
12:01 <fnegri@cumin1002> END (PASS) - Cookbook sre.wikireplicas.add-wiki (exit_code=0) for database annwiki (T377118) [production]
12:01 <fnegri@cumin1002> START - Cookbook sre.wikireplicas.add-wiki for database annwiki (T377118) [production]
12:01 <fnegri@cumin1002> END (PASS) - Cookbook sre.wikireplicas.add-wiki (exit_code=0) for database tddwiki (T375016) [production]
12:00 <fnegri@cumin1002> START - Cookbook sre.wikireplicas.add-wiki for database tddwiki (T375016) [production]
12:00 <fnegri@cumin1002> END (PASS) - Cookbook sre.wikireplicas.add-wiki (exit_code=0) for database rskwiki (T375016) [production]
11:59 <fnegri@cumin1002> START - Cookbook sre.wikireplicas.add-wiki for database rskwiki (T375016) [production]
11:59 <fnegri@cumin1002> END (ERROR) - Cookbook sre.wikireplicas.add-wiki (exit_code=97) for database rskwiki (T375016) [production]
11:57 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1237', diff saved to https://phabricator.wikimedia.org/P70750 and previous config saved to /var/cache/conftool/dbconfig/20241031-115705-ladsgroup.json [production]
11:54 <fnegri@cumin1002> START - Cookbook sre.wikireplicas.add-wiki for database rskwiki (T375016) [production]
11:47 <arnaudb@cumin1002> END (PASS) - Cookbook sre.mysql.clone (exit_code=0) of db1232.eqiad.wmnet onto db1234.eqiad.wmnet [production]
11:41 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1237 (T376905)', diff saved to https://phabricator.wikimedia.org/P70747 and previous config saved to /var/cache/conftool/dbconfig/20241031-114158-ladsgroup.json [production]
11:38 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host aux-k8s-worker1002.eqiad.wmnet with OS bookworm [production]
11:34 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db1237 (T376905)', diff saved to https://phabricator.wikimedia.org/P70746 and previous config saved to /var/cache/conftool/dbconfig/20241031-113456-ladsgroup.json [production]
11:34 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1237.eqiad.wmnet with reason: Maintenance [production]
11:34 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1237.eqiad.wmnet with reason: Maintenance [production]
11:29 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1225.eqiad.wmnet with reason: Maintenance [production]
11:29 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1225.eqiad.wmnet with reason: Maintenance [production]
11:29 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1224 (T376905)', diff saved to https://phabricator.wikimedia.org/P70744 and previous config saved to /var/cache/conftool/dbconfig/20241031-112924-ladsgroup.json [production]
11:26 <fabfur> reverted previous action (T378578) [production]
11:20 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on aux-k8s-worker1002.eqiad.wmnet with reason: host reimage [production]
11:17 <fabfur> install haproxykafka on cp4037 and cp3066 (https://gerrit.wikimedia.org/r/c/operations/puppet/+/1085308) (T378578) [production]
11:17 <elukey@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on aux-k8s-worker1002.eqiad.wmnet with reason: host reimage [production]
11:14 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1224', diff saved to https://phabricator.wikimedia.org/P70743 and previous config saved to /var/cache/conftool/dbconfig/20241031-111417-ladsgroup.json [production]
11:02 <elukey@cumin1002> START - Cookbook sre.hosts.reimage for host aux-k8s-worker1002.eqiad.wmnet with OS bookworm [production]
11:01 <elukey@cumin1002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host aux-k8s-worker1002.eqiad.wmnet [production]
11:00 <elukey@cumin1002> START - Cookbook sre.k8s.pool-depool-node depool for host aux-k8s-worker1002.eqiad.wmnet [production]