1751-1800 of 2560 results (19ms)
2017-07-20 §
20:30 <nuria_> deploying eventlogging c1c2c39411ccd002ff8cea197bc535155213f5fb and restarting [analytics]
18:18 <ottomata> deleted instance deployment-eventlogging03 in favor of new instance deployment-eventlog02 [analytics]
17:14 <ottomata> killed tranquility instances tranq-banners and tranq-netflow running on druid1003 in joal's screen sessions [analytics]
2017-07-18 §
13:04 <ottomata> adding unique index on meta_id and index on meta_dt to mediawiki_page_{create,delete,move,undelete}_1 on db1046 MySQL eventlogging master [analytics]
2017-07-17 §
16:27 <elukey> set innodb_flush_log_at_trx_commit on bohrium to 2 and sync_binlog=300 to reduce iowait - T164073 [analytics]
14:31 <elukey> set innodb_flush_log_at_trx_commit on bohrium to 1 (default value)- T164073 [analytics]
2017-07-12 §
13:48 <fdans> updated pageview whitelist with din.wikipedia [analytics]
2017-07-11 §
05:24 <elukey> drop _Edit_11448630_old from dbstore1002 [analytics]
2017-07-10 §
16:14 <nuria_> deploying eventlogging 5e16da16e3f5ce287829390a76b9f5b0c7715ee5 [analytics]
2017-07-08 §
07:55 <elukey> re-run wikidata-specialentitydata_metrics-wf-2017-7-7 in Hue (failed Spark job) [analytics]
2017-07-06 §
10:37 <elukey> taking mysqldump for Piwik and storing it on stat1002:/a/backup/bohrium/mysqldump_20170706.sql [analytics]
2017-07-04 §
11:21 <joal> Redeploying refinery with scap [analytics]
11:10 <joal> Restart unique_devices-per_project_family-monthly-coord after correction deployed [analytics]
11:03 <joal> Deploying refinery onto hdfs [analytics]
10:57 <joal> Deploying refinery with scap [analytics]
2017-07-03 §
16:40 <joal> Manually launch sqoop imports for enwiki revision, and wikidatawiki revision and logging tables, snapshot=2017-06 [analytics]
2017-07-01 §
21:33 <joal> Restart cassandra bundle at beginning of the month [analytics]
2017-06-29 §
11:39 <joal> Update tables and archived data and kill/start jobs for unique-devices per project-family [analytics]
11:34 <joal> Kill and restart druid webrequest sampled oozie jobs after deploy [analytics]
11:18 <joal> Update tables and restart mediawiki_history oozie jobs after deploy [analytics]
10:58 <elukey> deploy refinery to HDFS [analytics]
10:57 <elukey> fixed archiva whitelist in the analytics VLAN (VM changed IP) [analytics]
07:03 <joal> Deploying refinery with scap (after yesterday's failure) [analytics]
2017-06-28 §
18:17 <joal> Deploying refinery with scap [analytics]
16:25 <joal> Building / Deploying refinery-source from jenkins to archiva (v0.0.480 [analytics]
15:42 <elukey> analytics1030 back to the worker nodes after maintenance [analytics]
2017-06-27 §
16:26 <milimetric> quarry Rebooted all the boxes in an attempt to fix performance problems [analytics]
10:05 <elukey> added https://wiki.apache.org/commons/VfsProblems to stat1004 [analytics]
07:14 <joal> Rerun wikidata-articleplaceholder_metrics-wf-2017-6-26 [analytics]
2017-06-24 §
10:31 <elukey> re-run webrequest-load-coord-misc's failed job in hue [analytics]
2017-06-23 §
07:32 <elukey> uploaded new pageview whitelist following https://wikitech.wikimedia.org/wiki/Analytics/Team/Oncall#Find_and_fix_pageview_whitelist_exceptions for kbp.wikipedia [analytics]
2017-06-21 §
20:23 <joal> Disable puppet agent and restart kafka with 48h retention in deployment-kafka01 [analytics]
13:59 <elukey> eventlogging restarted after reboot [analytics]
13:54 <elukey> stop eventlogging and reboot eventlog1001 [analytics]
13:15 <elukey> reboot analytics1003 for kernel update [analytics]
11:08 <elukey> stop camus on an1003 [analytics]
2017-06-20 §
19:24 <ottomata> beginning to consume select eventbus event using eventlogging mysql consumer and inserting into eventlogging analytics mysql db [analytics]
18:01 <joal> Rerun webrequest-load-wf-text-2017-6-20-12 after oozie failure [analytics]
16:23 <joal> Restarted tranquility for banners and netflow on druid1003 [analytics]
16:18 <joal> Rererun pageview-druid-hourly-wf-2017-6-20-14 (failed due to druid reboots) [analytics]
16:04 <elukey> re-run pageview-druid-hourly-wf-2017-6-20-14 (failed due to druid reboots) [analytics]
14:46 <elukey> re-run failed webrequest-load-text/upload jobs due to reboots [analytics]
13:29 <elukey> restart webrequest-load-coord-text and webrequest-load-coord-upload failed jobs due to reboots [analytics]
13:14 <elukey> re-run wikidata-wdqs_extract-wf-2017-6-20-11 (failed for connection issues, likely due to reboots) [analytics]
11:54 <joal> Deleting old unique_devices data (renamed to unique_devices_per_domain) [analytics]
10:27 <elukey> reboot kafka1012, analytics1028, aqs1004 for kernel upgrades (canary hosts) [analytics]
08:51 <elukey> manually added the user 'hdfs' to the 'hive' group to be able to run refinery-drop-webrequest-partitions [analytics]
08:49 <elukey> manually running /srv/deployment/analytics/refinery/bin/refinery-drop-webrequest-partitions on an1003 to free hdfs space [analytics]
2017-06-19 §
12:10 <elukey> disable BBU auto learn on all the hadoop workers [analytics]
2017-06-13 §
10:10 <elukey> merged big zookeeper refactoring https://gerrit.wikimedia.org/r/#/c/354449 - Druid's Hadoop client config now correctly points to conf1* and not drud1* [analytics]