SPLASH 2019
Sun 20 - Fri 25 October 2019 Athens, Greece
Tue 22 Oct 2019 14:30 - 15:00 at Room 3A - Session 2

A number of online machine learning techniques based on tree models have been studied to meet today’s requirement of quickly processing large-scale data sets. We present a design pattern for incremental tree data processing that gradually constructs a tree model in memory on demand. Our approach adopts the actor model to exploit multiple cores and distributed computers without extensively rewriting the algorithms' code. The pattern defines each node in the tree as an actor, which is the unit of asynchronous processing, and each data instance flows between actor nodes as a message. We study two concrete machine learning algorithms: VFDT, which grows a decision tree top-down, and BIRCH, which grows a hierarchical clustering bottom-up. To support VFDT, we propose an extension mechanism that replicates root nodes to address the bottleneck at the point where inputs enter the tree. To support BIRCH, we split the recursive construction process into asynchronous steps, correcting the target node by traversing extra horizontal links between sibling nodes. We carried out machine learning tasks with our implementation on top of Akka Java and confirmed reasonable performance on large-scale data sets.
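
The core idea of the pattern (each tree node is an actor, each data instance is a message) can be illustrated with a minimal sketch. It is not the authors' implementation: it uses plain `java.util.concurrent` rather than Akka, and the names `NodeActor`, `splitAfter`, and `run` are hypothetical. The fixed-count split rule is an assumed stand-in for a real growth criterion such as the statistical test used by VFDT.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

public class ActorTreeSketch {

    /** One tree node as a tiny actor: a private mailbox drained by a single thread. */
    static final class NodeActor {
        private final ExecutorService mailbox = Executors.newSingleThreadExecutor(r -> {
            Thread t = new Thread(r);
            t.setDaemon(true); // do not keep the JVM alive once main returns
            return t;
        });
        private final double lo, hi;            // value range covered by this node
        private final int splitAfter;           // assumed rule: grow after this many instances
        private final CountDownLatch absorbed;  // counts instances that reached a leaf
        private final AtomicInteger nodeCount;  // total nodes created so far
        private int seen = 0;                   // touched only by this node's mailbox thread
        private NodeActor left, right;          // children, created on demand

        NodeActor(double lo, double hi, int splitAfter,
                  CountDownLatch absorbed, AtomicInteger nodeCount) {
            this.lo = lo;
            this.hi = hi;
            this.splitAfter = splitAfter;
            this.absorbed = absorbed;
            this.nodeCount = nodeCount;
            nodeCount.incrementAndGet();
        }

        /** Asynchronous send: enqueue the instance and return immediately. */
        void tell(double x) {
            mailbox.execute(() -> receive(x));
        }

        private void receive(double x) {
            double mid = (lo + hi) / 2;
            if (left != null) {
                // Internal node: forward the instance to a child as a message.
                (x < mid ? left : right).tell(x);
                return;
            }
            // Leaf node: absorb the instance, then grow the tree if the rule fires.
            seen++;
            if (seen >= splitAfter) {
                left = new NodeActor(lo, mid, splitAfter, absorbed, nodeCount);
                right = new NodeActor(mid, hi, splitAfter, absorbed, nodeCount);
            }
            absorbed.countDown();
        }

        /** Call only after all instances are absorbed (the latch makes the counts visible). */
        int totalSeen() {
            return seen + (left == null ? 0 : left.totalSeen() + right.totalSeen());
        }
    }

    /** Streams n values from [0, 1) into the tree; returns {node count, instances absorbed}. */
    static int[] run(int n, int splitAfter) {
        CountDownLatch absorbed = new CountDownLatch(n);
        AtomicInteger nodeCount = new AtomicInteger();
        NodeActor root = new NodeActor(0.0, 1.0, splitAfter, absorbed, nodeCount);
        for (int i = 0; i < n; i++) {
            root.tell(i / (double) n);
        }
        try {
            if (!absorbed.await(10, TimeUnit.SECONDS)) {
                throw new IllegalStateException("timed out waiting for the tree");
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
        return new int[] { nodeCount.get(), root.totalSeen() };
    }

    public static void main(String[] args) {
        int[] result = run(100, 10);
        System.out.println("nodes=" + result[0] + ", instances absorbed=" + result[1]);
    }
}
```

Every instance enters through the single root in this sketch, which makes the input bottleneck mentioned in the abstract visible: the root's mailbox serializes all traffic before it fans out to the children.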

Tue 22 Oct

14:00 - 15:30: AGERE 2019 - Session 2 at Room 3A
14:00 - 14:30
Talk
Nikolai Suslov (Fund for Supporting Development of RT)
14:30 - 15:00
Talk
Kouhei Sakurai (Graduate School of Natural Science & Technology, Kanazawa University), Taiki Shimizu (Graduate School of Natural Science & Technology, Kanazawa University)
15:00 - 15:30
Talk
Sebastian Blessing (Imperial College London), Kiko Fernandez-Reyes (Uppsala University), Albert Mingkun Yang, Sophia Drossopoulou (Imperial College London), Tobias Wrigstad (Uppsala University)