HeteroSpark: A Heterogeneous CPU/GPU Spark Platform for Machine Learning Algorithms

Peilong Li, Yan Luo, Ning Zhang, Yu Cao

Dept. of Electrical and Computer Engineering, Dept. of Computer Science
University of Massachusetts Lowell

Abstract—Analytics algorithms on big data sets require tremendous computational capability. Spark is a recent development that addresses big data challenges with data and computation distribution and in-memory caching. However, as a CPU-only framework, Spark cannot leverage GPUs and a growing set of GPU libraries to achieve better performance and energy efficiency. We present HeteroSpark, a GPU-accelerated heterogeneous architecture integrated with Spark, which combines the massive compute power of GPUs with the scalability of CPUs and system memory resources for applications that are both data and compute intensive. We make the following contributions in this work: (1) we integrate the GPU accelerator into the current Spark framework to further exploit data parallelism and accelerate algorithms; (2) we provide a "plug-n-play" design by augmenting the Spark platform so that current Spark applications can choose to enable or disable GPU acceleration; (3) acceleration is transparent to developers, so existing Spark applications can be easily ported to this heterogeneous platform without code modifications. Our evaluation of HeteroSpark demonstrates up to 18x speedup on a number of machine learning applications.

I. INTRODUCTION

Recent research advances in machine learning, deep learning algorithms, and data analytics tools impose new requirements on existing computing systems and architectures. To help address these challenges, the recent development of Apache Spark [1] significantly boosts the performance of the Hadoop framework with in-memory data computation. However, a gap remains in CPU-based cluster computing frameworks such as Spark for computationally intensive applications. On one hand, while the Spark framework can partition and distribute data, the complex algorithms applied to each data unit on a single node still consume a large number of CPU cycles and therefore have difficulty meeting real-time requirements. On the other hand, GPUs have been leveraged as accelerators to speed up complex workloads such as K-Means and SVM, thanks to their core density and power efficiency. However, their memory space is generally limited to a single GPU node. Therefore, the marriage of GPUs to a CPU-based cluster computing framework is a particularly interesting approach to addressing both big data and intensive computation challenges.

In this work, we present HeteroSpark, a middleware framework to address the aforementioned challenges. We augment the vanilla Spark framework by placing GPUs in Spark's worker nodes and employing Java Virtual Machines to ex-

978-1-4673-7891-8/15/$31.00 ©2015 IEEE

[Figure 1: HeteroSpark architecture. A Spark Master coordinates Spark Workers 1 through N (CPUs); each Spark Worker is paired with a HeteroSpark Worker whose GPU is connected either locally or over the network. Call flow: when a method is called on a Spark Worker, HeteroSpark checks the method against a whitelist of accelerated implementations; if the method exists on the whitelist (Yes), serialized data is sent through to the GPU worker; otherwise (No), the call proceeds on the CPU.]
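The whitelist-based dispatch behind this plug-n-play design can be sketched in plain Java. This is an illustrative sketch only: the names `GpuDispatcher`, `register`, and `call` are our own, not HeteroSpark's actual API, and the GPU offload (RMI calls and data serialization between JVMs) is stubbed out with an in-process function.

```java
// Hypothetical sketch of whitelist-based GPU dispatch (illustrative names,
// not HeteroSpark's actual API). In the real system the accelerated path
// would serialize data and invoke a GPU kernel on a HeteroSpark worker.
import java.util.HashMap;
import java.util.Map;
import java.util.function.Function;

public class GpuDispatcher {
    // Whitelist: method name -> GPU-accelerated implementation (stubbed here).
    private final Map<String, Function<double[], double[]>> whitelist = new HashMap<>();
    // Plug-n-play switch: applications can enable or disable acceleration.
    private boolean gpuEnabled = true;

    public void register(String name, Function<double[], double[]> gpuImpl) {
        whitelist.put(name, gpuImpl);
    }

    public void setGpuEnabled(boolean enabled) {
        gpuEnabled = enabled;
    }

    // Transparent dispatch: the caller's code is identical whether or not the
    // method has an accelerated version; unlisted methods fall back to the CPU.
    public double[] call(String name, double[] data,
                         Function<double[], double[]> cpuImpl) {
        if (gpuEnabled && whitelist.containsKey(name)) {
            return whitelist.get(name).apply(data); // offload to GPU worker
        }
        return cpuImpl.apply(data); // CPU fallback path
    }
}
```

Because dispatch is keyed only on the method name, existing application code needs no changes: enabling acceleration is a matter of registering an accelerated implementation, and disabling it simply routes every call back through the CPU path.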
