Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.14279/13850
Title: Mapreduce programming and costbased optimization? Crossing this chasm with starfish
Authors: Herodotou, Herodotos 
Dong, Fei 
Babu, Shivnath
Major Field of Science: Engineering and Technology
Field Category: Electrical Engineering - Electronic Engineering - Information Engineering
Issue Date: Aug-2011
Source: Proceedings of the VLDB Endowment, 2011, vol. 4, no. 12
Volume: 4
Issue: 12
Conference: VLDB Endowment 
Abstract: MapReduce has emerged as a viable competitor to database systems in big data analytics. MapReduce programs are being written for a wide variety of application domains including business data processing, text analysis, natural language processing, Web graph and social network analysis, and computational science. However, MapReduce systems lack a feature that has been key to the historical success of database systems, namely, cost-based optimization. A major challenge here is that, to the MapReduce system, a program consists of black-box map and reduce functions written in some programming language like C++, Java, Python, or Ruby. Starfish is a self-tuning system for big data analytics that includes, to our knowledge, the first Cost-based Optimizer for simple to arbitrarily complex MapReduce programs. Starfish also includes a Profiler to collect detailed statistical information from unmodified MapReduce programs, and a What-if Engine for fine-grained cost estimation. This demonstration will present the profiling, whatif analysis, and cost-based optimization of MapReduce programs in Starfish. We will show how (nonexpert) users can employ the Starfish Visualizer to (a) get a deep understanding of a MapReduce program's behavior during execution, (b) ask hypothetical questions on how the program's behavior will change when parameter settings, cluster resources, or input data properties change, and (c) ultimately optimize the program. © 2011 VLDB Endowment.
ISSN: 2150-8097
Type: Conference Papers
Affiliation : Duke University 
Publication Type: Peer Reviewed
Appears in Collections:Δημοσιεύσεις σε συνέδρια /Conference papers or poster or presentation

CORE Recommender
Show full item record

SCOPUSTM   
Citations 10

28
checked on Nov 6, 2023

Page view(s) 10

271
Last Week
1
Last month
2
checked on Dec 25, 2024

Google ScholarTM

Check


Items in KTISIS are protected by copyright, with all rights reserved, unless otherwise indicated.