List of Abstracts

Cache-conscious scheduling of streaming applications

SpeakerKunal Agrawal

We consider the problem of scheduling streaming applications in order to minimize the number of cache-misses. Streaming applications are represented as a directed graph (or multigraph), where nodes are computation modules and edges are channels. When a module fires, it consumes some data-items from its input channels and produces some items on its output channels. In addition, each module may have some state (either code or data) which represents the memory locations that must be loaded into cache in order to execute the module. Our main contribution is to show that for a large and important class of streaming computations, cache-efficient scheduling is essentially equivalent to solving a constrained graph partitioning problem. Given a good partition, we describe a runtime strategy for scheduling streaming graphs and prove that this runtime strategy is asymptotically optimal with constant factor cache-augmentation.

Efficient Computation of Optimal Energy and Fractional Weighted Flow Trade-off Schedules

SpeakerAntonios Antoniadis

Joint work with Neal Barcelo, Mario Consuegra, Peter Kling, Michael Nugent, Kirk Pruhs and Michele Scquizzato. We give a polynomial time algorithm to compute an optimal energy and fractional weighted flow trade-off schedule for a speed-scalable processor with discrete speeds. Our algorithm uses a geometric approach that is based on structural properties obtained from a primal-dual formulation of the problem.

Co-Scheduling Algorithms for High-Throughput Workload Execution

SpeakerGuillaume Aupy

Joint work with M. Shantharam, A. Benoit, Y. Robert and P. Raghavan. This paper investigates co-scheduling algorithms for processing a set of parallel applications. Instead of executing each application one by one, using a maximum degree of parallelism for each of them, we aim at scheduling several applications concurrently. We partition the original application set into a series of packs, which are executed one by one. A pack comprises several applications, each of them with an assigned number of processors, with the constraint that the total number of processors assigned within a pack does not exceed the maximum number of available processors. The objective is to determine a partition into packs, and an assignment of processors to applications, that minimize the sum of the execution times of the packs. We thoroughly study the complexity of this optimization problem, and propose several heuristics that exhibit very good performance on a variety of workloads, whose application execution times model profiles of parallel scientific codes. We show that co-scheduling leads to to faster workload completion time and to faster response times on average (hence increasing system throughput and saving energy), for significant benefits over traditional scheduling from both the user and system perspectives.

New Challenges in Scheduling Theory

March 31 - April 4, 2014 Centre CNRS "Paul-Langevin", Aussois, France