Poster Title:  Hierarchical Coded Matrix Multiplication
Poster Abstract: 

Coded matrix multiplication is a technique to enable straggler-resistant multiplication of large matrices in distributed computing systems. In this work, we first present a conceptual framework to represent the division of work amongst processors in coded matrix multiplication as a cuboid partitioning problem. This framework allows us to unify existing methods and motivates new techniques. Building on this framework, we propose hierarchical coded matrix multiplication which is able to exploit the work completed by all processors (fast and slow), rather than ignoring the slow ones, even if the amount of work completed by stragglers is much less than that completed by the fastest workers. On Amazon EC2, we achieve a 37% improvement in average finishing time compared to non-hierarchical schemes.



Poster ID:  A-5
Poster File:  PDF document HMM.pdf
Poster Image: 
Poster URL: