CMU-CS-23-110 Computer Science Department School of Computer Science, Carnegie Mellon University
Practical Coding-Theoretic Tools for Machine Learning Jack Kosaian Ph.D. Thesis April 2023
Machine learning (ML) is now used in many domains, such as web services and safety-critical systems. This has led to the development of ML systems for deploying and training ML models. Beyond achieving high accuracy, ML systems must also use computing infrastructure efficiently and tolerate unreliable infrastructure. Coding-theoretic tools enable many systems to operate reliably without the significant resource overhead that accompanies replication-based approaches. These tools are used in production storage and communication systems, and there is growing interest in their use for distributed computing. This thesis explores the interplay between ML systems and practical applications of coding theory. Specifically, we show how ML systems can be made more reliable and efficient via novel uses of coding-theoretic tools, and how coding-theoretic tools can be expanded in reach and be made more efficient through techniques from ML and ML systems. We illustrate this interaction via multiple thrusts:
(1) We show how properties unique to ML systems can be exploited to efficiently
integrate coding-theoretic fault tolerance techniques into ML systems. First, we
reduce the execution-time overhead of fault-tolerant inference on GPUs by up to
5.3 x by exploiting trends in neural network design and GPU hardware. Second,
we show how coding-theoretic tools can be coupled with the unique properties of
recommendation models to enable low-overhead fault tolerance in training. Through these thrusts, this thesis demonstrates the promise of using coding-theoretic in ML systems and ideas from ML systems in coding-theoretic tools to bring about the next generation of efficient and reliable systems.
193 pages
Thesis Committee:
Srinivasan Seshan, Head, Computer Science Department
| |
Return to:
SCS Technical Report Collection This page maintained by reports@cs.cmu.edu |