Relay: A New IR for Machine Learning Frameworks

Jared Roesch; Josh Pollock; Logan Weber; Marisa Kirisame; Steven Lyubomirsky; Tianqi Chen; Zachary Tatlock

arxiv: 1810.00952 · v1 · pith:DHLSKT2Anew · submitted 2018-09-26 · 💻 cs.PL · cs.LG



keywords relay learning constraints efficient framework machine powers accommodate


Machine learning powers diverse services in industry, including search, translation, recommendation systems, and security. The scale and importance of these models require that they be efficient, expressive, and portable across an array of heterogeneous hardware devices. These constraints are often at odds; to better accommodate them, we propose a new high-level intermediate representation (IR) called Relay. Relay is being designed as a purely functional, statically typed language with the goal of balancing efficient compilation, expressiveness, and portability. We discuss the goals of Relay and highlight its important design constraints. Our prototype is part of the open-source NNVM compiler framework, which powers Amazon's deep learning framework MXNet.
