Write a Blog >>
ICSE 2020
Wed 24 June - Thu 16 July 2020

Modern machine learning programs are often written in Python, with the main computations specified through calls to some highly optimized libraries (e.g., TensorFlow, PyTorch). This work points out a common limitation in existing efforts for optimizing such programs: they focus their views only on the static computation graphs specified by library APIs, but leave the influence from the hosting Python code largely unconsidered. The limitation often causes them to miss the big picture and hence many important optimization opportunities. This work proposes a new approach named HARP to address the problem by enabling holistic analysis that spans across computation graphs and their hosting Python code. Refactoring based on HARP gives 1.3–3X and 2.07X average speedups on a set of TensorFlow and PyTorch programs.