Write a Blog >>
ICSE 2020
Wed 24 June - Thu 16 July 2020
Tue 7 Jul 2020 08:05 - 08:17 at Baekje - I4-Clones and Changes Chair(s): Chanchal K. Roy

Modern machine learning programs are often written in Python, with the main computations specified through calls to some highly optimized libraries (e.g., TensorFlow, PyTorch). How to maximize the computing efficiency of such programs is essential for many application domains, which has drawn lots of recent attention. This work points out a common limitation in existing efforts: they focus their views only on the static computation graphs specified by library APIs, but leave the influence from the hosting Python code largely unconsidered. The limitation often causes them to miss the big picture and hence many important optimization opportunities. This work proposes a new approach named HARP to address the problem. HARP enables holistic analysis that spans across computation graphs and their hosting Python code. HARP achieves it through a set of novel techniques: analytics-conscious speculative analysis to circumvent Python complexities, a unified representation augmented computation graphs to capture all dimensions of knowledge related with the holistic analysis, and conditioned feedback mechanism to allow risk-controlled aggressive analysis. Refactoring based on HARP gives 1.3–3X and 2.07X average speedups on a set of TensorFlow and PyTorch programs.

Tue 7 Jul

Displayed time zone: (UTC) Coordinated Universal Time change

08:05 - 09:05
I4-Clones and ChangesTechnical Papers / Journal First / New Ideas and Emerging Results / Demonstrations at Baekje
Chair(s): Chanchal K. Roy University of Saskatchewan
08:05
12m
Talk
HARP: Holistic Analysis for Refactoring Python-Based Analytics ProgramsTechnical
Technical Papers
Weijie Zhou North Carolina State University, Yue Zhao NCSU, Guoqiang Zhang North Carolina State University, Xipeng Shen North Carolina State University
08:17
12m
Talk
CC2Vec: Distributed Representations of Code ChangesTechnical
Technical Papers
Thong Hoang Singapore Management University, Singapore, Hong Jin Kang School of Information Systems, Singapore Management University, Julia Lawall Inria, David Lo Singapore Management University
Pre-print
08:29
6m
Talk
Code Duplication on Stack OverflowNIER
New Ideas and Emerging Results
Sebastian Baltes QAware GmbH and The University of Adelaide, Christoph Treude The University of Adelaide
Pre-print Media Attached
08:35
8m
Talk
Siamese: Scalable and Incremental Code Clone Search via Multiple Code RepresentationsJ1
Journal First
Chaiyong Ragkhitwetsagul Mahidol University, Thailand, Jens Krinke University College London
Link to publication DOI Pre-print Media Attached
08:43
8m
Talk
Empirical comparison of text-based mobile apps similarity measurement techniquesJ1
Journal First
Afnan Al-Subaihin King Saud University, Federica Sarro University College London, UK, Sue Black Durham University, Licia Capra University College London
Link to publication DOI Media Attached File Attached
08:51
3m
Talk
SimilarAPI: Mining Analogical APIs for Library MigrationDemo
Demonstrations
Chunyang Chen Monash University