Machine Translation Testing via Pathological Invariance
Due to the rapid development of deep neural networks, in recent years, machine translation software has been widely adopted in people’s daily lives, such as communicating with foreigners or understanding political news from the neighbouring countries. However, machine translation software could return incorrect translations because of the complexity of the underlying network. To address this problem, we introduce a novel methodology called PaInv for validating machine translation software. Our key insight is that sentences of different meanings should not have the same translation (i.e., pathological invariance). Specifically, PaInv generates syntactically similar but semantically different sentences by replacing one word in the sentence and filter out unsuitable sentences based on both syntactic and semantic information. We have applied PaInv to Google Translate using 200 English sentences as input with three language settings: English$\rightarrow$Hindi, English$\rightarrow$Chinese, and English$\rightarrow$German. PaInv can accurately find 331 pathological invariants in total, revealing more than 100 translation errors.
Wed 8 JulDisplayed time zone: (UTC) Coordinated Universal Time change
09:10 - 10:00 | |||
09:10 50mPoster | The Role of Egocentric Bias in Undergraduate Agile Software Development Teams ACM Student Research Competition Frederike Ramin Hasso Plattner Institute | ||
09:10 50mPoster | Evaluation of brain activity while Pair Programming ACM Student Research Competition Ananga Thapaliya Innopolis University | ||
09:10 50mPoster | Playing With Your Project Data in Scrum Retrospectives ACM Student Research Competition Christoph Matthies Hasso Plattner Institute, University of Potsdam | ||
09:10 50mPoster | An empirical study of the first contributions of developers to open source projects on GitHub ACM Student Research Competition Vikram N. Subramanian University of Waterloo | ||
09:10 50mPoster | Machine Translation Testing via Pathological Invariance ACM Student Research Competition Shashij Gupta IIT BOMBAY | ||
09:10 50mPoster | Automated Analysis of Inter-Parameter Dependencies in Web APIs ACM Student Research Competition Alberto Martin-Lopez Universidad de Sevilla |