Trace Aware Random Testing for Distributed Systems (SPLASH 2019 - OOPSLA)

Write a Blog >>

Sun 20 - Fri 25 October 2019 Athens, Greece

Who

Burcu Kulahcioglu Ozkan, Rupak Majumdar, Simin Oraee

Track

SPLASH 2019 OOPSLA

Time Zone

The program is currently displayed in (GMT+03:00) Beirut.

Use conference time zone: (GMT+03:00) BeirutSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Wed 23 Oct 2019 14:22 - 14:45 at Olympia - Testing Chair(s): Stephen Kell

Abstract

Distributed and concurrent applications often have subtle bugs that only get exposed under specific schedules. While these schedules may be found by systematic model checking techniques, in practice, model checkers do not scale to large systems. On the other hand, naive random exploration techniques often require a very large number of runs to find the specific interactions needed to expose a bug. In recent years, several random testing algorithms have been proposed that, on the one hand, exploit state-space reduction strategies from model checking and, on the other, provide guarantees on the probability of hitting bugs of certain kinds.

These existing techniques exploit two orthogonal strategies to reduce the state space: partial-order reduction and bug depth. Testing algorithms based on partial order techniques, such as RAPOS or POS, ensure non-redundant exploration of independent interleavings among system events by imposing an equivalence relation on schedules and ideally exploring only one schedule from each equivalence class. Techniques based on bug depth, such as PCT, exploit the empirical observation that many bugs are exposed by the clever scheduling of a small number of key events. They bias the sample space of schedules to only cover all executions of small depth, rather than the much larger space of all schedules. At this point, there is no random testing algorithm that combines the power of both approaches.

In this paper, we provide such an algorithm. Our algorithm, trace-aware PCT (taPCTCP), extends and unifies several different algorithms in the random testing literature. It samples the space of low-depth executions by constructing a schedule online, while taking dependencies among events into account. Moreover, the algorithm comes with a theoretical guarantee on the probability of sampling a trace of low depth—the probability grows exponentially with the depth but only polynomially with the number of racy events explored. We further show that the guarantee is optimal among a large class of techniques.

We empirically compare our algorithm with several state-of-the-art random testing approaches for concurrent software on two large-scale distributed systems, Zookeeper and Cassandra, and show that our approach is effective in uncovering subtle bugs and usually outperforms related random testing algorithms.

DOI

https://doi.org/10.1145/3360606

Burcu Kulahcioglu Ozkan

Max Planck Institute for Software Systems (MPI-SWS)

Germany

Rupak Majumdar

Max Planck Institute for Software Systems (MPI-SWS)

Simin Oraee

Max Planck Institute for Software Systems (MPI-SWS)

Time Zone

The program is currently displayed in (GMT+03:00) Beirut.

Use conference time zone: (GMT+03:00) BeirutSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Wed 23 Oct
Displayed time zone: Beirut change

14:00 - 15:30	Testing OOPSLA at Olympia Chair(s): Stephen Kell University of Kent

14:00 22m Talk		Reflection-Aware Static Regression Test Selection OOPSLA August Shi University of Illinois at Urbana-Champaign, Milica Hadzi-Tanovic Technische Universitat Munchen, Lingming Zhang The University of Texas at Dallas, Darko Marinov University of Illinois at Urbana-Champaign, Owolabi Legunsen University of Illinois at Urbana-Champaign DOI
14:22 22m Talk		Trace Aware Random Testing for Distributed Systems OOPSLA Burcu Kulahcioglu Ozkan Max Planck Institute for Software Systems (MPI-SWS), Rupak Majumdar Max Planck Institute for Software Systems (MPI-SWS), Simin Oraee Max Planck Institute for Software Systems (MPI-SWS) DOI
14:45 22m Talk		Automatic and Scalable Detection of Logical Errors in Functional Programming Assignments OOPSLA Dowon Song Korea University, Myungho Lee Korea University, Hakjoo Oh Korea University DOI
15:07 22m Talk		On the Complexity of Checking Transactional Consistency OOPSLA Ranadeep Biswas IRIF, University Paris Diderot & CNRS, Constantin Enea IRIF, University Paris Diderot & CNRS DOI