On the Impact of Programming Languages on Code Quality (SPLASH 2019 - OOPSLA)

Sun 20 - Fri 25 October 2019 Athens, Greece

Who

Emery D. Berger, Celeste Hollenbeck, Petr Maj, Olga Vitek, Jan Vitek

Track

SPLASH 2019 OOPSLA

When

Thu 24 Oct 2019 14:00 - 14:22 at Attica - Corpus Studies Chair(s): Jonathan Aldrich

Abstract

This paper is a reproduction of work by Ray et al., which claimed to have uncovered a statistically significant association between eleven programming languages and software defects in projects hosted on GitHub. First, we conduct an experimental repetition. The repetition is only partially successful, but it does validate one of the key claims of the original work about the association of ten programming languages with defects. Next, we conduct a complete, independent reanalysis of the data and statistical modeling steps of the original study. We uncover a number of flaws that undermine the conclusions of the original study: only four languages are found to have a statistically significant association with defects, and even for those the effect size is exceedingly small. We conclude with some additional sources of bias that should be investigated in follow-up work and a few best-practice recommendations for similar efforts.

Link to Publication

http://janvitek.org/pubs/toplas19.pdf

Link to Preprint

https://arxiv.org/pdf/1901.10220.pdf

DOI

https://doi.org/10.1145/3340571

Emery D. Berger

University of Massachusetts Amherst

United States

Celeste Hollenbeck

Northeastern University

United States

Petr Maj

Czech Technical University

Czechia

Olga Vitek

Northeastern University

United States

Jan Vitek

Northeastern University

United States

Session Program

Thu 24 Oct
Displayed time zone: Beirut

14:00 - 15:30	Corpus Studies (OOPSLA) at Attica. Chair(s): Jonathan Aldrich (Carnegie Mellon University)

14:00 22m Talk		On the Impact of Programming Languages on Code Quality (TOPLAS, OOPSLA). Emery D. Berger (University of Massachusetts Amherst), Celeste Hollenbeck (Northeastern University), Petr Maj (Czech Technical University), Olga Vitek (Northeastern University), Jan Vitek (Northeastern University)
14:22 22m Talk		Casting about in the Dark: An Empirical Study of Cast Operations in Java Programs (OOPSLA). Luis Mastrangelo (Università della Svizzera italiana), Matthias Hauswirth (Università della Svizzera italiana), Nate Nystrom (Università della Svizzera italiana)
14:45 22m Talk		On the Design, Implementation, and Use of Laziness in R (OOPSLA). Aviral Goel (Northeastern University), Jan Vitek (Northeastern University)
15:07 22m Talk		Aroma: Code Recommendation via Structural Code Search (OOPSLA). Sifei Luan (Facebook, Inc.), Di Yang (University of California, Irvine), Celeste Barnaby (Facebook, Inc.), Koushik Sen (University of California, Berkeley), Satish Chandra (Facebook)