Cleaning up confounding: accounting for endogeneity using instrumental variables and two-stage models

dc.contributor.authorGraf-Vlachy, Lorenz
dc.contributor.authorWagner, Stefan
dc.date.accessioned2025-09-09T13:35:56Z
dc.date.available2025-09-09T13:35:56Z
dc.date.issued2024-11
dc.description.abstractStudies in empirical software engineering are often most useful if they make causal claims because this allows practitioners to identify how they can purposefully influence (rather than only predict) outcomes of interest. Unfortunately, many non-experimental studies suffer from potential endogeneity, for example, through omitted confounding variables, which precludes claims of causality. In this conceptual tutorial, we aim to transfer the proven solution of instrumental variables and two-stage models as a means to account for endogeneity from econometrics to the field of empirical software engineering. To this end, we discuss causality and causal inference, provide a definition of endogeneity, explain its causes, and lay out the conceptual idea behind instrumental variable approaches and two-stage models. We also provide an extensive illustration with simulated data and a brief illustration with real data to demonstrate the approach, offering Stata and R code to allow researchers to replicate our analyses and apply the techniques to their own research projects. We close with concrete recommendations and a guide for researchers on how to deal with endogeneity.en
dc.identifier.urihttp://hdl.handle.net/2003/43950
dc.language.isoen
dc.relation.ispartofseriesACM transactions on software engineering and methodology
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subjectCCS Conceptsen
dc.subjectSoftware and its engineeringen
dc.subjectGeneral and referenceen
dc.subjectEmpirical studiesen
dc.subject.ddc330
dc.titleCleaning up confounding: accounting for endogeneity using instrumental variables and two-stage modelsen
dc.typeText
dc.type.publicationtypeResearchArticle
dcterms.accessRightsopen access
eldorado.dnb.deposittrue
eldorado.doi.registerfalse
eldorado.secondarypublicationtrue
eldorado.secondarypublication.primarycitationGraf-Vlachy L, Wagner S. Cleaning up confounding: accounting for endogeneity using instrumental variables and two-stage models. ACM transactions on software engineering and methodology. 2024;33(8). doi:10.1145/3674730
eldorado.secondarypublication.primaryidentifierhttps://doi.org/10.1145/3674730

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
3674730.pdf
Size:
9.06 MB
Format:
Adobe Portable Document Format
Description:
DNB
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
4.82 KB
Format:
Item-specific license agreed upon to submission
Description: