dc.contributor.author
Prechelt, Lutz
dc.date.accessioned
2018-06-08T07:49:17Z
dc.date.available
2014-01-15T10:50:34.500Z
dc.identifier.uri
https://refubium.fu-berlin.de/handle/fub188/18782
dc.identifier.uri
http://dx.doi.org/10.17169/refubium-22466
dc.description.abstract
Background: Data from software data repositories such as source code version
archives and defect databases contains valuable information that can be used
for insights (leading to subsequent improvements), in particular defect
insertion circumstance analysis and defect prediction. The first step in such
analyses is identifying defect-correcting changes in the version archive
(bugfix commits) and linking them to corresponding entries in the defect
database, thus establishing bugfix links, in order to enrich the content of
the defect-correcting change with additional meta-data. Typically, identifying
the bugfix commits in a version archive is done via heuristic string matching
on the commit message. Research questions: Which filters could be used to
obtain a set of bugfix links? How does one set the cutoff parameters of each?
What effect (results loss and precision) does each filter then have? Which
overall precision, results loss, and recall are achieved? Method: We analyze a
comprehensive modular set of seven independent filters, including new ones
that make use of reverse links. We describe and evaluate visual heuristics
(based on simple diagnostic plots) for setting six filters' cutoff parameters.
We apply these to a commercial repository from the Web CMS domain and validate
the results with unprecedented precision by making use of a product expert to
manually verify over 2500 links. Results: The parameter selection heuristics
pick a very good parameter value in five of the six cases and a reasonably
good one in the sixth. As a result, the combined filtering, called bflinks,
proposes a set of bugfix links that has 93% precision with only 7% results
loss. Conclusion: The modular filtering approach can provide high-quality
results and can be adapted to repositories with different properties.
de
dc.relation.ispartofseries
urn:nbn:de:kobv:188-fudocsseries000000000021-2
dc.rights.uri
http://www.fu-berlin.de/sites/refubium/rechtliches/Nutzungsbedingungen
dc.subject.ddc
000 Informatik, Informationswissenschaft, allgemeine Werke::000 Informatik, Wissen, Systeme::006 Spezielle Computerverfahren
dc.title
bflinks: Reliable Bugfix links via bidirectional references and tuned
heuristics
refubium.affiliation
Mathematik und Informatik
de
refubium.mycore.fudocsId
FUDOCS_document_000000019380
refubium.mycore.reportnumber
TR-B-14-01
refubium.resourceType.isindependentpub
no
refubium.series.issueNumber
Institut für Informatik Technical Report
refubium.series.name
Freie Universität Berlin, Fachbereich Mathematik und Informatik
refubium.series.reportNumber
14-1
refubium.mycore.derivateId
FUDOCS_derivate_000000002916
dcterms.accessRights.openaire
open access