Swinburne
Browse

Improving automated documentation to code traceability by combining retrieval techniques

Download (1.03 MB)
conference contribution
posted on 2024-07-09, 14:22 authored by Xiaofan Xiaofan Chen, John Grundy
Documentation written in natural language and source code are two of the major artifacts of a software system. Tracking a variety of traceability links between software documentation and source code assists software developers in comprehension, efficient development, and effective management of a system. Automated traceability systems to date have been faced with a major open research challenge: how to extract these links with both high precision and high recall. In this paper we introduce an approach that combines three supporting techniques, Regular Expression, Key Phrases, and Clustering, with a Vector Space Model (VSM) to improve the performance of automated traceability between documents and source code. This combination approach takes advantage of strengths of the three techniques to ameliorate limitations of VSM. Four case studies have been used to evaluate our combined technique approach. Experimental results indicate that our approach improves the performance of VSM, increases the precision of retrieved links, and recovers more true links than VSM alone.

History

Available versions

PDF (Accepted manuscript)

ISBN

9781457716393

Journal title

2011 26th IEEE/ACM International Conference on Automated Software Engineering, ASE 2011, Proceedings

Conference name

Automated Software Engineering Conference

Location

Lawrence, KS

Start date

2011-11-06

End date

2011-11-10

Pagination

9 pp

Publisher

IEEE

Copyright statement

Copyright © 2011 IEEE. The accepted manuscript is reproduced in accordance with the copyright policy of the publisher. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Language

eng

Usage metrics

    Publications

    Categories

    No categories selected

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC