Recovering Documentation-to-Source-Code Traceability Links using Latent Semantic Indexing 论文

2007引用 547
Software Engineering ResearchWeb Data Mining and AnalysisAdvanced Malware Detection Techniques

摘要

An information retrieval technique, latent semantic indexing, is used to automatically identi traceability links from system documentation to program source code. The results of two experiments to identi links in existing software systems (i.e., the LEDA library, and Albergate) are presented. These results are compared with other similar type experimental results of traceability link identification using different types of information retrieval techniques. The method presented proves to give good results by comparison and additionally it is a low cost, highly flexible method to apply with regards to preprocessing and/or parsing of the source code and documentation.