Proposed Application of Data Mining Techniques for Clustering Software Projects

Main Article Content

Henrique Ribeiro Rezende
Ahmed Ali Abdalla Esmin

Abstract

Software projects always generate a lot of data, ranging from informal documentation to a database with thousands of lines of code. This information extracted from software projects takes even greater when it comes to OSS (Open Source Software). Such data may include source code base, historical change in the software, bug reports, mailing lists, among others. Using data mining techniques, we can extract valuable knowledge of this set of information, thus providing improvements throughout the process of software development. The results can be used to improve the quality of software, or even to manage the project in order to obtain maximum efficiency. This article proposes the application of data mining techniques to cluster software projects, cites the advantages that can be obtained with these techniques, and illustrates the application of data mining in a Open Source Software database.

Article Details

How to Cite
Rezende, H. R., & Esmin, A. A. A. (2010). Proposed Application of Data Mining Techniques for Clustering Software Projects. INFOCOMP Journal of Computer Science, 9(6), 43–48. Retrieved from https://infocomp.dcc.ufla.br/index.php/infocomp/article/view/383
Section
Articles