A learning-based module extraction method for object-oriented systems

Ural Erdemir*, Feza Buzluca

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

17 Citations (Scopus)

Abstract

Developers apply object-oriented (OO) design principles to produce modular, reusable software. Therefore, service-specific groups of related software classes called modules arise in OO systems. Extracting the modules is critical for better software comprehension, efficient architecture recovery, determination of service candidates to migrate legacy software to a service-oriented architecture, and transportation of such services to cloud-based distributed systems. In this study, we propose a novel approach to automatic module extraction to identify services in OO software systems. In our approach, first we create a weighted and directed graph of the software system in which vertices and edges represent the classes and their relations, respectively. Then, we apply a clustering algorithm over the graph to extract the modules. We calculate the weight of an edge by considering its probability of being within a module or between modules. To estimate these positional probabilities, we propose a machine-learning-based classification system that we train with data gathered from a real-world OO reference system. We have implemented an automatic module extraction tool and evaluated the proposed approach on several open-source and industrial projects. The experimental results show that the proposed approach generates highly accurate decompositions that are close to authoritative module structures and outperforms existing methods.

Original languageEnglish
Pages (from-to)156-177
Number of pages22
JournalJournal of Systems and Software
Volume97
DOIs
Publication statusPublished - 1 Nov 2014

Bibliographical note

Publisher Copyright:
© 2014 Elsevier Inc. All rights reserved.

Keywords

  • SOA
  • Software architecture recovery
  • Software modularization

Fingerprint

Dive into the research topics of 'A learning-based module extraction method for object-oriented systems'. Together they form a unique fingerprint.

Cite this