Automated topic naming

571

Software repositories provide a deluge of software artifacts to analyze. Researchers have attempted to summarize, categorize, and relate these artifacts by using semi-unsupervised machine-learning algorithms, such as Latent Dirichlet Allocation (LDA). LDA is used for concept and topic analysis to suggest candidate word-lists or topics that describe and relate software artifacts. However, these word-lists and topics are difficult to interpret in the absence of meaningful summary labels. Current attempts to interpret topics assume manual labelling and do not use domain-specific knowledge to improve, contextualize, or describe results for the developers. We propose a solution: automated labelled topic extraction. Topics are extracted using LDA from commit-log comments recovered from source control systems. These topics are given labels from a generalizable cross-project taxonomy, consisting of non-functional requirements. Our approach was evaluated with experiments and case studies on three large-scale Relational Database Management System (RDBMS) projects: MySQL, PostgreSQL and MaxDB. The case studies show that labelled topic extraction can produce appropriate, context-sensitive labels that are relevant to these projects, and provide fresh insight into their evolving software development activities.

DOWNLOAD PROJECT

Automated topic naming

Related

PLACE YOUR ADVERT HERE

DEPARTMENTS

LATEST PROJECTS

STUDIES ON SOME ASPECTS OF ANTHRACNOSE-BLIGHT-DIEBACK COMPLEX OF CULTIVARS OF GRAPEVINES (VITIS SPP.) IN...

GENETIC VARIABILITY STUDIES OF TWENTY POTATO GENOTYPES

RELATIONSHIP OF HAEMOGLOBIN AND POTASSIUM POLYMORPHISM WITH CONFORMATION, MILK PRODUCTION AND BLOOD BIOCHEMICAL PROFILES...

ADOPTION OF AGRICULTURAL INNOVATIONS AMONG MEMBERS AND NON-MEMBERS OF WOMEN CO-OPERATIVE SOCIETIES IN OJU...

SMALL FARMER CREDIT WITH PARTICULAR REFERENCE TO NIGERIA

DISCLAIMER

EDITOR PICKS

STUDIES ON SOME ASPECTS OF ANTHRACNOSE-BLIGHT-DIEBACK COMPLEX OF CULTIVARS OF GRAPEVINES...

GENETIC VARIABILITY STUDIES OF TWENTY POTATO GENOTYPES

RELATIONSHIP OF HAEMOGLOBIN AND POTASSIUM POLYMORPHISM WITH CONFORMATION, MILK PRODUCTION AND...

POPULAR POSTS

Accounting project topics

CIVIL SERVICE IN NIGERIA

TOP 5 BEST TRUSTED RESEARCH PROJECT TOPICS AND MATERIALS WEBSITE IN...

POPULAR CATEGORY

ACCOUNTING EDUCATION: A COMPARATIVE STUDY OF PERCEPTION OF BUSINESS AND NON BUSINESS STUDENTS OF...

STUDENTS’ PASSION FOR GRADES AT SECONDARY SCHOOL LEVEL IN PAKISTAN

PUBLIC RELATIONS AS A MANAGEMENT FUNCTION PROBLEMS AND IMPACT (A CASE STUDY OF THE...