Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Databases and Information Systems

Research Collection School Of Computing and Information Systems

2015

Software information site

Articles 1 - 2 of 2

Full-Text Articles in Physical Sciences and Mathematics

Tagcombine: Recommending Tags To Contents In Software Information Sites, Xin Yu Wang, Xin Xia, David Lo Sep 2015

Tagcombine: Recommending Tags To Contents In Software Information Sites, Xin Yu Wang, Xin Xia, David Lo

Research Collection School Of Computing and Information Systems

Nowadays, software engineers use a variety of online media to search and become informed of new and interesting technologies, and to learn from and help one another. We refer to these kinds of online media which help software engineers improve their performance in software development, maintenance, and test processes as software information sites. In this paper, we propose TagCombine, an automatic tag recommendation method which analyzes objects in software information sites. TagCombine has three different components: 1) multi-label ranking component which considers tag recommendation as a multi-label learning problem; 2) similarity-based ranking component which recommends tags from similar objects; 3) …


Multi-Factor Duplicate Question Detection In Stack Overflow, Yun Zhang, David Lo, Xin Xia, Jian Ling Sun Sep 2015

Multi-Factor Duplicate Question Detection In Stack Overflow, Yun Zhang, David Lo, Xin Xia, Jian Ling Sun

Research Collection School Of Computing and Information Systems

Stack Overflow is a popular on-line question and answer site for software developers to share their experience and expertise. Among the numerous questions posted in Stack Overflow, two or more of them may express the same point and thus are duplicates of one another. Duplicate questions make Stack Overflow site maintenance harder, waste resources that could have been used to answer other questions, and cause developers to unnecessarily wait for answers that are already available. To reduce the problem of duplicate questions, Stack Overflow allows questions to be manually marked as duplicates of others. Since there are thousands of questions …