面向开源协作数字生态的信息服务与数据挖掘

Translated title of the contribution: Data Mining and Information Service for Open Collaboration Digital Ecosystem

Xiaoya Xia, Shengyu Zhao, Fanyu Han, Fenglin Bi, Wei Wang*, Xuan Zhou, Aoying Zhou

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

1 Scopus citation

Abstract

The large-scale development and proliferation of open-source software have created an ecosystem for open-source development and collaboration. Within this ecosystem, individuals and organizations collaboratively develop high-quality software that is accessible to all. Social collaboration platforms, represented by GitHub, have further facilitated large-scale, distributed, and fine-grained code collaboration and technical social interaction. Every day, countless developers submit code, review code, report bugs, and propose new features on these platforms. The fully open collaborative development process therefore produces a vast amount of behavioral data, which holds immense value. This paper designs and implements OpenDigger, a one-stop data mining system for the open-source collaboration digital ecosystem. Its goal is to build data infrastructure for the open-source field and to promote the continuous development of the open-source ecosystem. The OpenDigger system consists primarily of a data collection module, a storage module, a tag data module, and an information service module, and it is built upon an OLAP columnar database and a graph database. The system continuously collects data from multiple sources within the open-source ecosystem and provides various types of open-source information services to different user groups through a unified interface. Additionally, OpenDigger mines key information from the open-source digital ecosystem from the perspective of collaborative relationship networks. Compared with traditional statistical indicators, the collaborative network perspective better illustrates the associations between open-source projects and developers.
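
The contrast the abstract draws between statistical indicators and a collaborative network can be illustrated with a minimal sketch. It is not OpenDigger's implementation: the event log, the function names, and the rule that two developers are linked when they are active in the same repository are all assumptions made for illustration.

```python
from collections import Counter, defaultdict
from itertools import combinations

# Hypothetical event log: (developer, repository, event type).
EVENTS = [
    ("alice", "repo-a", "commit"),
    ("alice", "repo-a", "review"),
    ("bob", "repo-a", "issue"),
    ("bob", "repo-b", "commit"),
    ("carol", "repo-b", "review"),
    ("dave", "repo-c", "commit"),
    ("dave", "repo-c", "commit"),
    ("dave", "repo-c", "commit"),
]


def activity_counts(events):
    """Statistical indicator: number of events per developer."""
    return Counter(dev for dev, _repo, _kind in events)


def collaboration_network(events):
    """Link two developers once for every repository they are both active in.

    Returns a Counter mapping (dev_a, dev_b) pairs to the number of shared
    repositories. This linking rule is an assumption of the sketch.
    """
    devs_by_repo = defaultdict(set)
    for dev, repo, _kind in events:
        devs_by_repo[repo].add(dev)

    edges = Counter()
    for devs in devs_by_repo.values():
        for a, b in combinations(sorted(devs), 2):
            edges[(a, b)] += 1
    return edges


def weighted_degree(edges):
    """Network indicator: total edge weight attached to each developer."""
    degree = Counter()
    for (a, b), weight in edges.items():
        degree[a] += weight
        degree[b] += weight
    return degree


if __name__ == "__main__":
    counts = activity_counts(EVENTS)
    degree = weighted_degree(collaboration_network(EVENTS))
    for dev in sorted(counts):
        print(dev, "events:", counts[dev], "collaboration degree:", degree[dev])
```

In this toy log, dave has the most events but no collaborators, while bob has fewer events but connects two repositories. A count-based indicator alone would not show that difference.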

Original language: Chinese (Traditional)
Pages (from-to): 187-195
Number of pages: 9
Journal: Computer Science
Volume: 51
Issue number: 10
State: Published - 15 Oct 2024
