跳到主要导航 跳到搜索 跳到主要内容

Exploring Activity and Contributors on GitHub: Who, What, When, and Where

  • Xiaoya Xia*
  • , Zhenjie Weng
  • , Wei Wang
  • , Shengyu Zhao
  • *此作品的通讯作者
  • East China Normal University
  • Tongji University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Apart from being a code hosting platform, GitHub is the place where large-scale open collaborations and contributions happen. Every minute, thousands of developers are submitting code, having discussions of issues or pull requests, with all user behaviors recorded in the GitHub Event Stream (GES). Exploration of the activities in the GES could help understand who is active, the way they work, the time when they are active and even their location. To this end, a large-scale analysis was initially performed based on the 0.86 billion event records generated in 2020. We extracted 902K active contributors out of 14 million GitHub accounts by observing their activity distribution, then explored their behavior distribution, active time in the day and week, and estimated time zone distributions on the basis of their circadian activity rhythm. To go deeper, a case study of 79 projects in CNCF and contrast analyses of different project maturity levels were conducted. Our results showed that from a macro perspective, bots are increasingly more active and can serve numerous projects. Contributors work on weekdays, and are globally more inclined toward the daytime working hours in the Americas and Europe. The time zone distribution also reveals that UTC+2 and UTC-4 have the most active contributors. A critical discovery was the validation and quantification of a high bus factor risk exists in the OSS ecosystem. Whether from a large group point of view or within specific projects, a rather small group of OSS contributors (less than 20%) undertook the majority of the work. The GES can provide a wealth of information about open source software (OSS). Our findings provide insights into global GitHub collaboration behaviors and may be of help for researchers and practitioners to further understand modern OSS ecosystem.

源语言英语
主期刊名Proceedings - 2022 29th Asia-Pacific Software Engineering Conference, APSEC 2022
出版商IEEE Computer Society
11-20
页数10
ISBN(电子版)9781665455374
DOI
出版状态已出版 - 2022
活动29th Asia-Pacific Software Engineering Conference, APSEC 2022 - Virtual, Online, 日本
期限: 6 12月 20229 12月 2022

出版系列

姓名Proceedings - Asia-Pacific Software Engineering Conference, APSEC
2022-December
ISSN(印刷版)1530-1362

会议

会议29th Asia-Pacific Software Engineering Conference, APSEC 2022
国家/地区日本
Virtual, Online
时期6/12/229/12/22

指纹

探究 'Exploring Activity and Contributors on GitHub: Who, What, When, and Where' 的科研主题。它们共同构成独一无二的指纹。

引用此