Understanding the Archived Projects on GitHub

  • Xiaoya Xia
  • , Shengyu Zhao
  • , Xinran Zhang
  • , Zehua Lou
  • , Wei Wang*
  • , Fenglin Bi
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Scopus citations

Abstract

Open source software (OSS) on GitHub is experiencing continued growth and a rapid rise in the creation of projects. As OSS evolved, numerous projects faced decline, with some inevitably degrading into unmaintained status and getting archived by the owners. Understanding the deprecated projects helps deepen the perception of OSS maintenance and evolution. This paper describes the study of 361 popular GitHub projects that have been archived. By reading repository READMEs and sending surveys to project maintainers, we found the software repositories were archived due to being transferred, evolved, or unmaintained. We provide a set of 16 reasons and 10 practices to describe why and how these projects were archived. We further reveal the OSS development activity lifecycle patterns by fitting their commit history curves. We also confirmed the importance of the bus factor risk that would influence the sustainability of open source projects. Archiving a software repository is an explicit indication of repository deprecation. By providing the reason (the why), the strategy (the how), and the lifecycle patterns (the what) of the archived projects, we bring implications and practices that promote the health and sustainability of open source projects and the ecosystem overall.

Original languageEnglish
Title of host publicationProceedings - 2023 IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2023
EditorsTao Zhang, Xin Xia, Nicole Novielli
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages13-24
Number of pages12
ISBN (Electronic)9781665452786
DOIs
StatePublished - 2023
Event30th IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2023 - Macao, China
Duration: 21 Mar 202324 Mar 2023

Publication series

NameProceedings - 2023 IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2023

Conference

Conference30th IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2023
Country/TerritoryChina
CityMacao
Period21/03/2324/03/23

Keywords

  • Open Source Governance
  • Software Deprecation
  • Software Evolution

Fingerprint

Dive into the research topics of 'Understanding the Archived Projects on GitHub'. Together they form a unique fingerprint.

Cite this