跳到主要导航 跳到搜索 跳到主要内容

LASCA: A Large-Scale Stable Customer Segmentation Approach to Credit Risk Assessment

  • Yongfeng Gu
  • , Yupeng Wu
  • , Huakang Lu
  • , Xingyu Lu*
  • , Hong Qian*
  • , Jun Zhou*
  • , Aimin Zhou
  • *此作品的通讯作者
  • Ant Group
  • East China Normal University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Customer segmentation plays a crucial role in credit risk assessment by dividing users into specific risk levels based on their credit scores. Previous methods fail to comprehensively consider the stability in the segmentation process, resulting in frequent changes and inconsistencies in users' risk levels over time. This increases potential risks to a company. To this end, this paper at first introduces and formalizes the concept of stability regret in the segmentation process. However, evaluating stability is challenging due to its black-box nature and the computational burden posed by vast user data sets. To address these challenges, this paper proposes a large-scale stable customer segmentation approach named LASCA. LASCA consists of two phases: high-quality dataset construction (HDC) and reliable data-driven optimization (RDO). Specifically, HDC utilizes an evolutionary algorithm to collect high-quality binning solutions. RDO subsequently builds a reliable surrogate model to search for the most stable binning solution based on the collected dataset. Extensive experiments conducted on real-world large-scale datasets (up to 0.8 billion) show that LASCA surpasses the state-of-the-art binning methods in finding the most stable binning solution. Notably, HDC greatly enhances data quality by 50%. RDO efficiently discovers more stable binning solutions with a 36% improvement in stability, accelerating the optimization process by 25 times via data-driven evaluation. Currently, LASCA has been successfully deployed in the large-scale credit risk assessment system of Alipay.

源语言英语
主期刊名KDD 2024 - Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
出版商Association for Computing Machinery
5006-5017
页数12
ISBN(电子版)9798400704901
DOI
出版状态已出版 - 24 8月 2024
活动30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2024 - Barcelona, 西班牙
期限: 25 8月 202429 8月 2024

出版系列

姓名Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
ISSN(印刷版)2154-817X

会议

会议30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2024
国家/地区西班牙
Barcelona
时期25/08/2429/08/24

指纹

探究 'LASCA: A Large-Scale Stable Customer Segmentation Approach to Credit Risk Assessment' 的科研主题。它们共同构成独一无二的指纹。

引用此