跳到主要导航 跳到搜索 跳到主要内容

ChameleonAPI: Automatic and Efficient Customization of Neural Networks for ML Applications

  • Yuhan Liu
  • , Chengcheng Wan*
  • , Kuntai Du
  • , Henry Hoffmann
  • , Junchen Jiang
  • , Shan Lu
  • , Michael Maire
  • *此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

ML APIs have greatly relieved application developers of the burden to design and train their own neural network models—classifying objects in an image can now be as simple as one line of Python code to call an API. However, these APIs offer the same pre-trained models regardless of how their output is used by different applications. This can be suboptimal as not all ML inference errors can cause application failures, and the distinction between inference errors that can or cannot cause failures varies greatly across applications. To tackle this problem, we first study 77 real-world applications, which collectively use six ML APIs from two providers, to reveal common patterns of how ML API output affects applications’ decision processes. Inspired by the findings, we propose ChameleonAPI, an optimization framework for ML APIs, which takes effect without changing the application source code. ChameleonAPI provides application developers with a parser that automatically analyzes the application to produce an abstract of its decision process, which is then used to devise an application-specific loss function that only penalizes API output errors critical to the application. ChameleonAPI uses the loss function to efficiently train a neural network model customized for each application and deploys it to serve API invocations from the respective application via existing interface. Compared to a baseline that selects the best-of-all commercial ML API, we show that ChameleonAPI reduces incorrect application decisions by 43%.

源语言英语
主期刊名Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2024
出版商USENIX Association
365-386
页数22
ISBN(电子版)9781939133403
出版状态已出版 - 2024
活动18th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2024 - Santa Clara, 美国
期限: 10 7月 202412 7月 2024

出版系列

姓名Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2024

会议

会议18th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2024
国家/地区美国
Santa Clara
时期10/07/2412/07/24

指纹

探究 'ChameleonAPI: Automatic and Efficient Customization of Neural Networks for ML Applications' 的科研主题。它们共同构成独一无二的指纹。

引用此