跳到主要导航 跳到搜索 跳到主要内容

DSA: Scalable distributed sequence alignment system using SIMD Instructions

  • Bo Xu
  • , Changlong Li
  • , Hang Zhuang
  • , Jiali Wang
  • , Qingfeng Wang
  • , Jinhong Zhou
  • , Xuehai Zhou
  • University of Science and Technology of China

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Sequence alignment algorithms are a basic and critical component of many bioinformatics fields. With rapid development of sequencing technology, the fast growing reference database volumes and longer length of query sequence become new challenges for sequence alignment. However, the algorithms have prohibitively high time and space complexity. In this paper, we present DSA, a scalable distributed sequence alignment system that employs Apache Spark to process sequences data in a horizontally scalable distributed environment, and leverages data parallel strategy based on Single Instruction Multiple Data (SIMD) instruction to parallelize the algorithms in each core of worker node. The experimental results demonstrate that 1) DSA has outstanding performance and achieves up to 201x speedup over SparkSW. 2) DSA has excellent scalability and achieves near linear speedup when increasing the number of nodes in cluster.

源语言英语
主期刊名Proceedings - 2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGRID 2017
出版商Institute of Electrical and Electronics Engineers Inc.
758-761
页数4
ISBN(电子版)9781509066100
DOI
出版状态已出版 - 10 7月 2017
已对外发布
活动17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGRID 2017 - Madrid, 西班牙
期限: 14 5月 201717 5月 2017

出版系列

姓名Proceedings - 2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGRID 2017

会议

会议17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGRID 2017
国家/地区西班牙
Madrid
时期14/05/1717/05/17

指纹

探究 'DSA: Scalable distributed sequence alignment system using SIMD Instructions' 的科研主题。它们共同构成独一无二的指纹。

引用此