Abstract
With advances in biology and computer science, the amount of bioinformatics data has grown rapidly. Because of the resulting demand for performance and for testing new algorithms, bioinformaticians must maintain efficient technological infrastructures. Adopting novel technologies is therefore necessary to cater to the increasing demands of the industry. Furthermore, it is imperative to increase the productivity of existing systems while executing the large jobs associated with the domain. Various scheduling techniques, ranging from classic First Come First Serve to recent cloud technologies such as MapReduce, can be used to execute these jobs in parallel. The work presented in this paper demonstrates an optimized platform to support the execution of various bioinformatics computations that deal with massively large datasets. The platform comprises a MapReduce model that adopts the multilevel feedback queue algorithm to schedule such large-scale, time-consuming jobs in parallel on a multicore processor. A broad comparison of common existing scheduling algorithms is conducted to identify the most suitable one. The paper also presents performance evaluation results for the proposed solution with a range of biological sequences and algorithms as inputs. In terms of time efficiency, the proposed solution achieves an 18x improvement over the general First Come First Serve algorithm when processing 1000 sequences, a 10x improvement at 10000 sequences, and a 3x improvement at 50000 sequences. Multilevel sequence alignment tools that are not optimized for GPU parallelism benefit most from our solution.
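The abstract names the two scheduling policies being compared but gives no implementation details. The following is only a minimal sketch of how First Come First Serve and a multilevel feedback queue differ, simulated on a single worker with all jobs arriving at time 0. It is not the paper's MapReduce platform; the function names `fcfs` and `mlfq`, the `quanta` values, and the job lengths are assumptions made for illustration.

```python
from collections import deque


def fcfs(bursts):
    """Completion time of each job under First Come First Serve."""
    clock, done = 0, []
    for burst in bursts:
        clock += burst          # each job runs to completion in arrival order
        done.append(clock)
    return done


def mlfq(bursts, quanta=(2, 4, 8)):
    """Completion time of each job under a simple multilevel feedback queue.

    Assumptions (illustrative only): every job starts in the top queue, a job
    that exhausts its quantum is demoted one level, and the lowest level is
    served round robin.
    """
    queues = [deque() for _ in quanta]
    for job_id, burst in enumerate(bursts):
        queues[0].append((job_id, burst))

    clock, done = 0, [0] * len(bursts)
    while any(queues):
        # Always serve the highest-priority non-empty queue.
        level = next(i for i, q in enumerate(queues) if q)
        job_id, remaining = queues[level].popleft()
        run = min(quanta[level], remaining)
        clock += run
        remaining -= run
        if remaining == 0:
            done[job_id] = clock
        else:
            # Demote the unfinished job (or keep it at the lowest level).
            lower = min(level + 1, len(queues) - 1)
            queues[lower].append((job_id, remaining))
    return done


if __name__ == "__main__":
    jobs = [20, 3, 2]  # hypothetical job lengths in arbitrary time units
    print("FCFS completion times:", fcfs(jobs))
    print("MLFQ completion times:", mlfq(jobs))
```

In this toy setting the short jobs no longer wait behind the long one, which is the general motivation for feedback queues. The speedups reported in the abstract come from the authors' own evaluation and are not reproduced by this sketch.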
| Original language | English |
|---|---|
| Title of host publication | 2018 18th International Conference on Advances in ICT for Emerging Regions (ICTer) |
| Place of Publication | New Jersey, U.S.A. |
| Publisher | Institute of Electrical and Electronics Engineers |
| Pages | 400-406 |
| Number of pages | 7 |
| ISBN (Electronic) | 978-1-5386-7352-2, 978-1-5386-7350-8 |
| ISBN (Print) | 978-1-5386-7353-9 |
| Publication status | Published - 2018 |
| Externally published | Yes |
| Event | 18th International Conference on Advances in ICT for Emerging Regions, ICTer 2018 - Colombo, Sri Lanka. Duration: 27 Sept 2018 → 28 Sept 2018 |
Publication series
| Name | International Conference on Advances in ICT for Emerging Regions |
|---|---|
| ISSN (Print) | 2377-6854 |
| ISSN (Electronic) | 2472-7598 |
Conference
| Conference | 18th International Conference on Advances in ICT for Emerging Regions, ICTer 2018 |
|---|---|
| Country/Territory | Sri Lanka |
| City | Colombo |
| Period | 27/09/18 → 28/09/18 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
- SDG 9: Industry, Innovation, and Infrastructure
Keywords
- Bioinformatics
- Job scheduling
- MapReduce
- Multi-level feedback queue
- Optimization
- Task scheduling
Fingerprint
Dive into the research topics of 'Efficient scheduling for scalable bioinformatics analysis platform with microservices'. Together they form a unique fingerprint.