A Three-Stage Algorithm for the Closest String Problem on Artificial and Real Gene Sequences

Date:

arXiv:2407.13023v1 Announce Type: new
Abstract: The Closest String Problem is an NP-hard problem that aims to find a string that has the minimum distance from all sequences that belong to the given set of strings. Its applications can be found in coding theory, computational biology, and designing degenerated primers, among others. There are efficient exact algorithms that have reached high-quality solutions for binary sequences. However, there is still room for improvement concerning the quality of solutions over DNA and protein sequences. In this paper, we introduce a three-stage algorithm that comprises the following process: first, we apply a novel alphabet pruning method to reduce the search space for effectively finding promising search regions. Second, a variant of beam search to find a heuristic solution is employed. This method utilizes a newly developed guiding function based on an expected distance heuristic score of partial solutions. Last, we introduce a local search to improve the quality of the solution obtained from the beam search. Furthermore, due to the lack of real-world benchmarks, two real-world datasets are introduced to verify the robustness of the method. The extensive experimental results show that the proposed method outperforms the previous approaches from the literature.

Share post:

Subscribe

Popular

More like this
Related

RBR50 요약 : 로봇 공학 혁신에 대한 스포트라이트

로봇 보고서 팟 캐스트 · RBR50 요약 : 로봇...

Picknik의 MoveitPro와 함께 haptic 컨트롤러를 제공하는 거친 로봇 공학

Haply Robotics의 Inverse3 시스템을 통해 운영자는 실시간 힘 피드백을받는...

웹 세미나의 AI 진보를 설명하는 로봇 피킹 전문가

Ambi, ABB 및 Plus One 은이 무료 웹 세미나에서...

비디오 금요일 : RIVR은 패키지를 제공합니다

Video Friday는 친구가 수집 한 주별 멋진 로봇 비디오입니다....