A Three-Stage Algorithm for the Closest String Problem on Artificial and Real Gene Sequences

arXiv:2407.13023v1 Announce Type: new
Abstract: The Closest String Problem is an NP-hard problem that seeks a string whose maximum distance to the strings of a given set is as small as possible. Its applications include coding theory, computational biology, and the design of degenerate primers, among others. Efficient exact algorithms already reach high-quality solutions for binary sequences. However, there is still room for improvement in solution quality on DNA and protein sequences. In this paper, we introduce a three-stage algorithm. First, we apply a novel alphabet pruning method that reduces the search space so that promising search regions can be found effectively. Second, we employ a variant of beam search to find a heuristic solution; it uses a newly developed guiding function based on an expected distance heuristic score of partial solutions. Last, we apply a local search to improve the quality of the solution obtained from the beam search. Furthermore, because real-world benchmarks are lacking, we introduce two real-world datasets to verify the robustness of the method. Extensive experimental results show that the proposed method outperforms previous approaches from the literature.
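
As a rough illustration of the pipeline described in the abstract, the Python sketch below uses the standard formulation of the problem (minimize the largest Hamming distance to any input string) and chains a plain beam search with a single-substitution local search. It is not the authors' implementation. The alphabet pruning stage is omitted, and the paper's expected-distance guiding function is replaced by a simple stand-in that ranks prefixes by their worst and total prefix distances. All function names (`hamming`, `csp_cost`, `beam_search`, `local_search`) and the `beam_width` parameter are hypothetical, and equal-length input strings are assumed.

```python
from typing import Sequence


def hamming(a: str, b: str) -> int:
    """Number of positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def csp_cost(candidate: str, strings: Sequence[str]) -> int:
    """Closest String objective: the largest Hamming distance to any input."""
    return max(hamming(candidate, s) for s in strings)


def beam_search(strings: Sequence[str], alphabet: str, beam_width: int = 10) -> str:
    """Build a candidate left to right, keeping the best `beam_width` prefixes.

    Assumption: prefixes are ranked by (max, sum) of their prefix distances,
    a simple stand-in for the paper's expected-distance guiding function.
    """
    length = len(strings[0])
    beam = [""]
    for pos in range(length):
        # Extend every surviving prefix by every symbol of the alphabet.
        extended = [prefix + ch for prefix in beam for ch in alphabet]

        def score(prefix: str):
            dists = [hamming(prefix, s[: pos + 1]) for s in strings]
            return (max(dists), sum(dists))

        extended.sort(key=score)
        beam = extended[:beam_width]
    # All survivors are complete strings; return the one with the lowest cost.
    return min(beam, key=lambda c: csp_cost(c, strings))


def local_search(candidate: str, strings: Sequence[str], alphabet: str) -> str:
    """Accept single-character substitutions while they lower the cost."""
    best, best_cost = candidate, csp_cost(candidate, strings)
    improved = True
    while improved:
        improved = False
        for i in range(len(best)):
            for ch in alphabet:
                if ch == best[i]:
                    continue
                trial = best[:i] + ch + best[i + 1:]
                cost = csp_cost(trial, strings)
                if cost < best_cost:
                    best, best_cost, improved = trial, cost, True
    return best


if __name__ == "__main__":
    strings = ["ACGT", "ACGA", "TCGT"]  # toy input, not from the paper
    start = beam_search(strings, "ACGT", beam_width=10)
    refined = local_search(start, strings, "ACGT")
    print(refined, csp_cost(refined, strings))
```

A wider beam explores more prefixes at higher cost per position. The local search only ever accepts strictly improving moves, so it can stop in a local optimum. A stronger guiding function is the usual way to reduce that risk.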
