A Three-Stage Algorithm for the Closest String Problem on Artificial and Real Gene Sequences

arXiv:2407.13023v1
Abstract: The Closest String Problem is an NP-hard problem that asks for a string whose maximum distance to the strings of a given set is as small as possible. Its applications include coding theory, computational biology, and the design of degenerate primers, among others. Efficient exact algorithms already reach high-quality solutions for binary sequences. However, there is still room for improvement in solution quality on DNA and protein sequences. In this paper, we introduce a three-stage algorithm. First, we apply a novel alphabet pruning method that reduces the search space so that promising search regions can be found effectively. Second, we employ a variant of beam search to find a heuristic solution; this method uses a newly developed guiding function based on an expected distance heuristic score of partial solutions. Last, we introduce a local search to improve the quality of the solution obtained from the beam search. Furthermore, because real-world benchmarks are lacking, we introduce two real-world datasets to verify the robustness of the method. Extensive experimental results show that the proposed method outperforms previous approaches from the literature.
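
To make the three stages concrete, here is a minimal Python sketch of the general pipeline shape, not the paper's implementation. It assumes equal-length input strings and Hamming distance (the usual formulation of the problem). All function names are hypothetical. The pruning step simply restricts each position to characters that occur in that column, and the beam ranks partial solutions by their worst and total mismatch counts; both are simple stand-ins for the paper's alphabet pruning method and expected-distance guiding function, which the abstract does not specify.

```python
def hamming(a, b):
    """Number of positions at which two equal-length sequences differ."""
    return sum(x != y for x, y in zip(a, b))


def max_distance(candidate, strings):
    """Closest String objective: the worst-case distance to any input string."""
    return max(hamming(candidate, s) for s in strings)


def beam_search(strings, beam_width=10):
    """Build a solution left to right, keeping the best `beam_width` prefixes."""
    m = len(strings[0])
    # Each beam entry: (prefix, mismatch count against each input string).
    beam = [("", (0,) * len(strings))]
    for j in range(m):
        column = [s[j] for s in strings]
        # Stand-in pruning (assumption): only try characters seen in this column.
        chars = sorted(set(column))
        extensions = []
        for prefix, dists in beam:
            for c in chars:
                new_dists = tuple(d + (c != sc) for d, sc in zip(dists, column))
                extensions.append((prefix + c, new_dists))
        # Stand-in guiding function (assumption): worst mismatch, then total.
        extensions.sort(key=lambda e: (max(e[1]), sum(e[1])))
        beam = extensions[:beam_width]
    return beam[0][0]


def local_search(candidate, strings):
    """Single-character substitutions, accepted only if the objective improves."""
    alphabet = sorted(set("".join(strings)))
    best = list(candidate)
    best_score = max_distance(best, strings)
    improved = True
    while improved:
        improved = False
        for j in range(len(best)):
            keep = best[j]
            for c in alphabet:
                if c == keep:
                    continue
                best[j] = c
                score = max_distance(best, strings)
                if score < best_score:
                    best_score, keep, improved = score, c, True
            best[j] = keep  # restore the best character found for position j
    return "".join(best)


def closest_string(strings, beam_width=10):
    """Beam search followed by local search refinement."""
    return local_search(beam_search(strings, beam_width), strings)
```

For example, `closest_string(["ACGT", "ACGA", "ACCT"])` explores all four column-consistent candidates within the default beam width. The local search only ever accepts strict improvements, so it cannot return a solution worse than the one the beam search produced.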
