A branch-and-cut algorithm for multiple sequence alignment

Abstract

We consider a branch-and-cut approach for solving the multiple sequence alignment problem, which is a central problem in computational biology. We propose a general model for this problem in which arbitrary gap costs are allowed. An interesting aspect of our approach is that the three (exponentially large) classes of natural valid inequalities that we consider turn out to be both facet-defining for the convex hull of integer solutions and separable in polynomial time. Both the proofs that these classes of valid inequalities are facet-defining and the description of the separation algorithms are far from trivial. Experimental results on several benchmark instances show that our method outperforms the best tools developed so far, in that it produces alignments that are better from a biological point of view. A noteworthy outcome of the results is the effectiveness of using branch-and-cut with only a carefully-selected subset of the variables as a heuristic.

Citation

[05+ACL] Althaus, E., Caprara, A., Lenhof, H.-P.,  Reinert, K. A branch-and-cut algorithm for multiple sequence alignment.Math. Program. 105, 387–425 (2006). https://doi.org/10.1007/s10107-005-0659-3
Read Publication