Signed rearrangement distances considering repeated : genes, intergenic regions, and indels
Gabriel Siqueira, Alexsandro Oliveira Alexandrino, Zanoni Dias
ARTIGO
Inglês
Agradecimentos: This work was supported by the National Council of Technological and Scientific Development, CNPq (grant 202292/2020-7 ), the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior—Brasil (CAPES)—Finance Code 001, and the São Paulo Research Foundation, FAPESP (grants...
Ver mais
Agradecimentos: This work was supported by the National Council of Technological and Scientific Development, CNPq (grant 202292/2020-7 ), the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior—Brasil (CAPES)—Finance Code 001, and the São Paulo Research Foundation, FAPESP (grants 2013/08293-7, 2015/11937-9, 2017/12646-3, and 2021/13824-8). Funding: The authors have not disclosed any funding
Ver menos
Abstract: Genome rearrangement distance problems allow to estimate the evolutionary distance between genomes. These problems aim to compute the minimum number of mutations called rearrangement events necessary to transform one genome into another. Two commonly studied rearrangements are the...
Ver mais
Abstract: Genome rearrangement distance problems allow to estimate the evolutionary distance between genomes. These problems aim to compute the minimum number of mutations called rearrangement events necessary to transform one genome into another. Two commonly studied rearrangements are the reversal, which inverts a sequence of genes, and the transposition, which exchanges two consecutive sequences of genes. Seminal works on this topic focused on the sequence of genes and assumed that each gene occurs exactly once on each genome. More realistic models have been assuming that a gene may have multiple copies or may appear in only one of the genomes. Other models also take into account the nucleotides between consecutive pairs of genes, which are called intergenic regions. This work combines all these generalizations defining the signed intergenic reversal distance (SIRD), the signed intergenic reversal and transposition distance (SIRTD), the signed intergenic reversal and indels distance (SIRID), and the signed intergenic reversal, transposition, and indels distance (SIRTID) problems. We show a relation between these problems and the signed minimum common intergenic string partition (SMCISP) problem. From such relation, we derive-approximation algorithms for the SIRD and the SIRTD problems, where k is maximum number of copies of a gene in the genomes. These algorithms also work as heuristics for the SIRID and SIRTID problems. Additionally, we present some parametrized algorithms for SMCISP that ensure constant approximation factors for the distance problems. Our experimental tests on simulated genomes show an improvement on the rearrangement distances with the use of the partition algorithms
Ver menos
CONSELHO NACIONAL DE DESENVOLVIMENTO CIENTÍFICO E TECNOLÓGICO - CNPQ
202292/2020-7
COORDENAÇÃO DE APERFEIÇOAMENTO DE PESSOAL DE NÍVEL SUPERIOR - CAPES
001
FUNDAÇÃO DE AMPARO À PESQUISA DO ESTADO DE SÃO PAULO - FAPESP
2013/08293-7; 2015/11937-9; 2017/12646-3; 2021/13824-8
Fechado
Signed rearrangement distances considering repeated : genes, intergenic regions, and indels
Gabriel Siqueira, Alexsandro Oliveira Alexandrino, Zanoni Dias
Signed rearrangement distances considering repeated : genes, intergenic regions, and indels
Gabriel Siqueira, Alexsandro Oliveira Alexandrino, Zanoni Dias
Fontes
Journal of combinatorial optimization (Fonte avulsa) |