New methods for biological sequence alignment

Biological sequence alignment is a fundamental technique in bioinformatics, and consists of iden- tifying series of similar (conserved) characters that appear in the same order in both sequences, and eventually deducing a set of modifications (substitutions, insertions and deletions) involved in the transformation of one sequence into the other. This technique allows one to infer, based on sequence similarity, if two or more biological sequences are potentially homologous, i.e. if they share a common ancestor, thus enabling the understanding of sequence evolution. This thesis addresses sequence comparison problems in two different contexts: homology detection and high throughput DNA sequencing. The goal of this work is to develop sensitive alignment methods that provide solutions to the following two problems: i) the detection of hidden protein homologies by protein sequence comparison, when the source of the divergence are frameshift mutations, and ii) mapping short SOLiD reads (sequences of overlapping di- nucleotides encoded as colors) to a reference genome. In both cases, the same general idea is applied: to implicitly compare DNA sequences for detecting changes occurring at this level, while manipulating, in practice, other representations (protein sequences, sequences of di-nucleotide codes) that provide additional information and thus help to improve the similarity search. The aim is to design and implement exact and heuristic alignment methods, along with scoring schemes, adapted to these scenarios.

Data and Resources

Additional Info

Field Value
Source https://theses.hal.science/tel-00833311
Author Gîrdea, Marta, L.
Maintainer CCSD
Last Updated May 10, 2026, 18:17 (UTC)
Created May 10, 2026, 18:17 (UTC)
Identifier tel-00833311
Language en
Rights https://about.hal.science/hal-authorisation-v1/
contributor Laboratoire d'Informatique Fondamentale de Lille (LIFL) ; Université de Lille, Sciences et Technologies-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lille, Sciences Humaines et Sociales-Centre National de la Recherche Scientifique (CNRS)
creator Gîrdea, Marta, L.
date 2010-12-10T00:00:00
harvest_object_id e5aaa3da-0e84-448e-a2c2-9b6c9a23a6fd
harvest_source_id 3374d638-d20b-4672-ba96-a23232d55657
harvest_source_title test moissonnage SELUNE
metadata_modified 2025-02-26T00:00:00
set_spec type:THESE