Título: | WORKLOAD BALANCING STRATEGIES FOR PARALLEL BLAST EVALUATION ON REPLICATED DATABASES AND PRIMARY FRAGMENTS | |||||||
Autor: |
DANIEL XAVIER DE SOUSA |
|||||||
Colaborador(es): |
SERGIO LIFSCHITZ - Orientador |
|||||||
Catalogação: | 07/ABR/2008 | Língua(s): | PORTUGUESE - BRAZIL |
|||||
Tipo: | TEXT | Subtipo: | THESIS | |||||
Notas: |
[pt] Todos os dados constantes dos documentos são de inteira responsabilidade de seus autores. Os dados utilizados nas descrições dos documentos estão em conformidade com os sistemas da administração da PUC-Rio. [en] All data contained in the documents are the sole responsibility of the authors. The data used in the descriptions of the documents are in conformity with the systems of the administration of PUC-Rio. |
|||||||
Referência(s): |
[pt] https://www.maxwell.vrac.puc-rio.br/projetosEspeciais/ETDs/consultas/conteudo.php?strSecao=resultado&nrSeq=11509&idi=1 [en] https://www.maxwell.vrac.puc-rio.br/projetosEspeciais/ETDs/consultas/conteudo.php?strSecao=resultado&nrSeq=11509&idi=2 |
|||||||
DOI: | https://doi.org/10.17771/PUCRio.acad.11509 | |||||||
Resumo: | ||||||||
A fundamental task in the area of computational biology is
the search
for relevant information within the large amount of
available data.
Among others, it is important to run tools such as BLAST -
Basic Local
Alignment Search Tool - effciently, which enables the
comparison of
biological sequences and discovery of homologies and other
related information.
However, the execution cost of BLAST is highly dependent on
the
database size, which has considerably increased. The
evaluation of BLAST
in distributed and parallel environments like PC clusters
has been largely
investigated in order to obtain better performances. This
work reports a
replicated allocation of the (sequences) database where
each copy is also
physically fragmented, with some fragments assigned as
primary. This way
we show that it is possible to execute BLAST with some nice
characteristics
of both replicated and fragmented conventional strategies,
like flexibility
and I/O parallelism. We propose two dynamic workload
balancing strategies
associated with this data allocation. We have adopted a non-
intrusive
approach, i.e., the BLAST code remains unchanged. These
methods are implemented
and practical results show that we achieve not only a
balanced
workload but also very good performances.
|
||||||||