Logo PUC-Rio Logo Maxwell
TRABALHOS DE FIM DE CURSO @PUC-Rio
Consulta aos Conteúdos
Título: DOCUMENT CLASSIFICATION AN APPROACH TO THE PROBLEM USING THE NAÏVE BAYES ALGORITHM
Autor(es): EDUARDO CRUZ MONTEIRO DE BARROS
Colaborador(es): RUY LUIZ MILIDIU - Orientador
Catalogação: 15/MAI/2009 Língua(s): PORTUGUESE - BRAZIL
Tipo: TEXT Subtipo: SENIOR PROJECT
Notas: [pt] Todos os dados constantes dos documentos são de inteira responsabilidade de seus autores. Os dados utilizados nas descrições dos documentos estão em conformidade com os sistemas da administração da PUC-Rio.
[en] All data contained in the documents are the sole responsibility of the authors. The data used in the descriptions of the documents are in conformity with the systems of the administration of PUC-Rio.
Referência(s): [pt] https://www.maxwell.vrac.puc-rio.br/projetosEspeciais/TFCs/consultas/conteudo.php?strSecao=resultado&nrSeq=13502@1
[en] https://www.maxwell.vrac.puc-rio.br/projetosEspeciais/TFCs/consultas/conteudo.php?strSecao=resultado&nrSeq=13502@2
DOI: https://doi.org/10.17771/PUCRio.acad.13502
Resumo:
This paper demonstrates the use of the Naive Bayes and SVM algorithms to solve the common problem in the information science: document classification/categorization. To achieve that, we demonstrate the mathematics fundaments of both algorithms. Those been the probability theory, the Bayes Theorem, the Naïve Bayes classifier and the concepts about SVMs. After that, we present a way of applying these algorithms to solve the main problem of the subject of this document. Every theory discussed will be sustained with simple examples in order to make the concept easier to understand. The obtained results of the experiments were as expected, which demonstrates the efficacy of the presented algorithms in solving the problem of text classification. We could also conclude that the determination of the training test is of fundamental importance to the accuracy of the results.
Descrição: Arquivo:   
COMPLETE PDF