ETDs

Estatística

Título:

GRADVEC: EXPLORING GRADIENT FEATURES FOR IMPROVED OUT-OF-DISTRIBUTION DETECTION IN DEEP LEARNING

Autor:

THIAGO MEDEIROS CARVALHO

Colaborador(es):

MARLEY MARIA BERNARDES REBUZZI VELLASCO - Orientador
JOSE FRANCO MACHADO DO AMARAL - Coorientador

Catalogação:

23/JUN/2025

Língua(s):

ENGLISH - UNITED STATES

Tipo:

TEXT

Subtipo:

THESIS

Notas:

[pt] Todos os dados constantes dos documentos são de inteira responsabilidade de seus autores. Os dados utilizados nas descrições dos documentos estão em conformidade com os sistemas da administração da PUC-Rio.
[en] All data contained in the documents are the sole responsibility of the authors. The data used in the descriptions of the documents are in conformity with the systems of the administration of PUC-Rio.

Referência(s):

[pt] https://www.maxwell.vrac.puc-rio.br/projetosEspeciais/ETDs/consultas/conteudo.php?strSecao=resultado&nrSeq=71193&idi=1
[en] https://www.maxwell.vrac.puc-rio.br/projetosEspeciais/ETDs/consultas/conteudo.php?strSecao=resultado&nrSeq=71193&idi=2

DOI:

https://doi.org/10.17771/PUCRio.acad.71193

Resumo:

Deploying Deep Learning models in real-world requires considerations that are generally overlooked during training. In real-world scenarios, where the input data cannot be controlled, it is important for a model to identify when a sample does not belong to any known class. This issue is accomplished using out-of-distribution (OOD) detection, a technique designed to distinguish unk nown samples from those in the in-distribution classes. These methods mainly rely on model output scores or intermediate features to calculate OOD scores. However, the gradient space remains underexplored for this task due to the complexity involved in generating a suitable gradient representation. However, the gradient space is generally known for providing a richer representation of the model s knowledge or uncertainty regarding the expected output for the input sample, which is fully aligned with what is expected for an OOD detection task. Based on this idea, in this work we propose a new family of methods using gradient features, named GradVec, which uses the gradient vector as input representation for different OOD detection methods. The main idea is that the model gradient presents, in a more informative way, the knowledge that a sample belongs to a known class, being able to distinguish it from other unknown ones. GradVec methods do not change the model training procedure, with no additional data needed to adjust the OOD detector, and it is applica ble to any pre-trained model. We evaluated our model in different tasks, such as image classification, text classification, and semantic segmentation. Our ap proach presents superior results in different scenarios for OOD detection in image classification and sentiment analysis, reducing FPR95 by 88.17 percent and 56.91 percent, respectively.

Descrição:			Arquivo:
COMPLETE			PDF