Título: | GRADVEC: EXPLORING GRADIENT FEATURES FOR IMPROVED OUT-OF-DISTRIBUTION DETECTION IN DEEP LEARNING | ||||||||||||
Autor: |
THIAGO MEDEIROS CARVALHO |
||||||||||||
Colaborador(es): |
MARLEY MARIA BERNARDES REBUZZI VELLASCO - Orientador JOSE FRANCO MACHADO DO AMARAL - Coorientador |
||||||||||||
Catalogação: | 23/JUN/2025 | Língua(s): | ENGLISH - UNITED STATES |
||||||||||
Tipo: | TEXT | Subtipo: | THESIS | ||||||||||
Notas: |
[pt] Todos os dados constantes dos documentos são de inteira responsabilidade de seus autores. Os dados utilizados nas descrições dos documentos estão em conformidade com os sistemas da administração da PUC-Rio. [en] All data contained in the documents are the sole responsibility of the authors. The data used in the descriptions of the documents are in conformity with the systems of the administration of PUC-Rio. |
||||||||||||
Referência(s): |
[pt] https://www.maxwell.vrac.puc-rio.br/projetosEspeciais/ETDs/consultas/conteudo.php?strSecao=resultado&nrSeq=71193&idi=1 [en] https://www.maxwell.vrac.puc-rio.br/projetosEspeciais/ETDs/consultas/conteudo.php?strSecao=resultado&nrSeq=71193&idi=2 |
||||||||||||
DOI: | https://doi.org/10.17771/PUCRio.acad.71193 | ||||||||||||
Resumo: | |||||||||||||
Deploying Deep Learning models in real-world requires considerations
that are generally overlooked during training. In real-world scenarios, where
the input data cannot be controlled, it is important for a model to identify when
a sample does not belong to any known class. This issue is accomplished using
out-of-distribution (OOD) detection, a technique designed to distinguish unk
nown samples from those in the in-distribution classes. These methods mainly
rely on model output scores or intermediate features to calculate OOD scores.
However, the gradient space remains underexplored for this task due to the
complexity involved in generating a suitable gradient representation. However,
the gradient space is generally known for providing a richer representation of
the model s knowledge or uncertainty regarding the expected output for the
input sample, which is fully aligned with what is expected for an OOD detection task. Based on this idea, in this work we propose a new family of methods
using gradient features, named GradVec, which uses the gradient vector as
input representation for different OOD detection methods. The main idea is
that the model gradient presents, in a more informative way, the knowledge
that a sample belongs to a known class, being able to distinguish it from other
unknown ones. GradVec methods do not change the model training procedure,
with no additional data needed to adjust the OOD detector, and it is applica
ble to any pre-trained model. We evaluated our model in different tasks, such
as image classification, text classification, and semantic segmentation. Our ap
proach presents superior results in different scenarios for OOD detection in
image classification and sentiment analysis, reducing FPR95 by 88.17 percent and
56.91 percent, respectively.
|
|||||||||||||
|