Título: | APPLYING DATA SCIENCE TO THE FUEL THEFT PROBLEM ON PIPELINES | ||||||||||||
Autor(es): |
RACHEL MARTINS VENTRIGLIA |
||||||||||||
Colaborador(es): |
SILVIO HAMACHER - Orientador LEILA FIGUEIREDO DANTAS - Coorientador |
||||||||||||
Catalogação: | 14/FEV/2022 | Língua(s): | ENGLISH - UNITED STATES |
||||||||||
Tipo: | TEXT | Subtipo: | SENIOR PROJECT | ||||||||||
Notas: |
[pt] Todos os dados constantes dos documentos são de inteira responsabilidade de seus autores. Os dados utilizados nas descrições dos documentos estão em conformidade com os sistemas da administração da PUC-Rio. [en] All data contained in the documents are the sole responsibility of the authors. The data used in the descriptions of the documents are in conformity with the systems of the administration of PUC-Rio. |
||||||||||||
Referência(s): |
[pt] https://www.maxwell.vrac.puc-rio.br/projetosEspeciais/TFCs/consultas/conteudo.php?strSecao=resultado&nrSeq=57397@1 [en] https://www.maxwell.vrac.puc-rio.br/projetosEspeciais/TFCs/consultas/conteudo.php?strSecao=resultado&nrSeq=57397@2 |
||||||||||||
DOI: | https://doi.org/10.17771/PUCRio.acad.57397 | ||||||||||||
Resumo: | |||||||||||||
Fuel theft is a concern faced by various countries and oil and gas companies due to its impact
on the environment and the safety of communities near oil pipelines. Thus, monitoring and
inspecting the location of the oil pipelines through alerts is essential to prevent suspected fuel
theft events and mitigate risks. Alerts are triggered by monitoring systems, and patrols are sent
to verify the location and confirm the occurrence of illegal tapping. However, various signals
can be activated in a short period, and their correct prioritization is essential to identify illegal
tapping as quickly as possible. This work aims to use data science and machine learning
techniques to perform a predictive model capable of forecasting the probability of an event
resulting in an illegal tapping and understanding the factors that influence the occurrence. A
Brazilian oil and gas transportation company provided data from a monitoring system
supervised from January 2019 to August 2021. We used Four machine learning algorithms:
Logistic Regression, Random Forest, XGBoost, and Catboost. The Random Forest obtained the
best results in classifying alerts associated (or not) with an illegal tapping., showing accuracy
and specificity of 78.6% and 68.3%, respectively. In this problem, specificity means that it is
possible to reduce the sending of patrols on the field in 68.3% of cases. As for the external
validation, the model also showed a good performance, with an accuracy and specificity of
61%. The high duration of the alert, the high history of illegal tapping, and the occurrence
during the night are the ones that most influence illegal tapping occurrence.
|
|||||||||||||
|