Logo PUC-Rio Logo Maxwell
TRABALHOS DE FIM DE CURSO @PUC-Rio
Consulta aos Conteúdos
Título: APPLYING DATA SCIENCE TO THE FUEL THEFT PROBLEM ON PIPELINES
Autor(es): RACHEL MARTINS VENTRIGLIA
Colaborador(es): SILVIO HAMACHER - Orientador
LEILA FIGUEIREDO DANTAS - Coorientador
Catalogação: 14/FEV/2022 Língua(s): ENGLISH - UNITED STATES
Tipo: TEXT Subtipo: SENIOR PROJECT
Notas: [pt] Todos os dados constantes dos documentos são de inteira responsabilidade de seus autores. Os dados utilizados nas descrições dos documentos estão em conformidade com os sistemas da administração da PUC-Rio.
[en] All data contained in the documents are the sole responsibility of the authors. The data used in the descriptions of the documents are in conformity with the systems of the administration of PUC-Rio.
Referência(s): [pt] https://www.maxwell.vrac.puc-rio.br/projetosEspeciais/TFCs/consultas/conteudo.php?strSecao=resultado&nrSeq=57397@1
[en] https://www.maxwell.vrac.puc-rio.br/projetosEspeciais/TFCs/consultas/conteudo.php?strSecao=resultado&nrSeq=57397@2
DOI: https://doi.org/10.17771/PUCRio.acad.57397
Resumo:
Fuel theft is a concern faced by various countries and oil and gas companies due to its impact on the environment and the safety of communities near oil pipelines. Thus, monitoring and inspecting the location of the oil pipelines through alerts is essential to prevent suspected fuel theft events and mitigate risks. Alerts are triggered by monitoring systems, and patrols are sent to verify the location and confirm the occurrence of illegal tapping. However, various signals can be activated in a short period, and their correct prioritization is essential to identify illegal tapping as quickly as possible. This work aims to use data science and machine learning techniques to perform a predictive model capable of forecasting the probability of an event resulting in an illegal tapping and understanding the factors that influence the occurrence. A Brazilian oil and gas transportation company provided data from a monitoring system supervised from January 2019 to August 2021. We used Four machine learning algorithms: Logistic Regression, Random Forest, XGBoost, and Catboost. The Random Forest obtained the best results in classifying alerts associated (or not) with an illegal tapping., showing accuracy and specificity of 78.6% and 68.3%, respectively. In this problem, specificity means that it is possible to reduce the sending of patrols on the field in 68.3% of cases. As for the external validation, the model also showed a good performance, with an accuracy and specificity of 61%. The high duration of the alert, the high history of illegal tapping, and the occurrence during the night are the ones that most influence illegal tapping occurrence.
Descrição: Arquivo:   
COMPLETE PDF