Logo PUC-Rio Logo Maxwell
TRABALHOS DE FIM DE CURSO @PUC-Rio
Consulta aos Conteúdos
Estatística
Título: 3D POSE ESTIMATION AND TRACKING IN REAL TIME FROM MULTIPLE CAMERAS
Autor(es): MATHEUS MELLO DE SOUZA MENDES
YURI VELASQUEZ RIVAS DA SILVA MOREIRA
Colaborador(es): WOUTER CAARLS - Orientador
Catalogação: 13/JUL/2023 Língua(s): PORTUGUESE - BRAZIL
Tipo: TEXT Subtipo: SENIOR PROJECT
Notas: [pt] Todos os dados constantes dos documentos são de inteira responsabilidade de seus autores. Os dados utilizados nas descrições dos documentos estão em conformidade com os sistemas da administração da PUC-Rio.
[en] All data contained in the documents are the sole responsibility of the authors. The data used in the descriptions of the documents are in conformity with the systems of the administration of PUC-Rio.
Referência(s): [pt] https://www.maxwell.vrac.puc-rio.br/projetosEspeciais/TFCs/consultas/conteudo.php?strSecao=resultado&nrSeq=63208@1
[en] https://www.maxwell.vrac.puc-rio.br/projetosEspeciais/TFCs/consultas/conteudo.php?strSecao=resultado&nrSeq=63208@2
DOI: https://doi.org/10.17771/PUCRio.acad.63208
Resumo:
This work focuses on the development of a real-time 3D pose estimation system for tracking human movements. Pose estimation is vital in computer vision and has applications in entertainment, sports, healthcare and robotics. The challenge of estimating 3D poses involves identifying the body s major joints and their spatial relationships, creating a skeletal representation of human figures, which is complicated by factors such as occlusions, variations in camera point of view, and close interactions between individuals. The project proposes a low cost solution even though this may result in a decrease in overall system accuracy, it provides an alternative to industry standard systems used in motion capture. We used an arbitrary number of cameras (n≥4) to minimize projection and tracking errors caused by target occlusions. The approach adopted in this project involves a reconstruction logic, as seen in fig 1-a. The Python programming language is used for image processing, with the help of the OpenCV library for 3D pose reconstruction. The camera chosen for the project is Intelbras VIP1430 B G2, and Google s MediaPipe open source implementation is used for 2D pose estimation. The limitations of this approach and possible improvements are also discussed, providing a solid basis for future research and applications in the field.
Descrição: Arquivo:   
COMPLETE PDF