Arquitectura multi-agente descentralizada para detección de eventos en el ambiente
| dc.contributor.advisor | Martínez Vásquez, David Alejandro | |
| dc.contributor.author | Castaño Ortiz, Daniel Fernando | |
| dc.contributor.corporatename | Universidad Santo Tomás | |
| dc.contributor.cvlac | https://scienti.minciencias.gov.co/cvlac/visualizador/generarCurriculoCv.do?cod_rh=0001560096 | |
| dc.contributor.cvlac | https://scienti.minciencias.gov.co/cvlac/visualizador/generarCurriculoCv.do?cod_rh=0002173917 | |
| dc.contributor.googlescholar | https://scholar.google.com/citations?user=U5Qf1nUAAAAJ&hl=es&oi=ao | |
| dc.contributor.orcid | https://orcid.org/0000-0001-9750-2653 | |
| dc.contributor.orcid | https://orcid.org/0009-0003-2216-0937 | |
| dc.date.accessioned | 2026-04-20T15:12:57Z | |
| dc.date.available | 2026-04-20T15:12:57Z | |
| dc.date.issued | 2026-04-20 | |
| dc.description | La detección de eventos en el ambiente constituye un reto central en la robótica moderna, especialmente en sistemas que buscan autonomía, cooperación y toma de decisiones distribuidas. Los sistemas multi-agente descentralizados (MAS) surgen como una alternativa eficiente frente a los enfoques centralizados, ya que ofrecer mayor escalabilidad, robustez y tolerancia a fallos. Este trabajo propone el diseño e implementación de una arquitectura multi-agente descentralizada basada en ROS 2 para la detección de eventos en el ambiente, utilizando plataformas móviles TurtleBot3 Burger y aprendizaje por refuerzo profundo. La investigación se desarrolló con un enfoque experimental y aplicado, estructurado en tres fases: (1) análisis y selección de variables físicas, algoritmos y plataforma robótica; (2) desarrollo de la arquitectura descentralizada, integrando un agente líder entrenado mediante el algoritmo Deep Q-Learning (DQN), junto con agentes seguidores controlados con el algoritmo Follower de ROBOTIS; y (3) validación del sistema en el simulador Gazebo, mediante el uso del middleware ROS 2 para la comunicación entre nodos y la gestión independiente de espacios de nombres para cada robot. Durante 1000 episodios de entrenamiento, el agente líder presentó una mejora progresiva en la recompensa acumulada, demostrando un aprendizaje estable y una navegación autónoma libre de colisiones. En la simulación multi-agente, los robots seguidores reprodujeron con precisión la trayectoria del líder, manteniendo formaciones estables y comunicación efectiva. Finalmente, las pruebas físicas confirmaron la correcta transferencia del modelo entrenado, manteniendo la coherencia del comportamiento observado en simulación. Los resultados evidencian que la combinación del aprendizaje por refuerzo profundo con una arquitectura descentralizada basada en ROS 2 constituye una estrategia viable para el desarrollo de comportamientos cooperativos en robótica móvil. El sistema diseñado reduce la dependencia de control central, mejora la escalabilidad del sistema y sienta las bases para futuras validaciones en entornos físicos más complejos. | |
| dc.description.abstract | Environmental event detection is a key challenge in modern robotics, particularly in systems that require autonomy, cooperation, and distributed decision-making. Decentralized multi-agent systems (MAS) emerge as anefficient alternative to centralized approaches, offering greater scalability, robustness, and fault tolerance. This work proposes the design and implementation of a decentralized multi-agent architecture based on ROS 2 for environmental event detection, using TurtleBot3 Burger mobile platforms and deep reinforcement learning. The research followed an experimental and applied approach, structured into three phases: (1) analysis and selection of physical variables, algorithms, and robotic platform; (2) development of the decentralized architecture integrating a leader agent trained through the Deep Q-Learning (DQN) algorithm and follower agents controlled by the ROBOTIS Follower algorithm; and (3) system validation in the Gazebo simulator using the ROS2middleware for inter-node communication and independent namespace management for each robot. Over 1000 training episodes, the leader agent showed a progressive improvement in cumulative rewards, achieving stable learning and autonomous navigation without collisions. In the multi-agent simulation, the follower robots accurately replicated the leader’s trajectory, maintaining stable formations and effective communication. Finally, physical tests confirmed the successful transfer of the trained model, maintaining consistency with the simulated behaviors. The results confirm that combining deep reinforcement learning with a decentralized ROS 2-based architecture is a viable strategy for developing cooperative behaviors in mobile robotics. The designed system reduces the dependence on a central controller, enhances scalability, and establishes a solid foundation for future validations in more complex physical environments. | |
| dc.description.degreelevel | Pregrado | spa |
| dc.description.degreename | Ingeniero Electronico | spa |
| dc.format.mimetype | application/pdf | |
| dc.identifier.citation | Castaño Ortiz, D. F. (2026). Arquitectura multi-agente descentralizada para detección de eventos en el ambiente. [Trabajo de Grado, Universidad Santo Tomás]. Repositorio Institucional | |
| dc.identifier.instname | instname:Universidad Santo Tomás | spa |
| dc.identifier.reponame | reponame:Repositorio Institucional Universidad Santo Tomás | spa |
| dc.identifier.repourl | repourl:https://repository.usta.edu.co | spa |
| dc.identifier.uri | http://hdl.handle.net/11634/72123 | |
| dc.language.iso | spa | |
| dc.publisher | Universidad Santo Tomás | spa |
| dc.publisher.branch | CRAI-USTA Bogotá | |
| dc.publisher.faculty | Facultad de Ingeniería Electrónica | spa |
| dc.publisher.program | Pregrado Ingeniería Electrónica | spa |
| dc.relation.references | S. Ajwad, «Distributed control of multi-agent systems under communication constraints : application to robotics», Tesis doct., sep. de 2020. | |
| dc.relation.references | «GED- Grupo de Estudio y Desarrollo en Robótica», dirección: https://scienti.minciencias. gov.co/gruplac/jsp/visualiza/visualizagr.jsp?nro=00000000002964. | |
| dc.relation.references | J. Park, R. Delgado y B. Choi, «Real-Time Characteristics of ROS 2.0 in Multiagent Robot Systems: An Empirical Study», IEEEAccess, vol. PP, págs.1-1, ago. de2020. DOI:10.1109/ACCESS.2020.3018122. | |
| dc.relation.references | M. Zaryouli, M. T. Fathi y M. Ezziyyani, «Data collection based on multi-agent modeling for intelli gent and precision farming in lokoss region morocco», en 2020 1st International Conference on Innova tive Research in Applied Science, Engineering and Technology (IRASET), 2020, págs. 1-6. DOI: 10.1109/ IRASET48871.2020.9092214. | |
| dc.relation.references | P. Vanella, Implementation of ROS-based multi-agent slam centralized and decentralized approaches, dic. de 2023. dirección: https://webthesis.biblio.polito.it/29366/. | |
| dc.relation.references | J. R. Marden, G. Arslan y J. S. Shamma, «Cooperative Control and Potential Games», IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), vol. 39, n.o 6, págs. 1393-1407, 2009. DOI: 10.1109/ TSMCB.2009.2017273. | |
| dc.relation.references | S. Hoet y N. Sabouret, «Reinforcement Learning of Communication in a Multi-agent Context», en 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology, vol. 2, 2011, págs. 240-243. DOI: 10.1109/WI-IAT.2011.125. | |
| dc.relation.references | J. Wang, J. Wu y X. Kong, «Multi-agent Simulation for Strategic Bidding in Electricity Markets Using Reinforcement Learning», CSEE Journal of Power and Energy Systems, vol. 9, n.o 3, págs. 1051-1065, 2023. DOI: 10.17775/CSEEJPES.2020.02820. | |
| dc.relation.references | D.Martínez y E. Mojica-Nava, «Distortion based potential game for distributed coverage control», Infor mation Sciences, vol. 600, págs. 209-225, 2022, ISSN: 0020-0255. DOI: https://doi.org/10.1016/j. ins.2022.03.090. dirección: https://www.sciencedirect.com/science/article/pii/ S0020025522003176 | |
| dc.relation.references | G.CardonayJ.Calderon,«RobotSwarmNavigationandVictimDetectionUsingRendezvousConsensus in Search and Rescue Operations», Applied Sciences, vol. 9, pág. 1702, abr. de 2019. DOI: 10.3390/ app9081702. | |
| dc.relation.references | «Objetivo9: Construir infraestructuras resilientes, promover la industrialización sostenible y fomentar la innovación», dirección: https://www.un.org/sustainabledevelopment/es/infrastructure/ | |
| dc.relation.references | «Objetivo 11: Lograr que las ciudades sean más inclusivas, seguras, resilientes y sostenibles», dirección: https://www.un.org/sustainabledevelopment/es/cities/. | |
| dc.relation.references | «Objetivo 6: Garantizar la disponibilidad de agua y su gestión sostenible y el saneamiento para todos», dirección: https://www.un.org/sustainabledevelopment/es/water-and-sanitation/. | |
| dc.relation.references | «Objetivo 15: Gestionar sosteniblemente los bosques, luchar contra la desertificación, detener e invertir la degradación de las tierras, detener la pérdida de biodiversidad», dirección: https://www.un.org/ sustainabledevelopment/es/biodiversity/. | |
| dc.relation.references | C.EscribanoGarcía-Machín,«Leader-FollowerDecentralicedControlofaNanoquadrotorSwarm»,2019. | |
| dc.relation.references | M.-L. Li, S. Chen y J. Chen, «Adaptive Learning: A New Decentralized Reinforcement Learning Ap proach for Cooperative Multiagent Systems», IEEE Access, vol. 8, págs. 99404-99421, 2020. DOI: 10. 1109/ACCESS.2020.2997899. | |
| dc.relation.references | T. Ikeda y T. Shibuya, «Centralized Training with Decentralized Execution Reinforcement Learning for Cooperative Multi-agent Systems with Communication Delay», en 2022 61st Annual Conference of the Society of Instrument and Control Engineers (SICE), 2022, págs. 135-140. DOI: 10.23919/SICE56594. 2022.9905866. | |
| dc.relation.references | W. Li, Y. Deng, M. Zhang, J. Li, S. Chen y S. Zhang, «Integrated Multistage Self-Healing in Smart Dis tribution Grids Using Decentralized Multiagent», IEEE Access, vol. 9, págs. 159081-159092, 2021. DOI: 10.1109/ACCESS.2021.3131214. | |
| dc.relation.references | G. Xu, Z. Yang, W. Lu y L. Zhang, «Decentralized Multi-UAV Cooperative Search Based on ROS1 and ROS2», en International Conference on Autonomous Unmanned Systems, Springer, 2021, págs. 2427-2435. | |
| dc.relation.references | M.Siwek,K.BesseghieuryL.Baranowski,«Theeffectsoftheswarmconfigurationandtheobstaclespla cement oncontrol signals transmission delays in decentralized ROS-embedded group of mobile robots», en AIP Conference Proceedings, AIP Publishing, vol. 2029, 2018. | |
| dc.relation.references | J. M. Esposito y V. Kumar, «An asynchronous integration and event detection algorithm for simulating multi-agent hybrid systems», ACM Transactions on Modeling and Computer Simulation (TOMACS), vol. 14, n.o 4, págs. 363-388, 2004. | |
| dc.relation.references | J. Wang, X. Luo y J. Yan, «Event-Triggered Consensus Control for Second-Order Multi-Agent Systems With/Without Input Time Delay», IEEE Access, vol. PP, págs. 1-1, oct. de 2019. DOI: 10.1109/ACCESS. 2019.2946263. | |
| dc.relation.references | F. MuñozPalacios, E. S. Espinoza Quesada, H. La, S. Salazar, S. Commuri y L. R. Garcia Carrillo, «Adap tive consensus algorithms for real-time operation of multi-agent systems affected by switching network events», International Journal of Robust and Nonlinear Control, vol. 27, oct. de 2016. DOI: 10.1002/rnc. 3687. | |
| dc.relation.references | M.Wooldridge, An Introduction to MultiAgent Systems, 2.a ed. Wiley, 2009. | |
| dc.relation.references | Y. Shoham y K. Leyton-Brown, Multiagent Systems: Algorithmic, Game-Theoretic, and Logical Foundations. Cambridge University Press, 2009. | |
| dc.relation.references | S. Russell y P. Norvig, Artificial Intelligence: A Modern Approach, 4.a ed. Pearson, 2021. | |
| dc.relation.references | G. Weiss, Multiagent Systems: A Modern Approach to Distributed Artificial Intelligence. MIT Press, 1999. | |
| dc.relation.references | W.RenyR.W.Beard, Distributed Consensus in Multi-vehicle Cooperative Control. Springer, 2008. | |
| dc.relation.references | R. Olfati-Saber, J. A. Fax y R. M. Murray, «Consensus and cooperation in networked multi-agent sys tems», Proceedings of the IEEE, vol. 95, n.o 1, págs. 215-233, 2007. | |
| dc.relation.references | M.Brambilla, E. Ferrante, M. Birattari y M. Dorigo, «Swarm robotics: a review from the swarm enginee ring perspective», Swarm Intelligence, vol. 7, n.o 1, págs. 1-41, 2013. | |
| dc.relation.references | R. S. Sutton y A. G. Barto, Reinforcement Learning: An Introduction, 2.a ed. MIT Press, 2018. | |
| dc.relation.references | L. P. Kaelbling, M. L. Littman y A. W. Moore, «Reinforcement Learning: A Survey», Journal of Artificial Intelligence Research, vol. 4, págs. 237-285, 1996. | |
| dc.relation.references | C. J. Watkins y P. Dayan, «Q-Learning», Machine Learning, vol. 8, n.o 3-4, págs. 279-292, 1992. | |
| dc.relation.references | J. Kober, J. A. Bagnell y J. Peters, «Reinforcement Learning in Robotics: A Survey», The International Journal of Robotics Research, vol. 32, n.o 11, págs. 1238-1274, 2013. | |
| dc.relation.references | T.P.Lillicrapetal., «Continuouscontrolwithdeepreinforcementlearning»,arXivpreprintarXiv:1509.02971, 2016. | |
| dc.relation.references | V.Mnihetal.,«Human-levelcontrolthroughdeepreinforcementlearning»,Nature,vol.518,págs.529-533, 2015 | |
| dc.relation.references | L.-J. Lin, «Self-improving reactive agents based on reinforcement learning, planning and teaching», Ma chine Learning, vol. 8, n.o 3-4, págs. 293-321, 1992 | |
| dc.relation.references | J.Cortés, S. Martínez yF.Bullo, «Robustrendezvousformobileautonomousagentsviaproximitygraphs in arbitrary dimensions», IEEE Transactions on Automatic Control, vol. 51, n.o 8, págs. 1289-1298, 2006. | |
| dc.relation.references | M. Quigley et al., «ROS: an open-source Robot Operating System», en IEEE International Conference on Robotics and Automation (ICRA) Workshop on Open Source Software, 2009. | |
| dc.relation.references | N.KoenigyA.Howard,«DesignanduseparadigmsforGazebo,anopen-sourcemulti-robotsimulator», IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), págs. 2149-2154, 2004. | |
| dc.relation.references | S. Macenski, F. Martín, R. White y J. Ginés Clavero, «The ROS 2 Navigation System (Nav2): Design, architecture, and methods», The International Journal of Robotics Research, vol. 41, n.o 7, págs. 688-716, 2022. | |
| dc.relation.references | B. Gerkey, ROS 2: Towards a fully distributed, real-time robotic middleware, Open Robotics Blog, 2020. direc ción: https://www.openrobotics.org/blog. | |
| dc.relation.references | Y. Chen, Y. Fan y M. Jin, «Research on Sensor Technology in Mobile Robot Navigation», Applied and Computational Engineering, vol. 93, n.o 1, págs. 50-55, 2024 | |
| dc.relation.references | StandardBots. «Types of sensors in robotics: Complete guide with examples». dirección: https:// standardbots.com/blog/every-type-of-sensors-in-robotics---explained. | |
| dc.relation.references | F. Cañadas-Aránega et al., «Autonomous collaborative mobile robot for greenhouses», ScienceDirect, 2024, One of the most common sensors used for navigation is the LiDAR sensor. | |
| dc.relation.references | X.Qiu,K.WanyF.Li,«MobileRobotNavigationUsingDeepReinforcementLearning»,Processes,vol.10, n.o 12, pág. 2748, 2022. | |
| dc.relation.references | Y.Zhu,W.Z.W.Hasan,H.R.H.Ramli,N.M.H.Norsahperi,M.S.M.KassimeY.Yao,«DeepReinforce mentLearning of Mobile Robot Navigation in Dynamic Environment: A Review», Sensors, vol. 25, n.o 11, pág. 3394, 2025. | |
| dc.relation.references | L. Dokoet al., «A Comprehensive Review of Mobile Robot Navigation Using Deep Reinforcement Lear ning Algorithms in Crowded Environments», Journal of Intelligent & Robotic Systems, vol. 110, 2024. | |
| dc.relation.references | ROBOTIS, Robot Platform — TurtleBot3 Features, Consulta: 10 de noviembre de 2025, 2025. dirección: https://emanual.robotis.com/docs/en/platform/turtlebot3/features/. | |
| dc.relation.references | ROBOTIS, TurtleBot3 — Overview and ROS 2 Integration, Consulta: 10 de noviembre de 2025, 2025. direc ción: https://emanual.robotis.com/docs/en/platform/turtlebot3/overview/. | |
| dc.relation.references | A. Kumar y K. Konathalapalli, «Autonomous Navigation of ROS 2 based TurtleBot3 in Static and Dy namic Environments using Intelligent Approaches», International Journal of Information Technology, 2025, Uso de TurtleBot3 con LiDAR y aprendizaje por refuerzo profundo. | |
| dc.relation.references | A. W. Services. «Get to know your AWS DeepRacer vehicle». Consulta: 10 de noviembre de 2025. di rección: https://docs.aws.amazon.com/deepracer/latest/developerguide/inspect your-vehicle.html. | |
| dc.relation.references | W. contributors. «Traxxas — RC Vehicle Manufacturer Overview». Consulta: 10 de noviembre de 2025. dirección: https://en.wikipedia.org/wiki/Traxxas. | |
| dc.relation.references | R. Federation. «Middle Size League — RoboCup Federation». Consulta: 10 de noviembre de 2025. direc ción: https://msl.robocup.org/. | |
| dc.relation.references | L. ROBOTIS Co., turtlebot3_applications, https://github.com/ROBOTIS- GIT/turtlebot3_ applications, Accessed: November 9, 2025, n.d. | |
| dc.relation.references | L.ROBOTISCo.«TurtleBot3BasicExamples–Follower».Accessed:November9,2025.dirección:https: //emanual.robotis.com/docs/en/platform/turtlebot3/basic_examples. | |
| dc.relation.references | L. ROBOTIS Co., turtlebot3_machine_learning, https://github.com/ROBOTIS-GIT/turtlebot3_ machine_learning/tree/humble, Último acceso: 9 de noviembre de 2025, n.d. | |
| dc.relation.references | L. ROBOTIS Co. «TurtleBot3 Machine Learning– Deep Q-Learning». Accessed: November 9, 2025. di rección: https://emanual.robotis.com/docs/en/platform/turtlebot3/machine_ learning/. | |
| dc.rights.accessrights | info:eu-repo/semantics/openAccess | |
| dc.rights.coar | http://purl.org/coar/access_right/c_abf2 | |
| dc.rights.local | Abierto (Texto Completo) | spa |
| dc.subject.keyword | Multi-Agent Systems | |
| dc.subject.keyword | Reinforcement Learning | |
| dc.subject.keyword | Deep Q-Learning | |
| dc.subject.keyword | ROS 2 | |
| dc.subject.keyword | Gazebo | |
| dc.subject.keyword | Decentralized Robotics. | |
| dc.subject.lemb | Ingeniería electrónica | |
| dc.subject.lemb | Robótica móvil -- Toma de decisiones | |
| dc.subject.lemb | Inteligencia artificial -- Aprendizaje | |
| dc.subject.proposal | Sistemas Multi-Agente | |
| dc.subject.proposal | Aprendizaje por Refuerzo | |
| dc.subject.proposal | Deep Q-Learning | |
| dc.subject.proposal | ROS 2 | |
| dc.subject.proposal | Gazebo | |
| dc.subject.proposal | Robótica Descentralizada | |
| dc.title | Arquitectura multi-agente descentralizada para detección de eventos en el ambiente | |
| dc.type | bachelor thesis | |
| dc.type.coar | http://purl.org/coar/resource_type/c_7a1f | |
| dc.type.coarversion | http://purl.org/coar/version/c_ab4af688f83e57aa | |
| dc.type.drive | info:eu-repo/semantics/bachelorThesis | |
| dc.type.local | Trabajo de grado | spa |
| dc.type.version | info:eu-repo/semantics/acceptedVersion |
Archivos
Bloque original
1 - 1 de 1
Cargando...
- Nombre:
- 2025danielcastano.pdf
- Tamaño:
- 3.06 MB
- Formato:
- Adobe Portable Document Format
Bloque de licencias
1 - 3 de 3
Cargando...
- Nombre:
- license.txt
- Tamaño:
- 807 B
- Formato:
- Item-specific license agreed upon to submission
- Descripción:
Cargando...
- Nombre:
- 2025cartafacultad.pdf
- Tamaño:
- 491.35 KB
- Formato:
- Adobe Portable Document Format
- Descripción:
- Carta de facultad
Cargando...
- Nombre:
- 2025cartaderechosdeautor.pdf
- Tamaño:
- 32.4 KB
- Formato:
- Adobe Portable Document Format
- Descripción:
- Carta derechos de autor

