Decentralized multi-agent architecture for environmental event detection

Castaño Ortiz, Daniel Fernando

dc.contributor.advisor	Martínez Vásquez, David Alejandro
dc.contributor.author	Castaño Ortiz, Daniel Fernando
dc.contributor.corporatename	Universidad Santo Tomás
dc.contributor.cvlac	https://scienti.minciencias.gov.co/cvlac/visualizador/generarCurriculoCv.do?cod_rh=0001560096
dc.contributor.cvlac	https://scienti.minciencias.gov.co/cvlac/visualizador/generarCurriculoCv.do?cod_rh=0002173917
dc.contributor.googlescholar	https://scholar.google.com/citations?user=U5Qf1nUAAAAJ&hl=es&oi=ao
dc.contributor.orcid	https://orcid.org/0000-0001-9750-2653
dc.contributor.orcid	https://orcid.org/0009-0003-2216-0937
dc.date.accessioned	2026-04-20T15:12:57Z
dc.date.available	2026-04-20T15:12:57Z
dc.date.issued	2026-04-20
dc.description	Environmental event detection is a central challenge in modern robotics, especially in systems that pursue autonomy, cooperation, and distributed decision-making. Decentralized multi-agent systems (MAS) have emerged as an efficient alternative to centralized approaches because they offer greater scalability, robustness, and fault tolerance. This work proposes the design and implementation of a decentralized multi-agent architecture based on ROS 2 for environmental event detection, using TurtleBot3 Burger mobile platforms and deep reinforcement learning. The research followed an experimental, applied approach structured in three phases: (1) analysis and selection of the physical variables, algorithms, and robotic platform; (2) development of the decentralized architecture, integrating a leader agent trained with the Deep Q-Learning (DQN) algorithm together with follower agents controlled by the ROBOTIS Follower algorithm; and (3) validation of the system in the Gazebo simulator, using the ROS 2 middleware for inter-node communication and an independent namespace for each robot. Over 1000 training episodes, the leader agent showed a progressive improvement in cumulative reward, demonstrating stable learning and collision-free autonomous navigation. In the multi-agent simulation, the follower robots accurately reproduced the leader's trajectory, maintaining stable formations and effective communication. Finally, physical tests confirmed the correct transfer of the trained model, which remained consistent with the behavior observed in simulation. The results show that combining deep reinforcement learning with a decentralized ROS 2-based architecture is a viable strategy for developing cooperative behaviors in mobile robotics. The designed system reduces dependence on central control, improves scalability, and lays the groundwork for future validation in more complex physical environments.
dc.description.abstract	Environmental event detection is a key challenge in modern robotics, particularly in systems that require autonomy, cooperation, and distributed decision-making. Decentralized multi-agent systems (MAS) emerge as an efficient alternative to centralized approaches, offering greater scalability, robustness, and fault tolerance. This work proposes the design and implementation of a decentralized multi-agent architecture based on ROS 2 for environmental event detection, using TurtleBot3 Burger mobile platforms and deep reinforcement learning. The research followed an experimental and applied approach, structured into three phases: (1) analysis and selection of physical variables, algorithms, and robotic platform; (2) development of the decentralized architecture integrating a leader agent trained through the Deep Q-Learning (DQN) algorithm and follower agents controlled by the ROBOTIS Follower algorithm; and (3) system validation in the Gazebo simulator using the ROS 2 middleware for inter-node communication and independent namespace management for each robot. Over 1000 training episodes, the leader agent showed a progressive improvement in cumulative rewards, achieving stable learning and autonomous navigation without collisions. In the multi-agent simulation, the follower robots accurately replicated the leader’s trajectory, maintaining stable formations and effective communication. Finally, physical tests confirmed the successful transfer of the trained model, maintaining consistency with the simulated behaviors. The results confirm that combining deep reinforcement learning with a decentralized ROS 2-based architecture is a viable strategy for developing cooperative behaviors in mobile robotics. The designed system reduces the dependence on a central controller, enhances scalability, and establishes a solid foundation for future validations in more complex physical environments.
dc.description.degreelevel	Pregrado	spa
dc.description.degreename	Ingeniero Electrónico	spa
dc.format.mimetype	application/pdf
dc.identifier.citation	Castaño Ortiz, D. F. (2026). Arquitectura multi-agente descentralizada para detección de eventos en el ambiente. [Trabajo de Grado, Universidad Santo Tomás]. Repositorio Institucional
dc.identifier.instname	instname:Universidad Santo Tomás	spa
dc.identifier.reponame	reponame:Repositorio Institucional Universidad Santo Tomás	spa
dc.identifier.repourl	repourl:https://repository.usta.edu.co	spa
dc.identifier.uri	http://hdl.handle.net/11634/72123
dc.language.iso	spa
dc.publisher	Universidad Santo Tomás	spa
dc.publisher.branch	CRAI-USTA Bogotá
dc.publisher.faculty	Facultad de Ingeniería Electrónica	spa
dc.publisher.program	Pregrado Ingeniería Electrónica	spa
dc.relation.references	S. Ajwad, «Distributed control of multi-agent systems under communication constraints : application to robotics», Tesis doct., sep. de 2020.
dc.relation.references	«GED - Grupo de Estudio y Desarrollo en Robótica», dirección: https://scienti.minciencias.gov.co/gruplac/jsp/visualiza/visualizagr.jsp?nro=00000000002964.
dc.relation.references	J. Park, R. Delgado y B. Choi, «Real-Time Characteristics of ROS 2.0 in Multiagent Robot Systems: An Empirical Study», IEEE Access, vol. PP, págs. 1-1, ago. de 2020. DOI: 10.1109/ACCESS.2020.3018122.
dc.relation.references	M. Zaryouli, M. T. Fathi y M. Ezziyyani, «Data collection based on multi-agent modeling for intelligent and precision farming in lokoss region morocco», en 2020 1st International Conference on Innovative Research in Applied Science, Engineering and Technology (IRASET), 2020, págs. 1-6. DOI: 10.1109/IRASET48871.2020.9092214.
dc.relation.references	P. Vanella, Implementation of ROS-based multi-agent slam centralized and decentralized approaches, dic. de 2023. dirección: https://webthesis.biblio.polito.it/29366/.
dc.relation.references	J. R. Marden, G. Arslan y J. S. Shamma, «Cooperative Control and Potential Games», IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), vol. 39, n.o 6, págs. 1393-1407, 2009. DOI: 10.1109/TSMCB.2009.2017273.
dc.relation.references	S. Hoet y N. Sabouret, «Reinforcement Learning of Communication in a Multi-agent Context», en 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology, vol. 2, 2011, págs. 240-243. DOI: 10.1109/WI-IAT.2011.125.
dc.relation.references	J. Wang, J. Wu y X. Kong, «Multi-agent Simulation for Strategic Bidding in Electricity Markets Using Reinforcement Learning», CSEE Journal of Power and Energy Systems, vol. 9, n.o 3, págs. 1051-1065, 2023. DOI: 10.17775/CSEEJPES.2020.02820.
dc.relation.references	D. Martínez y E. Mojica-Nava, «Distortion based potential game for distributed coverage control», Information Sciences, vol. 600, págs. 209-225, 2022, ISSN: 0020-0255. DOI: https://doi.org/10.1016/j.ins.2022.03.090. dirección: https://www.sciencedirect.com/science/article/pii/S0020025522003176
dc.relation.references	G. Cardona y J. Calderon, «Robot Swarm Navigation and Victim Detection Using Rendezvous Consensus in Search and Rescue Operations», Applied Sciences, vol. 9, pág. 1702, abr. de 2019. DOI: 10.3390/app9081702.
dc.relation.references	«Objetivo 9: Construir infraestructuras resilientes, promover la industrialización sostenible y fomentar la innovación», dirección: https://www.un.org/sustainabledevelopment/es/infrastructure/
dc.relation.references	«Objetivo 11: Lograr que las ciudades sean más inclusivas, seguras, resilientes y sostenibles», dirección: https://www.un.org/sustainabledevelopment/es/cities/.
dc.relation.references	«Objetivo 6: Garantizar la disponibilidad de agua y su gestión sostenible y el saneamiento para todos», dirección: https://www.un.org/sustainabledevelopment/es/water-and-sanitation/.
dc.relation.references	«Objetivo 15: Gestionar sosteniblemente los bosques, luchar contra la desertificación, detener e invertir la degradación de las tierras, detener la pérdida de biodiversidad», dirección: https://www.un.org/sustainabledevelopment/es/biodiversity/.
dc.relation.references	C. Escribano García-Machín, «Leader-Follower Decentraliced Control of a Nanoquadrotor Swarm», 2019.
dc.relation.references	M.-L. Li, S. Chen y J. Chen, «Adaptive Learning: A New Decentralized Reinforcement Learning Approach for Cooperative Multiagent Systems», IEEE Access, vol. 8, págs. 99404-99421, 2020. DOI: 10.1109/ACCESS.2020.2997899.
dc.relation.references	T. Ikeda y T. Shibuya, «Centralized Training with Decentralized Execution Reinforcement Learning for Cooperative Multi-agent Systems with Communication Delay», en 2022 61st Annual Conference of the Society of Instrument and Control Engineers (SICE), 2022, págs. 135-140. DOI: 10.23919/SICE56594.2022.9905866.
dc.relation.references	W. Li, Y. Deng, M. Zhang, J. Li, S. Chen y S. Zhang, «Integrated Multistage Self-Healing in Smart Distribution Grids Using Decentralized Multiagent», IEEE Access, vol. 9, págs. 159081-159092, 2021. DOI: 10.1109/ACCESS.2021.3131214.
dc.relation.references	G. Xu, Z. Yang, W. Lu y L. Zhang, «Decentralized Multi-UAV Cooperative Search Based on ROS1 and ROS2», en International Conference on Autonomous Unmanned Systems, Springer, 2021, págs. 2427-2435.
dc.relation.references	M. Siwek, K. Besseghieur y L. Baranowski, «The effects of the swarm configuration and the obstacles placement on control signals transmission delays in decentralized ROS-embedded group of mobile robots», en AIP Conference Proceedings, AIP Publishing, vol. 2029, 2018.
dc.relation.references	J. M. Esposito y V. Kumar, «An asynchronous integration and event detection algorithm for simulating multi-agent hybrid systems», ACM Transactions on Modeling and Computer Simulation (TOMACS), vol. 14, n.o 4, págs. 363-388, 2004.
dc.relation.references	J. Wang, X. Luo y J. Yan, «Event-Triggered Consensus Control for Second-Order Multi-Agent Systems With/Without Input Time Delay», IEEE Access, vol. PP, págs. 1-1, oct. de 2019. DOI: 10.1109/ACCESS.2019.2946263.
dc.relation.references	F. Muñoz Palacios, E. S. Espinoza Quesada, H. La, S. Salazar, S. Commuri y L. R. Garcia Carrillo, «Adaptive consensus algorithms for real-time operation of multi-agent systems affected by switching network events», International Journal of Robust and Nonlinear Control, vol. 27, oct. de 2016. DOI: 10.1002/rnc.3687.
dc.relation.references	M. Wooldridge, An Introduction to MultiAgent Systems, 2.a ed. Wiley, 2009.
dc.relation.references	Y. Shoham y K. Leyton-Brown, Multiagent Systems: Algorithmic, Game-Theoretic, and Logical Foundations. Cambridge University Press, 2009.
dc.relation.references	S. Russell y P. Norvig, Artificial Intelligence: A Modern Approach, 4.a ed. Pearson, 2021.
dc.relation.references	G. Weiss, Multiagent Systems: A Modern Approach to Distributed Artificial Intelligence. MIT Press, 1999.
dc.relation.references	W. Ren y R. W. Beard, Distributed Consensus in Multi-vehicle Cooperative Control. Springer, 2008.
dc.relation.references	R. Olfati-Saber, J. A. Fax y R. M. Murray, «Consensus and cooperation in networked multi-agent systems», Proceedings of the IEEE, vol. 95, n.o 1, págs. 215-233, 2007.
dc.relation.references	M. Brambilla, E. Ferrante, M. Birattari y M. Dorigo, «Swarm robotics: a review from the swarm engineering perspective», Swarm Intelligence, vol. 7, n.o 1, págs. 1-41, 2013.
dc.relation.references	R. S. Sutton y A. G. Barto, Reinforcement Learning: An Introduction, 2.a ed. MIT Press, 2018.
dc.relation.references	L. P. Kaelbling, M. L. Littman y A. W. Moore, «Reinforcement Learning: A Survey», Journal of Artificial Intelligence Research, vol. 4, págs. 237-285, 1996.
dc.relation.references	C. J. Watkins y P. Dayan, «Q-Learning», Machine Learning, vol. 8, n.o 3-4, págs. 279-292, 1992.
dc.relation.references	J. Kober, J. A. Bagnell y J. Peters, «Reinforcement Learning in Robotics: A Survey», The International Journal of Robotics Research, vol. 32, n.o 11, págs. 1238-1274, 2013.
dc.relation.references	T. P. Lillicrap et al., «Continuous control with deep reinforcement learning», arXiv preprint arXiv:1509.02971, 2016.
dc.relation.references	V. Mnih et al., «Human-level control through deep reinforcement learning», Nature, vol. 518, págs. 529-533, 2015.
dc.relation.references	L.-J. Lin, «Self-improving reactive agents based on reinforcement learning, planning and teaching», Machine Learning, vol. 8, n.o 3-4, págs. 293-321, 1992.
dc.relation.references	J. Cortés, S. Martínez y F. Bullo, «Robust rendezvous for mobile autonomous agents via proximity graphs in arbitrary dimensions», IEEE Transactions on Automatic Control, vol. 51, n.o 8, págs. 1289-1298, 2006.
dc.relation.references	M. Quigley et al., «ROS: an open-source Robot Operating System», en IEEE International Conference on Robotics and Automation (ICRA) Workshop on Open Source Software, 2009.
dc.relation.references	N. Koenig y A. Howard, «Design and use paradigms for Gazebo, an open-source multi-robot simulator», IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), págs. 2149-2154, 2004.
dc.relation.references	S. Macenski, F. Martín, R. White y J. Ginés Clavero, «The ROS 2 Navigation System (Nav2): Design, architecture, and methods», The International Journal of Robotics Research, vol. 41, n.o 7, págs. 688-716, 2022.
dc.relation.references	B. Gerkey, ROS 2: Towards a fully distributed, real-time robotic middleware, Open Robotics Blog, 2020. dirección: https://www.openrobotics.org/blog.
dc.relation.references	Y. Chen, Y. Fan y M. Jin, «Research on Sensor Technology in Mobile Robot Navigation», Applied and Computational Engineering, vol. 93, n.o 1, págs. 50-55, 2024
dc.relation.references	StandardBots. «Types of sensors in robotics: Complete guide with examples». dirección: https://standardbots.com/blog/every-type-of-sensors-in-robotics---explained.
dc.relation.references	F. Cañadas-Aránega et al., «Autonomous collaborative mobile robot for greenhouses», ScienceDirect, 2024, One of the most common sensors used for navigation is the LiDAR sensor.
dc.relation.references	X. Qiu, K. Wan y F. Li, «Mobile Robot Navigation Using Deep Reinforcement Learning», Processes, vol. 10, n.o 12, pág. 2748, 2022.
dc.relation.references	Y. Zhu, W. Z. W. Hasan, H. R. H. Ramli, N. M. H. Norsahperi, M. S. M. Kassim e Y. Yao, «Deep Reinforcement Learning of Mobile Robot Navigation in Dynamic Environment: A Review», Sensors, vol. 25, n.o 11, pág. 3394, 2025.
dc.relation.references	L. Doko et al., «A Comprehensive Review of Mobile Robot Navigation Using Deep Reinforcement Learning Algorithms in Crowded Environments», Journal of Intelligent & Robotic Systems, vol. 110, 2024.
dc.relation.references	ROBOTIS, Robot Platform — TurtleBot3 Features, Consulta: 10 de noviembre de 2025, 2025. dirección: https://emanual.robotis.com/docs/en/platform/turtlebot3/features/.
dc.relation.references	ROBOTIS, TurtleBot3 — Overview and ROS 2 Integration, Consulta: 10 de noviembre de 2025, 2025. dirección: https://emanual.robotis.com/docs/en/platform/turtlebot3/overview/.
dc.relation.references	A. Kumar y K. Konathalapalli, «Autonomous Navigation of ROS 2 based TurtleBot3 in Static and Dynamic Environments using Intelligent Approaches», International Journal of Information Technology, 2025, Uso de TurtleBot3 con LiDAR y aprendizaje por refuerzo profundo.
dc.relation.references	A. W. Services. «Get to know your AWS DeepRacer vehicle». Consulta: 10 de noviembre de 2025. dirección: https://docs.aws.amazon.com/deepracer/latest/developerguide/inspect your-vehicle.html.
dc.relation.references	W. contributors. «Traxxas — RC Vehicle Manufacturer Overview». Consulta: 10 de noviembre de 2025. dirección: https://en.wikipedia.org/wiki/Traxxas.
dc.relation.references	R. Federation. «Middle Size League — RoboCup Federation». Consulta: 10 de noviembre de 2025. dirección: https://msl.robocup.org/.
dc.relation.references	L. ROBOTIS Co., turtlebot3_applications, https://github.com/ROBOTIS-GIT/turtlebot3_applications, Accessed: November 9, 2025, n.d.
dc.relation.references	L. ROBOTIS Co. «TurtleBot3 Basic Examples – Follower». Accessed: November 9, 2025. dirección: https://emanual.robotis.com/docs/en/platform/turtlebot3/basic_examples.
dc.relation.references	L. ROBOTIS Co., turtlebot3_machine_learning, https://github.com/ROBOTIS-GIT/turtlebot3_machine_learning/tree/humble, Último acceso: 9 de noviembre de 2025, n.d.
dc.relation.references	L. ROBOTIS Co. «TurtleBot3 Machine Learning – Deep Q-Learning». Accessed: November 9, 2025. dirección: https://emanual.robotis.com/docs/en/platform/turtlebot3/machine_learning/.
dc.rights.accessrights	info:eu-repo/semantics/openAccess
dc.rights.coar	http://purl.org/coar/access_right/c_abf2
dc.rights.local	Abierto (Texto Completo)	spa
dc.subject.keyword	Multi-Agent Systems
dc.subject.keyword	Reinforcement Learning
dc.subject.keyword	Deep Q-Learning
dc.subject.keyword	ROS 2
dc.subject.keyword	Gazebo
dc.subject.keyword	Decentralized Robotics
dc.subject.lemb	Ingeniería electrónica
dc.subject.lemb	Robótica móvil -- Toma de decisiones
dc.subject.lemb	Inteligencia artificial -- Aprendizaje
dc.subject.proposal	Sistemas Multi-Agente
dc.subject.proposal	Aprendizaje por Refuerzo
dc.subject.proposal	Deep Q-Learning
dc.subject.proposal	ROS 2
dc.subject.proposal	Gazebo
dc.subject.proposal	Robótica Descentralizada
dc.title	Arquitectura multi-agente descentralizada para detección de eventos en el ambiente
dc.type	bachelor thesis
dc.type.coar	http://purl.org/coar/resource_type/c_7a1f
dc.type.coarversion	http://purl.org/coar/version/c_ab4af688f83e57aa
dc.type.drive	info:eu-repo/semantics/bachelorThesis
dc.type.local	Trabajo de grado	spa
dc.type.version	info:eu-repo/semantics/acceptedVersion

Files

Original bundle

Name: 2025danielcastano.pdf
Size: 3.06 MB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 807 B
Format: Item-specific license agreed upon to submission

Name: 2025cartafacultad.pdf
Size: 491.35 KB
Format: Adobe Portable Document Format
Description: Faculty letter

Name: 2025cartaderechosdeautor.pdf
Size: 32.4 KB
Format: Adobe Portable Document Format
Description: Copyright letter

Collections

Pregrado Ingeniería Electrónica