Arquitectura RAG para el Contexto en PLN Generación y Acceso Inteligente de Datos

dc.contributor.advisorGonzalez Torres, Daniel Leonardo
dc.contributor.advisorSanta Quintero, Ricardo Andres
dc.contributor.authorGonzalez Torres, Daniel Leonardo
dc.contributor.authorSanta Quintero, Ricardo Andres
dc.coverage.spatialBogotáspa
dc.creator.emaildaniell-gonzalezt@unilibre.edu.cospa
dc.creator.emailricardoa.santaq@unilibre.edu.cospa
dc.date.accessioned2025-06-04T15:32:03Z
dc.date.available2025-06-04T15:32:03Z
dc.date.created2025-03-06
dc.description.abstractEste artículo explora en profundidad la integración de técnicas avanzadas de machine learning mediante la metodología Retrieval Augmented Generation (RAG). Se analiza la arquitectura dual que combina procesos de recuperación y generación de información, resaltando su impacto en el entrenamiento de modelos de lenguaje natural. Asimismo, se presentan variantes especializadas como el Corrective RAG y el Advanced RAG, que incorporan mecanismos de retroalimentación y optimización en tiempo real. Se incluye, además, una mención del producto JurislibreIA, desarrollado por el semillero Sensorama, ejemplificando aplicaciones prácticas en dominios complejos como el legal. El estudio se fundamenta en ejemplos de implementación en Python, diagramas explicativos y una revisión crítica de las fuentes relevantes, ofreciendo una guía completa para investigadores y desarrolladores interesados en impulsar soluciones innovadoras basadas en RAG.spa
dc.description.abstractenglishThis article explores in depth the integration of advanced machine learning techniques using the Retrieval Augmented Generation (RAG) methodology. The dual architecture that combines information retrieval and generation processes is analyzed, highlighting its impact on the training of natural language models. Likewise, specialized variants such as Corrective RAG and Advanced RAG are presented, which incorporate real-time feedback and optimization mechanisms. Also included a mention of the JurislibreIA product, developed by the Sensorama research group, exemplifying practical applications in complex domains such as the legal one. The study is based on implementation examples in Python, explanatory diagrams and a critical review of relevant sources, offering a complete guide for researchers and developers interested in promoting innovative solutions based on RAG.spa
dc.description.sponsorshipUniversidad Libre -- Ingenieria -- Ingenieria de sistemasspa
dc.formatPDFspa
dc.identifier.urihttps://hdl.handle.net/10901/31241
dc.relation.referencesP. Lewis, et al., "Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks," Advances in Neural Information Processing Systems (NeurIPS), 2020.spa
dc.relation.referencesJ. Devlin, M.-W. Chang, K. Lee, and K. Toutanova, "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding," Proc. of NAACL, 2018.spa
dc.relation.referencesT. Mikolov, et al., "Distributed Representations of Words and Phrases and their Compositionality," Advances in Neural Information Processing Systems (NeurIPS), 2013.spa
dc.relation.referencesA. Vaswani, et al., "Attention is All You Need," Advances in Neural Information Processing Systems (NeurIPS), 2017.spa
dc.relation.referencesI. Sutskever, O. Vinyals, and Q. V. Le, "Sequence to Sequence Learning with Neural Networks," Advances in Neural Information Processing Systems (NeurIPS), 2014.spa
dc.relation.referencesS. Hochreiter and J. Schmidhuber, "Long Short-Term Memory," Neural Computation, vol. 9, no. 8, pp. 1735–1780, 1997.spa
dc.relation.referencesY. Kim, "Convolutional Neural Networks for Sentence Classification," Proc. of EMNLP, 2014.spa
dc.relation.referencesR. Johnson, et al., "Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation," arXiv:1609.08144, 2016.spa
dc.relation.referencesZ. Yang, et al., "XLNet: Generalized Autoregressive Pretraining for Language Understanding," Advances in Neural Information Processing Systems (NeurIPS), 2019.spa
dc.relation.referencesK. He, X. Zhang, S. Ren, and J. Sun, "Deep Residual Learning for Image Recognition," Proc. of CVPR, 2016.spa
dc.relation.referencesM. Abadi, et al., "TensorFlow: A System for Large-Scale Machine Learning," Proc. of OSDI, 2016.spa
dc.relation.referencesF. Chollet, "Keras," GitHub repository, 2015. [Online]. Available: https://github.com/keras-team/keras.spa
dc.relation.referencesP. Russell and S. Norvig, Artificial Intelligence: A Modern Approach, 3rd ed. Prentice Hall, 2010.spa
dc.relation.referencesS. Bengio and Y. LeCun, "Scaling Learning Algorithms towards AI," in Large-Scale Kernel Machines, 2007.spa
dc.relation.referencesHugging Face, "Retrieval-Augmented Generation (RAG)," [Online]. Available: https://huggingface.co/docs/transformers/model_doc/rag Accessed: Feb. 18, 2025.spa
dc.rights.accessrightsinfo:eu-repo/semantics/openAccessspa
dc.rights.coarhttp://purl.org/coar/access_right/c_abf2spa
dc.rights.licenseAtribución-NoComercial-SinDerivadas 2.5 Colombiaspa
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/2.5/co/spa
dc.subjectRetrieval Augmented Generation (RAG)spa
dc.subjectMachine Learningspa
dc.subjectNatural Language Processing (NLP)spa
dc.subjectCorrective RAGspa
dc.subjectAdvanced RAGspa
dc.subjectJurislibreIAspa
dc.subjectSensoramaspa
dc.subjectBases de Datos Vectorialesspa
dc.subjectGrafosspa
dc.subjectModelos de Lenguaje (LLM)spa
dc.subject.lembGestión de datosspa
dc.subject.subjectenglishRetrieval Augmented Generation (RAG)spa
dc.subject.subjectenglishMachine Learningspa
dc.subject.subjectenglishNatural Language Processing (NLP)spa
dc.subject.subjectenglishCorrective RAGspa
dc.subject.subjectenglishAdvanced RAGspa
dc.subject.subjectenglishJurislibreIAspa
dc.subject.subjectenglishSensoramaspa
dc.subject.subjectenglishVector databasespa
dc.subject.subjectenglishGraphspa
dc.subject.subjectenglishLanguage Models (LLM)spa
dc.titleArquitectura RAG para el Contexto en PLN Generación y Acceso Inteligente de Datosspa
dc.title.alternativeWritten Work: RAG Architecture for Context in NLP: Intelligent Data Generation and Accessspa
dc.type.coarhttp://purl.org/coar/resource_type/c_7a1fspa
dc.type.driverinfo:eu-repo/semantics/bachelorThesisspa
dc.type.hasversioninfo:eu-repo/semantics/acceptedVersionspa
dc.type.localTesis de Pregradospa

Archivos

Bloque original

Mostrando 1 - 2 de 2
Cargando...
Miniatura
Nombre:
Trabajo Escrito Arquitectura RAG para el Contexto en PLN Generación y Acceso Inteligente de Datos.pdf
Tamaño:
694.8 KB
Formato:
Adobe Portable Document Format
Descripción:
Cargando...
Miniatura
Nombre:
Formato autorización PUBLICACIÓN DE OBRAS - Trabajo Escrito Arquitectura RAG para el Contexto en PLN Generación y Acceso Inteligente de Datos.pdf
Tamaño:
182.79 KB
Formato:
Adobe Portable Document Format
Descripción:

Bloque de licencias

Mostrando 1 - 1 de 1
Cargando...
Miniatura
Nombre:
license.txt
Tamaño:
1.71 KB
Formato:
Item-specific license agreed upon to submission
Descripción:

Colecciones