Mate in one

Autores: Ponce, Ezequiel
Año de publicación: 2024
Idioma: español castellano
Tipo de recurso: tesis de grado
Estado: versión publicada
Colaborador/a o director/a de tesis: Bianchi, Bruno
Corro, Luciano del
Descripción: Esta investigación explora cómo los grandes modelos de lenguaje pueden desarrollar habilidades de razonamiento a partir de datos de partidas de ajedrez, enfocándose en la predicción de jugadas de jaque mate en una. Se implementaron enfoques de Supervised Fine-Tuning (SFT) y Direct Preference Optimization (DPO) para mejorar el rendimiento en la tarea de predicción. El estudio evaluó la eficacia de diferentes representaciones de entrada y segmentó el dataset por niveles de Elo para reflejar distintas habilidades. Los resultados muestran un aumento significativo en la capacidad del modelo para predecir jugadas de jaque mate, logrando una mejora de hasta 370 veces respecto al rendimiento base. Estos hallazgos subrayan el potencial de los grandes modelos de lenguaje en el análisis estratégico y la resolución de problemas complejos.
This research explores how large language models can develop reasoning skills from chess game data, focusing on predicting checkmate-in-one moves. Approaches such as Supervised Fine-Tuning (SFT) and Direct Preference Optimization (DPO) were implemented to enhance performance in the prediction task. The study evaluated the effectiveness of different input representations and segmented the dataset by Elo levels to reflect various skill levels. The results show a significant increase in the model’s ability to predict checkmate moves, achieving an improvement of up to 370 times compared to the baseline performance. These findings highlight the potential of large language models in strategic analysis and solving complex problems.
Fil: Ponce, Ezequiel. Universidad de Buenos Aires. Facultad de Ciencias Exactas y Naturales; Argentina.
Materia: AJEDREZ
TRANSFORMERS
GRANDES MODELOS DE LENGUAJE
SUPERVISED FINE-TUNING
DIRECT PREFERENCE OPTIMIZATION
CHESS
TRANSFORMERS
LARGE LANGUAGE MODELS
SUPERVISED FINE-TUNING
DIRECT PREFERENCE OPTIMIZATION
Nivel de accesibilidad: acceso abierto
Condiciones de uso: https://creativecommons.org/licenses/by-nc-sa/2.5/ar
Repositorio
Institución: Universidad Nacional de Buenos Aires. Facultad de Ciencias Exactas y Naturales
OAI Identificador: seminario:seminario_nDAT000005_Ponce

Acceder

id	BDUBAFCEN_fdc07a6e449dc462f41350f5957041de
oai_identifier_str	seminario:seminario_nDAT000005_Ponce
network_acronym_str	BDUBAFCEN
repository_id_str	1896
network_name_str	Biblioteca Digital (UBA-FCEN)
spelling	Mate in oneMate en 1Ponce, EzequielAJEDREZTRANSFORMERSGRANDES MODELOS DE LENGUAJESUPERVISED FINE-TUNINGDIRECT PREFERENCE OPTIMIZATIONCHESSTRANSFORMERSLARGE LANGUAGE MODELSSUPERVISED FINE-TUNINGDIRECT PREFERENCE OPTIMIZATIONEsta investigación explora cómo los grandes modelos de lenguaje pueden desarrollar habilidades de razonamiento a partir de datos de partidas de ajedrez, enfocándose en la predicción de jugadas de jaque mate en una. Se implementaron enfoques de Supervised Fine-Tuning (SFT) y Direct Preference Optimization (DPO) para mejorar el rendimiento en la tarea de predicción. El estudio evaluó la eficacia de diferentes representaciones de entrada y segmentó el dataset por niveles de Elo para reflejar distintas habilidades. Los resultados muestran un aumento significativo en la capacidad del modelo para predecir jugadas de jaque mate, logrando una mejora de hasta 370 veces respecto al rendimiento base. Estos hallazgos subrayan el potencial de los grandes modelos de lenguaje en el análisis estratégico y la resolución de problemas complejos.This research explores how large language models can develop reasoning skills from chess game data, focusing on predicting checkmate-in-one moves. Approaches such as Supervised Fine-Tuning (SFT) and Direct Preference Optimization (DPO) were implemented to enhance performance in the prediction task. The study evaluated the effectiveness of different input representations and segmented the dataset by Elo levels to reflect various skill levels. The results show a significant increase in the model’s ability to predict checkmate moves, achieving an improvement of up to 370 times compared to the baseline performance. These findings highlight the potential of large language models in strategic analysis and solving complex problems.Fil: Ponce, Ezequiel. Universidad de Buenos Aires. Facultad de Ciencias Exactas y Naturales; Argentina.Universidad de Buenos Aires. Facultad de Ciencias Exactas y NaturalesBianchi, BrunoCorro, Luciano del2024-12-19info:eu-repo/semantics/bachelorThesisinfo:eu-repo/semantics/publishedVersionhttp://purl.org/coar/resource_type/c_7a1finfo:ar-repo/semantics/tesisDeGradoapplication/pdfhttps://hdl.handle.net/20.500.12110/seminario_nDAT000005_Poncespainfo:eu-repo/semantics/openAccesshttps://creativecommons.org/licenses/by-nc-sa/2.5/arreponame:Biblioteca Digital (UBA-FCEN)instname:Universidad Nacional de Buenos Aires. Facultad de Ciencias Exactas y Naturalesinstacron:UBA-FCEN2026-06-04T09:43:35Zseminario:seminario_nDAT000005_PonceInstitucionalhttps://digital.bl.fcen.uba.ar/Universidad públicaNo correspondehttps://digital.bl.fcen.uba.ar/cgi-bin/oaiserver.cgiana@bl.fcen.uba.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:18962026-06-04 09:43:36.533Biblioteca Digital (UBA-FCEN) - Universidad Nacional de Buenos Aires. Facultad de Ciencias Exactas y Naturalesfalse
dc.title.none.fl_str_mv	Mate in one Mate en 1
title	Mate in one
spellingShingle	Mate in one Ponce, Ezequiel AJEDREZ TRANSFORMERS GRANDES MODELOS DE LENGUAJE SUPERVISED FINE-TUNING DIRECT PREFERENCE OPTIMIZATION CHESS TRANSFORMERS LARGE LANGUAGE MODELS SUPERVISED FINE-TUNING DIRECT PREFERENCE OPTIMIZATION
title_short	Mate in one
title_full	Mate in one
title_fullStr	Mate in one
title_full_unstemmed	Mate in one
title_sort	Mate in one
dc.creator.none.fl_str_mv	Ponce, Ezequiel
author	Ponce, Ezequiel
author_facet	Ponce, Ezequiel
author_role	author
dc.contributor.none.fl_str_mv	Bianchi, Bruno Corro, Luciano del
dc.subject.none.fl_str_mv	AJEDREZ TRANSFORMERS GRANDES MODELOS DE LENGUAJE SUPERVISED FINE-TUNING DIRECT PREFERENCE OPTIMIZATION CHESS TRANSFORMERS LARGE LANGUAGE MODELS SUPERVISED FINE-TUNING DIRECT PREFERENCE OPTIMIZATION
topic	AJEDREZ TRANSFORMERS GRANDES MODELOS DE LENGUAJE SUPERVISED FINE-TUNING DIRECT PREFERENCE OPTIMIZATION CHESS TRANSFORMERS LARGE LANGUAGE MODELS SUPERVISED FINE-TUNING DIRECT PREFERENCE OPTIMIZATION
dc.description.none.fl_txt_mv	Esta investigación explora cómo los grandes modelos de lenguaje pueden desarrollar habilidades de razonamiento a partir de datos de partidas de ajedrez, enfocándose en la predicción de jugadas de jaque mate en una. Se implementaron enfoques de Supervised Fine-Tuning (SFT) y Direct Preference Optimization (DPO) para mejorar el rendimiento en la tarea de predicción. El estudio evaluó la eficacia de diferentes representaciones de entrada y segmentó el dataset por niveles de Elo para reflejar distintas habilidades. Los resultados muestran un aumento significativo en la capacidad del modelo para predecir jugadas de jaque mate, logrando una mejora de hasta 370 veces respecto al rendimiento base. Estos hallazgos subrayan el potencial de los grandes modelos de lenguaje en el análisis estratégico y la resolución de problemas complejos. This research explores how large language models can develop reasoning skills from chess game data, focusing on predicting checkmate-in-one moves. Approaches such as Supervised Fine-Tuning (SFT) and Direct Preference Optimization (DPO) were implemented to enhance performance in the prediction task. The study evaluated the effectiveness of different input representations and segmented the dataset by Elo levels to reflect various skill levels. The results show a significant increase in the model’s ability to predict checkmate moves, achieving an improvement of up to 370 times compared to the baseline performance. These findings highlight the potential of large language models in strategic analysis and solving complex problems. Fil: Ponce, Ezequiel. Universidad de Buenos Aires. Facultad de Ciencias Exactas y Naturales; Argentina.
description	Esta investigación explora cómo los grandes modelos de lenguaje pueden desarrollar habilidades de razonamiento a partir de datos de partidas de ajedrez, enfocándose en la predicción de jugadas de jaque mate en una. Se implementaron enfoques de Supervised Fine-Tuning (SFT) y Direct Preference Optimization (DPO) para mejorar el rendimiento en la tarea de predicción. El estudio evaluó la eficacia de diferentes representaciones de entrada y segmentó el dataset por niveles de Elo para reflejar distintas habilidades. Los resultados muestran un aumento significativo en la capacidad del modelo para predecir jugadas de jaque mate, logrando una mejora de hasta 370 veces respecto al rendimiento base. Estos hallazgos subrayan el potencial de los grandes modelos de lenguaje en el análisis estratégico y la resolución de problemas complejos.
publishDate	2024
dc.date.none.fl_str_mv	2024-12-19
dc.type.none.fl_str_mv	info:eu-repo/semantics/bachelorThesis info:eu-repo/semantics/publishedVersion http://purl.org/coar/resource_type/c_7a1f info:ar-repo/semantics/tesisDeGrado
format	bachelorThesis
status_str	publishedVersion
dc.identifier.none.fl_str_mv	https://hdl.handle.net/20.500.12110/seminario_nDAT000005_Ponce
url	https://hdl.handle.net/20.500.12110/seminario_nDAT000005_Ponce
dc.language.none.fl_str_mv	spa
language	spa
dc.rights.none.fl_str_mv	info:eu-repo/semantics/openAccess https://creativecommons.org/licenses/by-nc-sa/2.5/ar
eu_rights_str_mv	openAccess
rights_invalid_str_mv	https://creativecommons.org/licenses/by-nc-sa/2.5/ar
dc.format.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	Universidad de Buenos Aires. Facultad de Ciencias Exactas y Naturales
publisher.none.fl_str_mv	Universidad de Buenos Aires. Facultad de Ciencias Exactas y Naturales
dc.source.none.fl_str_mv	reponame:Biblioteca Digital (UBA-FCEN) instname:Universidad Nacional de Buenos Aires. Facultad de Ciencias Exactas y Naturales instacron:UBA-FCEN
reponame_str	Biblioteca Digital (UBA-FCEN)
collection	Biblioteca Digital (UBA-FCEN)
instname_str	Universidad Nacional de Buenos Aires. Facultad de Ciencias Exactas y Naturales
instacron_str	UBA-FCEN
institution	UBA-FCEN
repository.name.fl_str_mv	Biblioteca Digital (UBA-FCEN) - Universidad Nacional de Buenos Aires. Facultad de Ciencias Exactas y Naturales
repository.mail.fl_str_mv	ana@bl.fcen.uba.ar
_version_	1867090999963025408
score	12.832306

Mate in one

Publicaciones similares