Adding frequencies to the LGLex lexicon with IRASUBCAT
- Autores
- Tolone, Elsa; Altamirano, Romina
- Año de publicación
- 2013
- Idioma
- inglés
- Tipo de recurso
- documento de conferencia
- Estado
- versión publicada
- Descripción
- We present a method for enlarge a lexicon (with frequencies information), that is useful for parsing and others NLP applications. We show an example enlarging the verbal LGLex lexicon of French [8], using several corpora extracted from the evaluation campaign for French parsers Passage [5]. To do that, we use the results of the frmg parser [7] with IRASubcat, a tool that automatically acquires subcategorization frames from corpus in any language and that also allows to complete an existing lexicon. We obtain the frequencies of occurrence for each input and each subcategorization frame for 14,068 distinct lemmas.
Sociedad Argentina de Informática e Investigación Operativa - Materia
-
Ciencias Informáticas
Lexicon-Grammar
syntactic lexicon
french lexicon
subcategorization
frequency of occurence - Nivel de accesibilidad
- acceso abierto
- Condiciones de uso
- http://creativecommons.org/licenses/by-sa/4.0/
- Repositorio
- Institución
- Universidad Nacional de La Plata
- OAI Identificador
- oai:sedici.unlp.edu.ar:10915/76359
Ver los metadatos del registro completo
id |
SEDICI_62837b33f72f282d5a2685fa508a4a32 |
---|---|
oai_identifier_str |
oai:sedici.unlp.edu.ar:10915/76359 |
network_acronym_str |
SEDICI |
repository_id_str |
1329 |
network_name_str |
SEDICI (UNLP) |
spelling |
Adding frequencies to the LGLex lexicon with IRASUBCATTolone, ElsaAltamirano, RominaCiencias InformáticasLexicon-Grammarsyntactic lexiconfrench lexiconsubcategorizationfrequency of occurenceWe present a method for enlarge a lexicon (with frequencies information), that is useful for parsing and others NLP applications. We show an example enlarging the verbal LGLex lexicon of French [8], using several corpora extracted from the evaluation campaign for French parsers Passage [5]. To do that, we use the results of the frmg parser [7] with IRASubcat, a tool that automatically acquires subcategorization frames from corpus in any language and that also allows to complete an existing lexicon. We obtain the frequencies of occurrence for each input and each subcategorization frame for 14,068 distinct lemmas.Sociedad Argentina de Informática e Investigación Operativa2013-09info:eu-repo/semantics/conferenceObjectinfo:eu-repo/semantics/publishedVersionObjeto de conferenciahttp://purl.org/coar/resource_type/c_5794info:ar-repo/semantics/documentoDeConferenciaapplication/pdf202-205http://sedici.unlp.edu.ar/handle/10915/76359enginfo:eu-repo/semantics/altIdentifier/url/http://42jaiio.sadio.org.ar/proceedings/simposios/Trabajos/ASAI/19.pdfinfo:eu-repo/semantics/altIdentifier/issn/1850-2784info:eu-repo/semantics/openAccesshttp://creativecommons.org/licenses/by-sa/4.0/Creative Commons Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)reponame:SEDICI (UNLP)instname:Universidad Nacional de La Platainstacron:UNLP2025-10-15T11:05:25Zoai:sedici.unlp.edu.ar:10915/76359Institucionalhttp://sedici.unlp.edu.ar/Universidad públicaNo correspondehttp://sedici.unlp.edu.ar/oai/snrdalira@sedici.unlp.edu.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:13292025-10-15 11:05:25.34SEDICI (UNLP) - Universidad Nacional de La Platafalse |
dc.title.none.fl_str_mv |
Adding frequencies to the LGLex lexicon with IRASUBCAT |
title |
Adding frequencies to the LGLex lexicon with IRASUBCAT |
spellingShingle |
Adding frequencies to the LGLex lexicon with IRASUBCAT Tolone, Elsa Ciencias Informáticas Lexicon-Grammar syntactic lexicon french lexicon subcategorization frequency of occurence |
title_short |
Adding frequencies to the LGLex lexicon with IRASUBCAT |
title_full |
Adding frequencies to the LGLex lexicon with IRASUBCAT |
title_fullStr |
Adding frequencies to the LGLex lexicon with IRASUBCAT |
title_full_unstemmed |
Adding frequencies to the LGLex lexicon with IRASUBCAT |
title_sort |
Adding frequencies to the LGLex lexicon with IRASUBCAT |
dc.creator.none.fl_str_mv |
Tolone, Elsa Altamirano, Romina |
author |
Tolone, Elsa |
author_facet |
Tolone, Elsa Altamirano, Romina |
author_role |
author |
author2 |
Altamirano, Romina |
author2_role |
author |
dc.subject.none.fl_str_mv |
Ciencias Informáticas Lexicon-Grammar syntactic lexicon french lexicon subcategorization frequency of occurence |
topic |
Ciencias Informáticas Lexicon-Grammar syntactic lexicon french lexicon subcategorization frequency of occurence |
dc.description.none.fl_txt_mv |
We present a method for enlarge a lexicon (with frequencies information), that is useful for parsing and others NLP applications. We show an example enlarging the verbal LGLex lexicon of French [8], using several corpora extracted from the evaluation campaign for French parsers Passage [5]. To do that, we use the results of the frmg parser [7] with IRASubcat, a tool that automatically acquires subcategorization frames from corpus in any language and that also allows to complete an existing lexicon. We obtain the frequencies of occurrence for each input and each subcategorization frame for 14,068 distinct lemmas. Sociedad Argentina de Informática e Investigación Operativa |
description |
We present a method for enlarge a lexicon (with frequencies information), that is useful for parsing and others NLP applications. We show an example enlarging the verbal LGLex lexicon of French [8], using several corpora extracted from the evaluation campaign for French parsers Passage [5]. To do that, we use the results of the frmg parser [7] with IRASubcat, a tool that automatically acquires subcategorization frames from corpus in any language and that also allows to complete an existing lexicon. We obtain the frequencies of occurrence for each input and each subcategorization frame for 14,068 distinct lemmas. |
publishDate |
2013 |
dc.date.none.fl_str_mv |
2013-09 |
dc.type.none.fl_str_mv |
info:eu-repo/semantics/conferenceObject info:eu-repo/semantics/publishedVersion Objeto de conferencia http://purl.org/coar/resource_type/c_5794 info:ar-repo/semantics/documentoDeConferencia |
format |
conferenceObject |
status_str |
publishedVersion |
dc.identifier.none.fl_str_mv |
http://sedici.unlp.edu.ar/handle/10915/76359 |
url |
http://sedici.unlp.edu.ar/handle/10915/76359 |
dc.language.none.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
info:eu-repo/semantics/altIdentifier/url/http://42jaiio.sadio.org.ar/proceedings/simposios/Trabajos/ASAI/19.pdf info:eu-repo/semantics/altIdentifier/issn/1850-2784 |
dc.rights.none.fl_str_mv |
info:eu-repo/semantics/openAccess http://creativecommons.org/licenses/by-sa/4.0/ Creative Commons Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) |
eu_rights_str_mv |
openAccess |
rights_invalid_str_mv |
http://creativecommons.org/licenses/by-sa/4.0/ Creative Commons Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) |
dc.format.none.fl_str_mv |
application/pdf 202-205 |
dc.source.none.fl_str_mv |
reponame:SEDICI (UNLP) instname:Universidad Nacional de La Plata instacron:UNLP |
reponame_str |
SEDICI (UNLP) |
collection |
SEDICI (UNLP) |
instname_str |
Universidad Nacional de La Plata |
instacron_str |
UNLP |
institution |
UNLP |
repository.name.fl_str_mv |
SEDICI (UNLP) - Universidad Nacional de La Plata |
repository.mail.fl_str_mv |
alira@sedici.unlp.edu.ar |
_version_ |
1846064108528467968 |
score |
13.22299 |