Zilio, L., Finatto, M.J.B. and Villavicencio, A. orcid.org/0000-0002-3731-9168 (2016) Verblexpor: A lexical resource with semantic roles for Portuguese. In: Calzolari, N., Choukri, K., Declerck, T., Goggi, S., Grobelnik, M., Maegaard, B., Mariani, J., Mazo, H., Moreno, A., Odijk, J. and Piperidis , S., (eds.) Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16). Tenth International Conference on Language Resources and Evaluation (LREC'16), 23-28 May 2016, Portorož, Slovenia. European Language Resources Association (ELRA) , pp. 2656-2661. ISBN 9782951740891
Abstract
This paper presents a lexical resource developed for Portuguese. The resource contains sentences annotated with semantic roles. The sentences were extracted from two domains: Cardiology research papers and newspaper articles. Both corpora were analyzed with the PALAVRAS parser and subsequently processed with a subcategorization frames extractor, so that each sentence that contained at least one main verb was stored in a database together with its syntactic organization. The annotation was manually carried out by a linguist using an annotation interface. Both the annotated and non-annotated data were exported to an XML format, which is readily available for download. The reason behind exporting non-annotated data is that there is syntactic information collected from the parser annotation in the non-annotated data, and this could be useful for other researchers. The sentences from both corpora were annotated separately, so that it is possible to access sentences either from the Cardiology or from the newspaper corpus. The full resource presents more than seven thousand semantically annotated sentences, containing 192 different verbs and more than 15 thousand individual arguments and adjuncts.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Editors: |
|
Copyright, Publisher and Additional Information: | © 2016 European Language Resources Association (ELRA) |
Keywords: | Semantic Role Labeling; Lexical Resource; Corpus |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 21 Nov 2019 15:44 |
Last Modified: | 21 Nov 2019 15:44 |
Published Version: | https://www.aclweb.org/anthology/L16-1422 |
Status: | Published |
Publisher: | European Language Resources Association (ELRA) |
Refereed: | Yes |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:153560 |