NLP-enhanced Shift Analysis of Named Entities in an English<>Spanish Intermodal Corpus of European Petitions


Book chapter


Gloria Corpas Pastor, Fernando Sánchez Rodas
Marta Kajzer-Wietrzny, Adriano Ferraresi, Ilmari Ivaska, Silvia Bernardini, Mediated discourse at the European Parliament: Empirical investigations, Language Science Press, Berlin, 2022, pp. 219-251


PDF
Cite

Cite

APA   Click to copy
Pastor, G. C., & Rodas, F. S. (2022). NLP-enhanced Shift Analysis of Named Entities in an English<>Spanish Intermodal Corpus of European Petitions. In M. Kajzer-Wietrzny, A. Ferraresi, I. Ivaska, & S. Bernardini (Eds.), Mediated discourse at the European Parliament: Empirical investigations (pp. 219–251). Berlin: Language Science Press. https://doi.org/10.5281/zenodo.6977052


Chicago/Turabian   Click to copy
Pastor, Gloria Corpas, and Fernando Sánchez Rodas. “NLP-Enhanced Shift Analysis of Named Entities in an English≪≫Spanish Intermodal Corpus of European Petitions.” In Mediated Discourse at the European Parliament: Empirical Investigations, edited by Marta Kajzer-Wietrzny, Adriano Ferraresi, Ilmari Ivaska, and Silvia Bernardini, 219–251. Berlin: Language Science Press, 2022.


MLA   Click to copy
Pastor, Gloria Corpas, and Fernando Sánchez Rodas. “NLP-Enhanced Shift Analysis of Named Entities in an English≪≫Spanish Intermodal Corpus of European Petitions.” Mediated Discourse at the European Parliament: Empirical Investigations, edited by Marta Kajzer-Wietrzny et al., Language Science Press, 2022, pp. 219–51, doi:10.5281/zenodo.6977052.


BibTeX   Click to copy

@inbook{gloria2022a,
  title = {NLP-enhanced Shift Analysis of Named Entities in an English<>Spanish Intermodal Corpus of European Petitions},
  year = {2022},
  address = {Berlin},
  pages = {219-251},
  publisher = {Language Science Press},
  doi = {10.5281/zenodo.6977052},
  author = {Pastor, Gloria Corpas and Rodas, Fernando Sánchez},
  editor = {Kajzer-Wietrzny, Marta and Ferraresi, Adriano and Ivaska, Ilmari and Bernardini, Silvia},
  booktitle = {Mediated discourse at the European Parliament: Empirical investigations}
}

Description
This chapter aims at presenting an NLP-enhanced corpus-based analysis of the translation and interpreting shifts observed in the named entities (NEs) of PETIMOD, an English<>Spanish intermodal corpus of written and oral mediated texts from the Committee on Petitions of the European Parliament. Our main assumption is that shifts in institutional genres mostly occur in the transfer of NEs, and that NLP techniques such as automatic Named Entity Recognition (NER) can be applied to systematically extract and compare examples of these shifts. Results show that traits like normalisation, transformation and simplification depend not only on the language direction or the mediation mode, but also on the semantic category (person, organisation, etc.) of the NE involved.