Seminar in Computational Linguistics
- Date: –15:00
- Location: Engelska parken 9-3042
- Lecturer: Paola Merlo
- Contact person: Miryam de Lhoneux
- Seminarium
Probing word and sentence embeddings for long-distance dependencies effects in French and English
Abstract
Despite their practical success and impressive performances, neural-network-based and distributed semantics techniques have often been criticized as they remain fundamentally opaque and difficult to interpret. Several recent pieces of work have investigated the linguistic abilities of these representations, and whether they can capture long agreement and thus hierarchical notions. Results are at present inconclusive. In this vein, we study another core, defining and more challenging property of language: the ability to construe long-distance dependencies. Human languages exhibit the ability to interpret discontinuous elements distant from each other in the string as if they were adjacent. This ability is blocked if a similar, but extraneous, element intervenes between the discontinuous components. We present results that show that word embeddings and the similarity spaces they define do not unequivocally correlate with experimental results on intervention similarity in long-distance dependencies narrowly defined, even if translated in more 'syntactic' portion of the distributional space or tested in prediction tasks. These results show that the linguistic encoding in distributed representations does not appear to be human-like, and it also brings evidence to the debate on narrow or broad definitions of similarity in syntax and sentence processing.
Bio
Paola Merlo is associate professor in the Linguistics department of the University of Geneva. She is the head of the interdisciplinary research group Computational Learning and Computational Linguistics (CLCL). The group is concerned with interdisciplinary research combining linguistic modelling with machine learning techniques. Prof. Merlo has been editor of Computational Linguistics, published by MIT Press and a member of the executive committee of the ACL. Prof. Merlo holds a doctorate in Computational Linguistics from the University of Maryland, USA. She has been associate research fellow at the University of Pennsylvania, and visiting scholar at Rutgers, Edinburgh, and Stanford.