Publications in Computational Linguistics
-
PARSEME Meets Universal Dependencies: Getting on the Same Page in Representing Multiword Expressions
Part of Northern European Journal of Language Technology (NEJLT), 2023.
DOI for PARSEME Meets Universal Dependencies: Getting on the Same Page in Representing Multiword Expressions Download full text (pdf) of PARSEME Meets Universal Dependencies: Getting on the Same Page in Representing Multiword Expressions
-
LingFN: A Framenet for the Linguistic Domain
Part of Computational Linguistics and Intelligent Text Processing, p. 367-379, 2023.
-
A linguistic sketch of Kanashi
Part of Synchronic and diachronic aspects of Kanashi, p. 53-127, 2022.
DOI for A linguistic sketch of Kanashi Download full text (pdf) of A linguistic sketch of Kanashi
-
Clues to Kanashi prehistory 2: Loanword adaptation in verbs
Part of Synchronic and diachronic aspects of Kanashi, p. 215-234, 2022.
DOI for Clues to Kanashi prehistory 2: Loanword adaptation in verbs Download full text (pdf) of Clues to Kanashi prehistory 2: Loanword adaptation in verbs
-
Clues to Kanashi prehistory 1: Loanword adaptation in nouns and adjectives
Part of Synchronic and diachronic aspects of Kanashi, p. 173-214, 2022.
DOI for Clues to Kanashi prehistory 1: Loanword adaptation in nouns and adjectives Download full text (pdf) of Clues to Kanashi prehistory 1: Loanword adaptation in nouns and adjectives
-
And then there was one: Kanashi numerals from borrowed superdiversity to borrowed uniformity
Part of Synchronic and diachronic aspects of Kanashi, p. 145-170, 2022.
DOI for And then there was one: Kanashi numerals from borrowed superdiversity to borrowed uniformity Download full text (pdf) of And then there was one: Kanashi numerals from borrowed superdiversity to borrowed uniformity
-
Quotation and Narration in Contemporary Popular Fiction in Swedish: Stylometric Explorations
Part of Proceedings of the 6th Digital Humanities in the Nordic and Baltic Countries Conference (DHNB 2022), p. 203-211, 2022.
-
Swedish Diachronic Corpus
Part of CLARIN, p. 561-585, 2022.
DOI for Swedish Diachronic Corpus Download full text (pdf) of Swedish Diachronic Corpus
-
Schrödinger's tree: On syntax and neural language models
Part of Frontiers in Artificial Intelligence, 2022.
DOI for Schrödinger's tree: On syntax and neural language models Download full text (pdf) of Schrödinger's tree: On syntax and neural language models
-
A Few Thousand Translations Go A Long Way! Leveraging Pre-trained Models for African News Translation
Part of NAACL 2022, p. 3053-3070, 2022.
-
To the Most Gracious Highness, from Your Humble Servant: Analysing Swedish 18th Century Petitions Using Text Classification
Part of Proceedings of the 6th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, p. 53-64, 2022.
-
A Tale of Four Parsers: Methodological Reflections on Diagnostic Evaluation and In-Depth Error Analysis for Meaning Representation Parsing
Part of Language Resources and Evaluation, p. 1075-1102, 2022.
DOI for A Tale of Four Parsers: Methodological Reflections on Diagnostic Evaluation and In-Depth Error Analysis for Meaning Representation Parsing Download full text (pdf) of A Tale of Four Parsers: Methodological Reflections on Diagnostic Evaluation and In-Depth Error Analysis for Meaning Representation Parsing
-
Tha sound system of Kanashi
Part of Synchronic and diachronic aspects of Kanashi, p. 13-51, 2022.
-
Linguistic variation: A challenge for describing the phonology of Kanashi
Part of Synchronic and diachronic aspects of Kanashi, p. 131-144, 2022.
DOI for Linguistic variation: A challenge for describing the phonology of Kanashi
-
Kanashi and West Himalayish: Genealogy, language contact, prehistoric migrations
Part of Synchronic and diachronic aspects of Kanashi, p. 237-254, 2022.
DOI for Kanashi and West Himalayish: Genealogy, language contact, prehistoric migrations
-
Introduction: Kanashi, its speakers, its linguistic and extralinguistic context
Part of Synchronic and diachronic aspects of Kanashi, p. 3-11, 2022.
DOI for Introduction: Kanashi, its speakers, its linguistic and extralinguistic context
-
Cause and Effect in Governmental Reports: Two Data Sets for Causality Detection in Swedish
Part of Proceedings of the LREC 2022 Workshop on Natural Language Processing for Political Sciences, p. 46-55, 2022.
-
Fine-Grained Controllable Text Generation Using Non-Residual Prompting
Part of Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1 Long Papers), p. 6837-6857, 2022.
DOI for Fine-Grained Controllable Text Generation Using Non-Residual Prompting
-
Nucleus Composition in Transition-Based Dependency Parsing
Part of Computational Linguistics, p. 849-886, 2022.
-
Processing of Condition Monitoring Annotations with BERT and Technical Language Processing
Part of PHM Society European Conference, p. 306-314, 2022.
-
Exploring Cross-Lingual Transfer to Counteract Data Scarcity for Causality Detection
Part of WWW '22, p. 501-508, 2022.
DOI for Exploring Cross-Lingual Transfer to Counteract Data Scarcity for Causality Detection Download full text (pdf) of Exploring Cross-Lingual Transfer to Counteract Data Scarcity for Causality Detection
-
Overview of Touché 2022: Argument Retrieval
Part of Experimental IR Meets Multilinguality, Multimodality, and Interaction (CLEF 2022), p. 311-336, 2022.
-
The DECODE Database of Historical Ciphers and Keys: Version 2
Part of Proceedings of the 5th International Conference on Historical Cryptology. HistoCrypt 2022., p. 111-114, 2022.
DOI for The DECODE Database of Historical Ciphers and Keys: Version 2
-
Identifying Cleartext in Historical Ciphers
Part of Proceedings of the Workshop on Language Technologies for Historical and Ancient Languages. LT4HALA 2022., 2022.
-
Lost in Transcription of Graphic Signs in Ciphers
Part of Proceedings of the 5th International Conference on Historical Cryptology. HistoCrypt 2022, p. 153-158, 2022.
-
What Was Encoded in Historical Cipher Keys in the Early Modern Era?
Part of Proceedings of the 5th International Conference on Historical Cryptology. HistoCrypt 2022., 2022.
DOI for What Was Encoded in Historical Cipher Keys in the Early Modern Era?
-
Zero-Shot Dependency Parsing with Worst-Case Aware Automated Curriculum Learning
Part of Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Acl 2022), p. 578-587, 2022.
DOI for Zero-Shot Dependency Parsing with Worst-Case Aware Automated Curriculum Learning
-
Word Order Does Matter (And Shuffled Language Models Know It)
Part of PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1, p. 6907-6919, 2022.
-
Challenges and Strategies in Cross-Cultural NLP
Part of PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1, p. 6997-7013, 2022.
-
SLäNDa Version 2.0: Improved and Extended Annotation of Narrative and Dialogue in Swedish Literature
Part of Proceedings of the 13th International Conference on Language Resources and Evaluation (LREC 2022), p. 5324-5333, 2022.
-
Uppsala University at SemEval-2022 Task 1: Can Foreign Entries Enhance an English Reverse Dictionary?
Part of Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022), p. 88-93, 2022.
-
Cause and Effect in Governmental Reports: Two Data Sets for Causality Detection in Swedish
Part of Proceedings of the First Workshop on Natural Language Processing for Political Sciences (PoliticalNLP), p. 46-55, 2022.
-
Few shots are all you need: A progressive learning approach for low resource handwritten text recognition
Part of Pattern Recognition Letters, p. 43-49, 2022.
DOI for Few shots are all you need: A progressive learning approach for low resource handwritten text recognition Download full text (pdf) of Few shots are all you need: A progressive learning approach for low resource handwritten text recognition
-
A bird’s-eye view on South Asian languages through LSI
Part of Journal of South Asian languages and linguistics, p. 203-237, 2021.
DOI for A bird’s-eye view on South Asian languages through LSI
-
Syntactic Nuclei in Dependency Parsing –: A Multilingual Exploration
Part of Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics, p. 1376-1387, 2021.
DOI for Syntactic Nuclei in Dependency Parsing –: A Multilingual Exploration Download full text (pdf) of Syntactic Nuclei in Dependency Parsing –: A Multilingual Exploration
-
Attention Can Reflect Syntactic Structure (If You Let It)
Part of Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics, p. 3031-3045, 2021.
DOI for Attention Can Reflect Syntactic Structure (If You Let It)
-
Audiobook stylistics: Comparing print and audio in the bestselling segment
Part of Journal of Cultural Analytics, p. 1-30, 2021.
DOI for Audiobook stylistics: Comparing print and audio in the bestselling segment Download full text (pdf) of Audiobook stylistics: Comparing print and audio in the bestselling segment
-
Have Attention Heads in BERT Learned Constituency Grammar?
Part of Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics, p. 8-15, 2021.
DOI for Have Attention Heads in BERT Learned Constituency Grammar? Download full text (pdf) of Have Attention Heads in BERT Learned Constituency Grammar?
-
Whit’s the Richt Pairt o Speech: PoS tagging for Scots
Part of Proceedings of the Eighth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial), p. 39-48, 2021.
Download full text (pdf) of Whit’s the Richt Pairt o Speech: PoS tagging for Scots
-
Investigation of Transfer Languages for Parsing Latin: Italic Branch vs. Hellenic Branch
Part of Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa), p. 315-320, 2021.
-
Survey and reproduction of computational approaches to dating of historical texts
Part of Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa), p. 145-156, 2021.
-
Uppsala NLP at SemEval-2021 Task 2: Multilingual Language Models for Fine-tuning and Feature Extraction in Word-in-Context Disambiguation
Part of Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021), p. 150-156, 2021.
-
Swedish FrameNet++ and comparative linguistics
Part of The Swedish FrameNet++, p. 139-166, 2021.
-
Universal Dependencies
Part of Computational Linguistics, p. 255-308, 2021.
DOI for Universal Dependencies Download full text (pdf) of Universal Dependencies
-
Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics
Part of Dagstuhl Reports, p. 89-138, 2021.
DOI for Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics
-
Revisiting Negation in Neural Machine Translation
Part of Transactions of the Association for Computational Linguistics, p. 740-755, 2021.
-
Bidirectional Domain Adaptation Using Weighted Multi-Task Learning
Part of IWPT 2021, p. 93-105, 2021.
DOI for Bidirectional Domain Adaptation Using Weighted Multi-Task Learning
-
Unsupervised Alphabet Matching in Historical Encrypted Manuscript Images
Part of Proceedings of the 4th International Conference on Historical Cryptology HistoCrypt 2021, 2021.
DOI for Unsupervised Alphabet Matching in Historical Encrypted Manuscript Images Download full text (pdf) of Unsupervised Alphabet Matching in Historical Encrypted Manuscript Images
-
Key Design in the Early Modern Era in Europe
Part of Proceedings of the 4th International Conference on Historical Cryptology (HistoCrypt 2021), 2021.
DOI for Key Design in the Early Modern Era in Europe Download full text (pdf) of Key Design in the Early Modern Era in Europe
-
Tang, Gongbo
Understanding Neural Machine Translation: An investigation into linguistic phenomena and attention mechanisms
2020.
-
Czech Historical Named Entity Corpus v 1.0
Part of Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020), p. 4458-4465, 2020.
Download full text (pdf) of Czech Historical Named Entity Corpus v 1.0
-
What Should/Do/Can LSTMs Learn When Parsing Auxiliary Verb Constructions?
Part of Computational linguistics - Association for Computational Linguistics (Print), p. 763-784, 2020.
DOI for What Should/Do/Can LSTMs Learn When Parsing Auxiliary Verb Constructions? Download full text (pdf) of What Should/Do/Can LSTMs Learn When Parsing Auxiliary Verb Constructions?
-
The DReaM Corpus: A Multilingual Annotated Corpus of Grammars for the World's Languages
Part of Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), p. 878-884, 2020.
-
Czech Historical Named Entity Corpus v 1.0
Part of 12th Conference on Language Resources and Evaluation (LREC 2020), p. 4458-4465, 2020.
Download full text (pdf) of Czech Historical Named Entity Corpus v 1.0
-
Exploiting Cross-lingual Hints to Discover Event Pronouns
Part of Proceedings of the 12th Conference on Linguistic Resources and Evaluation (LREC), p. 99-103, 2020.
Download full text (pdf) of Exploiting Cross-lingual Hints to Discover Event Pronouns
-
A Tale of Three Parsers: Towards Diagnostic Evaluation for Meaning Representation Parsing
Part of Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020), p. 1902-1909, 2020.
-
SLäNDa: An Annotated Corpus of Narrative and Dialogue in Swedish Literary Fiction
Part of Proceedings of the 12th Language Resources and Evaluation Conference, p. 826-834, 2020.
-
A bird’s-eye view on South Asian languages through LSI
Part of Journal of South Asian languages and linguistics, p. 203-237, 2020.
DOI for A bird’s-eye view on South Asian languages through LSI Download full text (pdf) of A bird’s-eye view on South Asian languages through LSI
-
Understanding Pure Character-Based Neural Machine Translation: The Case of Translating Finnish into English
Part of Proceedings of the 28th International Conference on Computational Linguistics, p. 4251-4262, 2020.
-
Real-valued syntactic word vectors
Part of Journal of experimental and theoretical artificial intelligence (Print), p. 557-579, 2020.
DOI for Real-valued syntactic word vectors Download full text (pdf) of Real-valued syntactic word vectors
-
Multilingual Dependency Parsing from Universal Dependencies to Sesame Street
Part of Text, Speech, and Dialogue (TSD 2020), p. 11-29, 2020.
DOI for Multilingual Dependency Parsing from Universal Dependencies to Sesame Street
-
Cross-Lingual Domain Adaptation for Dependency Parsing
Part of Proceedings of the 19th International Workshop on Treebanks and Linguistic Theories (TLT), p. 62-69, 2020.
DOI for Cross-Lingual Domain Adaptation for Dependency Parsing Download full text (pdf) of Cross-Lingual Domain Adaptation for Dependency Parsing
-
Cross-lingual Embeddings Reveal Universal and Lineage-Specific Patterns in Grammatical Gender Assignment
Part of Proceedings of the the 24th Conference on Computational Natural Language Learning, p. 265-275, 2020.
DOI for Cross-lingual Embeddings Reveal Universal and Lineage-Specific Patterns in Grammatical Gender Assignment Download full text (pdf) of Cross-lingual Embeddings Reveal Universal and Lineage-Specific Patterns in Grammatical Gender Assignment
-
Evaluating Word Embeddings for Indonesian–English Code-Mixed Text Based on Synthetic Data
Part of Proceedings of the 4th Workshop on Computational Approaches to Code Switching, p. 26-35, 2020.
-
Edition 1.2 of the PARSEME Shared Task on Semi-supervised Identification of Verbal Multiword Expressions
Part of Proceedings of the Joint Workshop on Multiword Expressions and Electronic Lexicons, p. 107-118, 2020.
-
The University of Edinburgh-Uppsala University's Submission to the WMT 2020 Chat Translation Task
Part of Proceedings of the 5th Conference on Machine Translation (WMT), p. 473-478, 2020.
-
Coreference Strategies in English-German Translation
Part of Proceedings of the 3rd Workshop on Computational Models of Reference, Anaphora and Coreference, p. 139-153, 2020.
Download full text (pdf) of Coreference Strategies in English-German Translation
-
IESTAC: English-Italian Parallel Corpus for End-to-End Speech-to-Text Machine Translation
Part of Proceedings of the First International Workshop on Natural Language Processing Beyond Text, p. 41-50, 2020.
DOI for IESTAC: English-Italian Parallel Corpus for End-to-End Speech-to-Text Machine Translation
-
Text Processing Procedures for Analysing a Corpus with Medieval Marian Miracle Tales in Old Swedish
Part of Proceedings of the 12th International Conference on Agents and Artificial Intelligence, p. 452-458, 2020.
-
Marian Miracles in Old Swedish Texts
Part of Les miracles de Notre-Dame du Moyen Âge à nos jours, p. 179-190, 2020.
-
Towards Privacy by Design in Learner Corpora Research: A Case of On-the-fly Pseudonymization of Swedish Learner Essays
Part of Proceedings of the 28th International Conference on Computational Linguistics. COLING 2020, p. 357-369, 2020.
-
Kopsala: Transition-Based Graph Parsing via Efficient Training and Effective Encoding
Part of 16th International Conference on Parsing Technologies and IWPT 2020 Shared Task on Parsing Into Enhanced Universal Dependencies, p. 236-244, 2020.
DOI for Kopsala: Transition-Based Graph Parsing via Efficient Training and Effective Encoding
-
Transcription of Historical Ciphers and Keys
Part of Proceedings of the 3rd International Conference on Historical Cryptology, p. 106-115, 2020.
Download full text (pdf) of Transcription of Historical Ciphers and Keys
-
Automatic Key Structure Extraction
Part of Proceedings of the 3rd International Conference on Historical Cryptology, p. 146-152, 2020.
Download full text (pdf) of Automatic Key Structure Extraction
-
Rubenson on the Move: A Biographical Journey
Part of Wisdom on the Move, p. 247-250, 2020.
-
Classification of Medieval Documents: Determining the Issuer, Place of Issue, and Decade for Old Swedish Charters
Part of DHN 2020 Digital Humanities in the Nordic Countries, p. 12-23, 2020.
-
A Web-based Interactive Transcription Tool for Encrypted Manuscripts
Part of Proceedings of the 3rd International Conference on Historical Cryptology HistoCrypt 2020, 2020.
DOI for A Web-based Interactive Transcription Tool for Encrypted Manuscripts Download full text (pdf) of A Web-based Interactive Transcription Tool for Encrypted Manuscripts
-
A Statistical Explanation of the Distribution of Sortal Classifiers in Languages of the World via Computational Classifiers
Part of Journal of Quantitative Linguistics, p. 93-113, 2020.
DOI for A Statistical Explanation of the Distribution of Sortal Classifiers in Languages of the World via Computational Classifiers Download full text (pdf) of A Statistical Explanation of the Distribution of Sortal Classifiers in Languages of the World via Computational Classifiers
-
Encoders Help You Disambiguate Word Senses in Neural Machine Translation
Part of Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), p. 1429-1435, 2019.
DOI for Encoders Help You Disambiguate Word Senses in Neural Machine Translation
-
Deep Contextualized Word Embeddings in Transition-Based and Graph-Based Dependency Parsing – A Tale of Two Parsers Revisited
Part of Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), p. 2755-2768, 2019.