Publications in Computational Linguistics
-
What Causes Unemployment?: Unsupervised Causality Mining from Swedish Governmental Reports
Part of Proceedings of the Second Workshop on Resources and Representations for Under-Resourced Languages and Domains (RESOURCEFUL-2023), p. 25-29, 2023.
-
Kulmizev, Artur
The Search for Syntax: Investigating the Syntactic Knowledge of Neural Language Models Through the Lens of Dependency Parsing
2023.
-
Towards Data-effective Educational Question Generation with Prompt-based Learning
Part of Proceedings of 2023 Computing Conference, 2023.
-
Historical Language Models in Cryptanalysis: Case Studies on English and German
Part of Proceedings of the 6th International Conference on Historical Cryptology (HistoCrypt 2023), 2023.
-
What is the Code for the Code? Historical Cryptology Terminology
Part of Proceedings of the 6th International Conference on Historical Cryptology (HistoCrypt 2023), 2023.
-
PARSEME Corpus Release 1.3
Part of Proceedings of the 19th Workshop on Multiword Expressions (MWE 2023), p. 24-35, 2023.
-
Parser Evaluation for Analyzing Swedish 19th–20th Century Literature
Part of Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa), p. 335-346, 2023.
-
Multilingual Automatic Speech Recognition for Scandinavian Languages
Part of Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa), p. 460-466, 2023.
-
PARSEME Meets Universal Dependencies: Getting on the Same Page in Representing Multiword Expressions
Part of Northern European Journal of Language Technology (NEJLT), 2023.
-
LingFN: A Framenet for the Linguistic Domain
Part of Computational Linguistics and Intelligent Text Processing, p. 367-379, 2023.
-
Cause and Effect in Governmental Reports: Two Data Sets for Causality Detection in Swedish
Part of Proceedings of the First Workshop on Natural Language Processing for Political Sciences (PoliticalNLP), p. 46-55, 2022.
-
Nucleus Composition in Transition-Based Dependency Parsing
Part of Computational Linguistics, p. 849-886, 2022.
-
SLäNDa Version 2.0: Improved and Extended Annotation of Narrative and Dialogue in Swedish Literature
Part of Proceedings of the 13th International Conference on Language Resources and Evaluation (LREC 2022), p. 5324-5333, 2022.
-
Schrödinger's tree: On syntax and neural language models
Part of Frontiers in Artificial Intelligence, 2022.
-
Linguistic variation: A challenge for describing the phonology of Kanashi
Part of Synchronic and diachronic aspects of Kanashi, p. 131-144, 2022.
-
Linking endangerment databases and descriptive linguistics: an assessment of the use of terms relating to language endangerment in grammars
Part of Language Documentation & Conservation, p. 290-318, 2022.
-
Kanashi and West Himalayish: Genealogy, language contact, prehistoric migrations
Part of Synchronic and diachronic aspects of Kanashi, p. 237-254, 2022.
-
Introduction: Kanashi, its speakers, its linguistic and extralinguistic context
Part of Synchronic and diachronic aspects of Kanashi, p. 3-11, 2022.
-
A linguistic sketch of Kanashi
Part of Synchronic and diachronic aspects of Kanashi, p. 53-127, 2022.
-
Clues to Kanashi prehistory 2: Loanword adaptation in verbs
Part of Synchronic and diachronic aspects of Kanashi, p. 215-234, 2022.
-
Clues to Kanashi prehistory 1: Loanword adaptation in nouns and adjectives
Part of Synchronic and diachronic aspects of Kanashi, p. 173-214, 2022.
-
And then there was one: Kanashi numerals from borrowed superdiversity to borrowed uniformity
Part of Synchronic and diachronic aspects of Kanashi, p. 145-170, 2022.
-
Quotation and Narration in Contemporary Popular Fiction in Swedish: Stylometric Explorations
Part of Proceedings of the 6th Digital Humanities in the Nordic and Baltic Countries Conference (DHNB 2022), p. 203-211, 2022.
-
Swedish Diachronic Corpus
Part of CLARIN, p. 561-585, 2022.
-
A Few Thousand Translations Go A Long Way! Leveraging Pre-trained Models for African News Translation
Part of NAACL 2022, p. 3053-3070, 2022.
-
To the Most Gracious Highness, from Your Humble Servant: Analysing Swedish 18th Century Petitions Using Text Classification
Part of Proceedings of the 6th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, p. 53-64, 2022.
-
A Tale of Four Parsers: Methodological Reflections on Diagnostic Evaluation and In-Depth Error Analysis for Meaning Representation Parsing
Part of Language Resources and Evaluation, p. 1075-1102, 2022.
-
Exploring Cross-Lingual Transfer to Counteract Data Scarcity for Causality Detection
Part of WWW '22, p. 501-508, 2022.
-
Overview of Touché 2022: Argument Retrieval
Part of Experimental IR Meets Multilinguality, Multimodality, and Interaction (CLEF 2022), p. 311-336, 2022.
-
The DECODE Database of Historical Ciphers and Keys: Version 2
Part of Proceedings of the 5th International Conference on Historical Cryptology (HistoCrypt 2022), p. 111-114, 2022.
-
Identifying Cleartext in Historical Ciphers
Part of Proceedings of the Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA 2022), 2022.
-
Lost in Transcription of Graphic Signs in Ciphers
Part of Proceedings of the 5th International Conference on Historical Cryptology (HistoCrypt 2022), p. 153-158, 2022.
-
What Was Encoded in Historical Cipher Keys in the Early Modern Era?
Part of Proceedings of the 5th International Conference on Historical Cryptology (HistoCrypt 2022), 2022.
-
Zero-Shot Dependency Parsing with Worst-Case Aware Automated Curriculum Learning
Part of Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022), p. 578-587, 2022.
-
Word Order Does Matter (And Shuffled Language Models Know It)
Part of Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022), Vol. 1, p. 6907-6919, 2022.
-
Challenges and Strategies in Cross-Cultural NLP
Part of Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022), Vol. 1, p. 6997-7013, 2022.
-
Uppsala University at SemEval-2022 Task 1: Can Foreign Entries Enhance an English Reverse Dictionary?
Part of Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022), p. 88-93, 2022.
-
Few shots are all you need: A progressive learning approach for low resource handwritten text recognition
Part of Pattern Recognition Letters, p. 43-49, 2022.
-
A Mention-Based System for Revision Requirements Detection
Part of Proceedings of the 1st Workshop on Understanding Implicit and Underspecified Language, p. 58-63, 2021.
-
Attention Can Reflect Syntactic Structure (If You Let It)
Part of Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics, p. 3031-3045, 2021.
-
Syntactic Nuclei in Dependency Parsing: A Multilingual Exploration
Part of Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics, p. 1376-1387, 2021.
-
Audiobook stylistics: Comparing print and audio in the bestselling segment
Part of Journal of Cultural Analytics, p. 1-30, 2021.
-
Have Attention Heads in BERT Learned Constituency Grammar?
Part of Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics, p. 8-15, 2021.
-
Whit’s the Richt Pairt o Speech: PoS tagging for Scots
Part of Proceedings of the Eighth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial), p. 39-48, 2021.
-
Investigation of Transfer Languages for Parsing Latin: Italic Branch vs. Hellenic Branch
Part of Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa), p. 315-320, 2021.
-
Survey and reproduction of computational approaches to dating of historical texts
Part of Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa), p. 145-156, 2021.
-
Uppsala NLP at SemEval-2021 Task 2: Multilingual Language Models for Fine-tuning and Feature Extraction in Word-in-Context Disambiguation
Part of Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021), p. 150-156, 2021.
-
Swedish FrameNet++ and comparative linguistics
Part of The Swedish FrameNet++, p. 139-166, 2021.
-
Universal Dependencies
Part of Computational Linguistics, p. 255-308, 2021.
-
Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics
Part of Dagstuhl Reports, p. 89-138, 2021.
-
Revisiting Negation in Neural Machine Translation
Part of Transactions of the Association for Computational Linguistics, p. 740-755, 2021.
-
Bidirectional Domain Adaptation Using Weighted Multi-Task Learning
Part of IWPT 2021, p. 93-105, 2021.
-
Unsupervised Alphabet Matching in Historical Encrypted Manuscript Images
Part of Proceedings of the 4th International Conference on Historical Cryptology (HistoCrypt 2021), 2021.
-
Key Design in the Early Modern Era in Europe
Part of Proceedings of the 4th International Conference on Historical Cryptology (HistoCrypt 2021), 2021.
-
A bird’s-eye view on South Asian languages through LSI
Part of Journal of South Asian Languages and Linguistics, p. 203-237, 2020.
-
Tang, Gongbo
Understanding Neural Machine Translation: An investigation into linguistic phenomena and attention mechanisms
2020.
-
Czech Historical Named Entity Corpus v 1.0
Part of Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020), p. 4458-4465, 2020.
-
What Should/Do/Can LSTMs Learn When Parsing Auxiliary Verb Constructions?
Part of Computational linguistics - Association for Computational Linguistics (Print), p. 763-784, 2020.
-
The DReaM Corpus: A Multilingual Annotated Corpus of Grammars for the World's Languages
Part of Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), p. 878-884, 2020.
-
Exploiting Cross-lingual Hints to Discover Event Pronouns
Part of Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), p. 99-103, 2020.
-
A Tale of Three Parsers: Towards Diagnostic Evaluation for Meaning Representation Parsing
Part of Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020), p. 1902-1909, 2020.
-
SLäNDa: An Annotated Corpus of Narrative and Dialogue in Swedish Literary Fiction
Part of Proceedings of the 12th Language Resources and Evaluation Conference, p. 826-834, 2020.
-
Understanding Pure Character-Based Neural Machine Translation: The Case of Translating Finnish into English
Part of Proceedings of the 28th International Conference on Computational Linguistics, p. 4251-4262, 2020.
-
Real-valued syntactic word vectors
Part of Journal of experimental and theoretical artificial intelligence (Print), p. 557-579, 2020.
-
Multilingual Dependency Parsing from Universal Dependencies to Sesame Street
Part of Text, Speech, and Dialogue (TSD 2020), p. 11-29, 2020.
-
Cross-Lingual Domain Adaptation for Dependency Parsing
Part of Proceedings of the 19th International Workshop on Treebanks and Linguistic Theories (TLT), p. 62-69, 2020.
-
Cross-lingual Embeddings Reveal Universal and Lineage-Specific Patterns in Grammatical Gender Assignment
Part of Proceedings of the 24th Conference on Computational Natural Language Learning, p. 265-275, 2020.
-
Evaluating Word Embeddings for Indonesian–English Code-Mixed Text Based on Synthetic Data
Part of Proceedings of the 4th Workshop on Computational Approaches to Code Switching, p. 26-35, 2020.
-
Edition 1.2 of the PARSEME Shared Task on Semi-supervised Identification of Verbal Multiword Expressions
Part of Proceedings of the Joint Workshop on Multiword Expressions and Electronic Lexicons, p. 107-118, 2020.
-
The University of Edinburgh-Uppsala University's Submission to the WMT 2020 Chat Translation Task
Part of Proceedings of the 5th Conference on Machine Translation (WMT), p. 473-478, 2020.
-
Coreference Strategies in English-German Translation
Part of Proceedings of the 3rd Workshop on Computational Models of Reference, Anaphora and Coreference, p. 139-153, 2020.
-
IESTAC: English-Italian Parallel Corpus for End-to-End Speech-to-Text Machine Translation
Part of Proceedings of the First International Workshop on Natural Language Processing Beyond Text, p. 41-50, 2020.
-
Text Processing Procedures for Analysing a Corpus with Medieval Marian Miracle Tales in Old Swedish
Part of Proceedings of the 12th International Conference on Agents and Artificial Intelligence, p. 452-458, 2020.
-
Marian Miracles in Old Swedish Texts
Part of Les miracles de Notre-Dame du Moyen Âge à nos jours, p. 179-190, 2020.
-
Towards Privacy by Design in Learner Corpora Research: A Case of On-the-fly Pseudonymization of Swedish Learner Essays
Part of Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), p. 357-369, 2020.
-
Kopsala: Transition-Based Graph Parsing via Efficient Training and Effective Encoding
Part of 16th International Conference on Parsing Technologies and IWPT 2020 Shared Task on Parsing Into Enhanced Universal Dependencies, p. 236-244, 2020.
-
Transcription of Historical Ciphers and Keys
Part of Proceedings of the 3rd International Conference on Historical Cryptology, p. 106-115, 2020.
-
Automatic Key Structure Extraction
Part of Proceedings of the 3rd International Conference on Historical Cryptology, p. 146-152, 2020.
-
Rubenson on the Move: A Biographical Journey
Part of Wisdom on the Move, p. 247-250, 2020.
-
Classification of Medieval Documents: Determining the Issuer, Place of Issue, and Decade for Old Swedish Charters
Part of DHN 2020 Digital Humanities in the Nordic Countries, p. 12-23, 2020.