Publications in Computational Linguistics

  • Dahllöf, Mats; Berglund, Karl

    Faces, Fights, and Families: Topic Modeling and Gendered Themes in Two Corpora of Swedish Prose Fiction

    Part of DHN 2019 Copenhagen, Proceedings of 4th Conference of The Association Digital Humanities in the Nordic Countries Copenhagen, March 6-8 2019, p. 92-111, 2019.

    Open access
  • Megyesi, Beáta; Volodina, Elena

    Pseudonymization of Language Learner Data

    Part of Workshop om pseudonymisering av textdata, 2019.

    Open access
  • Baró, Arnau; Chen, Jialuo; Fornés, Alicia; Megyesi, Beáta

    Towards a Generic Unsupervised Method for Transcription of Encoded Manuscripts

    Part of Proceedings of the 3rd International Conference on Digital Access to Textual Cultural Heritage, 2019.

  • Megyesi, Beáta; Palmér, Anne; Näsman, Jesper

    SWEGRAM: Annotering och analys av svenska texter

    2019.

    Open access
  • Megyesi, Beáta; Blomqvist, Nils; Pettersson, Eva

    The DECODE Database: Collection of Historical Ciphers and Keys

    Part of Proceedings of the 2nd International Conference on Historical Cryptology, p. 69-78, 2019.

    Open access
  • Dahllöf, Mats

    Clustering writing components from medieval manuscripts

    Part of Proceedings of the Workshop on Computational Methods in the Humanities 2018, p. 23-32, 2019.

    Open access
  • Humanistiska teatern, Thunbergsvägen 3H, Uppsala 2019-03-09 10:15

    Tang, Marc

    A typology of classifiers and gender: From description to computation

    Open access
  • Dubremetz, Marie; Nivre, Joakim

    Rhetorical Figure Detection: Chiasmus, Epanaphora, Epiphora

    Part of Frontiers in Digital Humanities, 2018.

  • Dahllöf, Mats

    Automatic Scribe Attribution for Medieval Manuscripts

    Part of Digital Medievalist, p. 1-26, 2018.

    Open access
  • Tang, Gongbo; Müller, Mathias; Rios, Annette; Sennrich, Rico

    Why Self-Attention?: A Targeted Evaluation of Neural Machine Translation Architectures

    Part of Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, p. 4263-4272, 2018.

  • Schuster, Sebastian; Nivre, Joakim; Manning, Christopher D.

    Sentences with Gapping: Parsing and Reconstructing Elided Predicates

    Part of Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics, p. 1156-1168, 2018.

  • Nivre, Joakim; Marongiu, Paola; Ginter, Filip; Kanerva, Jenna et al.

    Enhancing Universal Dependency Treebanks: A Case Study

    Part of Proceedings of the Second Workshop on Universal Dependencies (UDW 2018), p. 102-107, 2018.

  • Bouma, Gosse; Hajič, Jan; Haug, Dag; Nivre, Joakim et al.

    Expletives in Universal Dependency Treebanks

    Part of Proceedings of the Second Workshop on Universal Dependencies (UDW 2018), p. 18-26, 2018.

  • Tang, Gongbo; Sennrich, Rico; Nivre, Joakim

    An Analysis of Attention Mechanisms: The Case of Word Sense Disambiguation in Neural Machine Translation

    Part of Proceedings of the Third Conference on Machine Translation, p. 26-35, 2018.

  • Smith, Aaron; Bohnet, Bernd; de Lhoneux, Miryam; Nivre, Joakim et al.

    82 Treebanks, 34 Models: Universal Dependency Parsing with Multi-Treebank Models

    Part of Proceedings of the CoNLL 2018 Shared Task, p. 113-123, 2018.

  • Smith, Aaron; de Lhoneux, Miryam; Stymne, Sara; Nivre, Joakim

    An Investigation of the Interactions Between Pre-Trained Word Embeddings, Character Models and POS Tags in Dependency Parsing

    Part of Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, p. 2711-2720, 2018.

  • Tang, Gongbo; Cap, Fabienne; Pettersson, Eva; Nivre, Joakim

    An evaluation of neural machine translation models on historical spelling normalization

    Part of Proceedings of the 27th International Conference on Computational Linguistics, p. 1320-1331, 2018.

  • Megyesi, Beáta; Granstedt, Lena; Johansson, Sofia; Prentice, Julia et al.

    Learner Corpus Anonymization in the Age of GDPR: Insights from the Creation of a Learner Corpus of Swedish

    Part of Proceedings of the 7th NLP4CALL, 2018.

    Open access
  • Her, One-Soon; Tang, Marc

    A Statistical Explanation of the Distribution of Sortal Classifiers in Languages of the World via Computational Classifiers

    Part of Journal of Quantitative Linguistics, 2018.

    Open access
  • Søgaard, Anders; de Lhoneux, Miryam; Augenstein, Isabelle

    Nightmare at test time: How punctuation prevents parsers from generalizing

    Part of Proceedings of the 2018 EMNLP Workshop BlackboxNLP, p. 25-29, 2018.

    Open access
  • de Lhoneux, Miryam; Bjerva, Johannes; Augenstein, Isabelle; Søgaard, Anders

    Parameter sharing between dependency parsers for related languages

    Part of Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, p. 4992-4997, 2018.

    Open access
  • Dahllöf, Mats

    Clustering Writing Components from Medieval Manuscripts

    Part of COMHUM 2018: Book of Abstracts for the Workshop on Computational Methods in the Humanities 2018, p. 11-13, 2018.

    Open access
  • Stymne, Sara; de Lhoneux, Miryam; Smith, Aaron; Nivre, Joakim

    Parser Training with Heterogeneous Treebanks

    Part of Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), p. 619-625, 2018.

    Open access
  • Megyesi, Beáta

    Proceedings of the 1st International Conference on Historical Cryptology: HistoCrypt 2018

    2018.

    Open access
  • Room 22-0008, Humanistiska teatern, 752 38, Uppsala 2018-09-08 09:00

    Basirat, Ali

    Principal Word Vectors

    Open access
  • Shao, Yan; Hardmeier, Christian; Nivre, Joakim

    Universal Word Segmentation: Implementation and Interpretation

    Part of Transactions of the Association for Computational Linguistics, p. 421-435, 2018.

    Open access
  • Pettersson, Eva; Megyesi, Beáta

    The HistCorp Collection of Historical Corpora and Resources

    Part of DHN 2018, p. 306-320, 2018.

    Open access
  • Humanistiska teatern, Thunbergsvägen 3, Uppsala 2018-06-09 10:15

    Shao, Yan

    Segmenting and Tagging Text with Neural Networks

    Open access
  • Dubremetz, Marie; Nivre, Joakim

    Machine Learning for Rhetorical Figure Detection: More Chiasmus with Less Annotation

    Part of Proceedings of the 21st Nordic Conference of Computational Linguistics, p. 37-45, 2017.

  • Virk, Shafqat Mumtaz; Borin, Lars; Saxena, Anju; Hammarström, Harald

    Automatic extraction of typological linguistic features from descriptive grammars

    Part of Text, Speech, and Dialogue, p. 111-119, 2017.

  • Hammarström, Harald; Virk, Shafqat Mumtaz; Forsberg, Markus

    Poor Man’s OCR Post-Correction: Unsupervised Recognition of Variant Spelling Applied to a Multilingual Document Collection

    Part of Proceedings of the Digital Access to Textual Cultural Heritage (DATeCH) conference, p. 71-75, 2017.

  • Shao, Yan; Hardmeier, Christian; Tiedemann, Jörg; Nivre, Joakim

    Character-based Joint Segmentation and POS Tagging for Chinese using Bidirectional RNN-CRF

    Part of Proceedings of the 8th International Joint Conference on Natural Language Processing, p. 173-183, 2017.

    Open access
  • Shao, Yan; Hardmeier, Christian; Nivre, Joakim

    Recall is the Proper Evaluation Metric for Word Segmentation

    Part of Proceedings of the 8th International Joint Conference on Natural Language Processing, p. 86-90, 2017.

    Open access
  • de Lhoneux, Miryam; Shao, Yan; Basirat, Ali; Kiperwasser, Eliyahu et al.

    From raw text to Universal Dependencies: look, no tags!

    Part of Proceedings of the CoNLL 2017 Shared Task, p. 207-217, 2017.

    Open access
  • Shao, Yan

    Cross-lingual Word Segmentation and Morpheme Segmentation as Sequence Labelling

    Part of Proceedings of MLP 2017, p. 75-80, 2017.

    Open access
  • Parks, Magdalena; Karlgren, Jussi; Stymne, Sara

    Plausibility Testing for Lexical Resources

    Part of Proceedings of CLEF 2017, p. 132-137, 2017.

  • Adams, Allison; Stymne, Sara

    Learning with learner corpora: Using the TLE for native language identification

    Part of Proceedings of the joint workshop on NLP for Computer Assisted Language Learning and NLP for Language Acquisition, p. 1-7, 2017.

    Open access
  • Stymne, Sara

    The Effect of Translationese on Tuning for Statistical Machine Translation

    Part of Proceedings of the 21st Nordic Conference on Computational Linguistics, p. 241-246, 2017.

    Open access
  • Loáiciga, Sharid; Stymne, Sara; Nakov, Preslav; Hardmeier, Christian et al.

    Findings of the 2017 DiscoMT Shared Task on Cross-lingual Pronoun Prediction

    Part of Proceedings of the Third Workshop on Discourse in Machine Translation, 2017.

    Open access
  • Stymne, Sara; Loáiciga, Sharid; Cap, Fabienne

    A BiLSTM-based System for Cross-lingual Pronoun Prediction

    2017.

  • Padilla López, Rebeca; Cap, Fabienne

    Did you ever read about Frogs drinking Coffee?: Investigating the Compositionality of Multi-Emoji Expressions

    Part of Proceedings of the 8th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, p. 113-117, 2017.

    Open access
  • Savary, Agata; Ramisch, Carlos; Cordeiro, Silvio Ricardo; Sangati, Federico et al.

    The PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions

    Part of Proceedings of the 13th Workshop on Multiword Expressions (MWE 2017), p. 31-47, 2017.

    Open access
  • Cap, Fabienne

    Show me your variance and I tell you who you are: Deriving compound compositionality from word alignments

    Part of Proceedings of the 13th Workshop on Multiword Expressions, p. 102-107, 2017.

    Open access
  • Cap, Fabienne

    Approximating Compound Compositionality based on Word Alignments

    2017.

  • Stymne, Sara; Pettersson, Eva; Megyesi, Beáta; Palmér, Anne

    Annotating Errors in Student Texts: First Experiences and Experiments

    Part of Proceedings of Joint 6th NLP4CALL and 2nd NLP4LA Nodalida workshop, p. 47-60, 2017.

    Open access
  • Näsman, Jesper; Megyesi, Beáta; Palmér, Anne

    SWEGRAM: A Web-Based Tool for Automatic Annotation and Analysis of Swedish Texts

    Part of Proceedings of the 21st Nordic Conference on Computational Linguistics, Nodalida 2017, p. 132-141, 2017.

    Open access
  • Fornés, Alicia; Megyesi, Beáta; Mas, Joan

    Transcription of Encoded Manuscripts with Image Processing Techniques

    Part of Proceedings of Digital Humanities 2017, 2017.

    Open access
  • Basirat, Ali; Tang, Marc

    Neural network and human cognition: A case study of grammatical gender in Swedish

    Part of Proceedings of the 13th Swedish Cognitive Science Society (SweCog) national conference, p. 28-30, 2017.

    Open access
  • Zeman, Daniel; Popel, Martin; Straka, Milan; Hajič, Jan et al.

    CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies

    Part of Proceedings of the CoNLL 2017 Shared Task, p. 1-19, 2017.

  • Ide, Nancy; Calzolari, Nicoletta; Eckle-Kohler, Judith; Gibbon, Dafydd et al.

    Community Standards for Linguistically-Annotated Resources

    Part of Handbook of Linguistic Annotation, p. 113-165, 2017.

  • Nivre, Joakim; Fang, Chiao-Ting

    Universal Dependency Evaluation

    Part of Proceedings of the NoDaLiDa 2017 Workshop on Universal Dependencies (UDW 2017), p. 86-95, 2017.

  • Basirat, Ali; Nivre, Joakim

    Real-valued Syntactic Word Vectors (RSV) for Greedy Neural Dependency Parsing

    p. 21-28, 2017.

    Open access
  • Humanistiska teatern, Thunbergsvägen 3H, Uppsala 2018-01-20 10:15

    Dubremetz, Marie

    Detecting Rhetorical Figures Based on Repetition of Words: Chiasmus, Epanaphora, Epiphora

    Open access
  • de Lhoneux, Miryam; Stymne, Sara; Nivre, Joakim

    Arc-Hybrid Non-Projective Dependency Parsing with a Static-Dynamic Oracle

    Part of IWPT 2017 15th International Conference on Parsing Technologies, p. 99-104, 2017.

    Open access
  • Dubremetz, Marie; Nivre, Joakim

    Syntax Matters for Rhetorical Structure: The Case of Chiasmus

    Part of Proceedings of the Fifth Workshop on Computational Linguistics for Literature, p. 47-53, 2016.

  • Shao, Yan; Nivre, Joakim

    Applying Neural Networks to English-Chinese Named Entity Transliteration

    Part of Proceedings of the Sixth Named Entity Workshop, joint with 54th ACL, 2016.

    Open access
  • Cap, Fabienne; Adesam, Yvonne; Ahrenberg, Lars; Borin, Lars et al.

    SWORD: Towards Cutting-Edge Swedish Word Processing

    2016.

  • Borin, Lars; Tahmasebi, Nina; Volodina, Elena; Ekman, Stefan et al.

    Swe-Clarin: Language Resources and Technology for Digital Humanities

    Part of Extended Papers of the International Symposium on Digital Humanities, p. 29-51, 2016.

    Open access
  • Volodina, Elena; Megyesi, Beáta; Wirén, Mats; Granstedt, Lena et al.

    A Friend in Need?: Research agenda for electronic Second Language infrastructure

    Part of Proceedings of SLTC 2016, 2016.

    Open access
  • Cap, Fabienne; Stymne, Sara

    Using Word Alignments to Determine the Compositionality of Swedish Compound Nouns

    2016.

  • Constant, Matthieu; Nivre, Joakim

    A Transition-Based System for Joint Lexical and Syntactic Analysis

    Part of Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, p. 161-171, 2016.

  • Shao, Yan; Hardmeier, Christian; Nivre, Joakim

    Multilingual Named Entity Recognition using Hybrid Neural Networks

    2016.

    Open access
  • Guillou, Liane; Hardmeier, Christian; Nakov, Preslav; Stymne, Sara et al.

    Findings of the 2016 WMT Shared Task on Cross-lingual Pronoun Prediction

    Part of Proceedings of the First Conference on Machine Translation, p. 525-542, 2016.

    Open access
  • Tiedemann, Jörg; Cap, Fabienne; Kanerva, Jenna; Ginter, Filip et al.

    Phrase-Based SMT for Finnish with More Data, Better Models and Alternative Alignment and Translation Tools

    Part of Proceedings of the First Conference on Machine Translation, p. 391-398, 2016.

    Open access
  • Sagemo, Oscar; Stymne, Sara

    The UU Submission to the Machine Translation Quality Estimation Task

    Part of Proceedings of the First Conference on Machine Translation, p. 825-830, 2016.

    Open access
  • Stymne, Sara

    Feature Exploration for Cross-Lingual Pronoun Prediction

    Part of Proceedings of the First Conference on Machine Translation, p. 609-615, 2016.

    Open access
  • Stymne, Sara

    The Effect of Translationese on SMT Tuning

    2016.

    Open access
  • Nivre, Joakim; de Marneffe, Marie-Catherine; Ginter, Filip; Goldberg, Yoav et al.

    Universal Dependencies v1: A Multilingual Treebank Collection

    Part of Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), 2016.

  • Dobrovoljc, Kaja; Nivre, Joakim

    The Universal Dependencies Treebank of Spoken Slovenian

    Part of Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), 2016.

  • Seraji, Mojgan; Ginter, Filip; Nivre, Joakim

    Universal Dependencies for Persian

    Part of Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), 2016.

  • Basirat, Ali; Faili, Heshaam; Nivre, Joakim

    A statistical model for grammar mapping

    Part of Natural Language Engineering, p. 215-255, 2016.

  • Smith, Aaron; Hardmeier, Christian; Tiedemann, Jörg

    Climbing Mount BLEU: The Strange World of Reachable High-BLEU Translations

    Part of Baltic Journal of Modern Computing, p. 269-281, 2016.

  • Hardmeier, Christian; Guillou, Liane

    A Graphical Pronoun Analysis Tool for the PROTEST Pronoun Evaluation Test Suite

    Part of Baltic Journal of Modern Computing, p. 318-330, 2016.

  • Megyesi, Beáta; Näsman, Jesper; Palmér, Anne

    The Uppsala Corpus of Student Writings: Corpus Creation, Annotation, and Analysis

    Part of Language Resources and Evaluation, 2016.

  • Ballesteros, Miguel; Nivre, Joakim

    MaltOptimizer: Fast and Effective Parser Optimization

    Part of Natural Language Engineering, p. 187-213, 2016.

  • Dubremetz, Marie; Nivre, Joakim

    Rhetorical Figure Detection: the Case of Chiasmus

    Part of Proceedings of the Fourth Workshop on Computational Linguistics for Literature, p. 23-31, 2015.

  • Shao, Yan; Tiedemann, Jörg; Nivre, Joakim

    Boosting English-Chinese Machine Transliteration via High Quality Alignment and Multilingual Resources

    Part of Proceedings of the Fifth Named Entity Workshop, p. 56-60, 2015.

  • Pettersson, Eva; Megyesi, Beáta; Nivre, Joakim

    Ranking Relevant Verb Phrases Extracted from Historical Text

    Part of Proceedings of the 9th SIGHUM Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities, 2015.

  • Megyesi, Beáta

    Proceedings of the 20th Nordic Conference of Computational Linguistics

    2015.

  • Webber, Bonnie; Carpuat, Marine; Popescu-Belis, Andrei; Hardmeier, Christian

    Proceedings of the Second Workshop on Discourse in Machine Translation

    2015.

  • Hardmeier, Christian; Nakov, Preslav; Stymne, Sara; Tiedemann, Jörg et al.

    Pronoun-Focused MT and Cross-Lingual Pronoun Prediction: Findings of the 2015 DiscoMT Shared Task on Pronoun Translation

    Part of Proceedings of the Second Workshop on Discourse in Machine Translation (DiscoMT), p. 1-16, 2015.

    Open access
  • Callin, Jimmy; Hardmeier, Christian; Tiedemann, Jörg

    Part-of-Speech Driven Cross-Lingual Pronoun Prediction with Feed-Forward Neural Networks

    Part of Proceedings of the Second Workshop on Discourse in Machine Translation (DiscoMT), p. 59-64, 2015.

    Open access
  • Hardmeier, Christian

    On Statistical Machine Translation and Translation Theory

    Part of Proceedings of the Second Workshop on Discourse in Machine Translation (DiscoMT), p. 168-172, 2015.

    Open access
  • Hardmeier, Christian

    A Document-Level SMT System with Integrated Pronoun Prediction

    Part of Proceedings of the Second Workshop on Discourse in Machine Translation (DiscoMT), p. 72-77, 2015.

    Open access
  • Universitetshuset / IX, Uppsala 2015-05-27 10:15

    Seraji, Mojgan

    Morphosyntactic Corpora and Tools for Persian

    Open access
  • Xiong, Deyi; Duh, Kevin; Hardmeier, Christian; Navigli, Roberto

    Proceedings of the 1st Workshop on Semantics-Driven Statistical Machine Translation (S2MT 2015)

    2015.

    Open access
  • Beck, Daniel; Cohn, Trevor; Hardmeier, Christian; Specia, Lucia

    Learning Structural Kernels for Natural Language Processing

    Part of Transactions of the Association for Computational Linguistics, p. 461-473, 2015.

    Open access
  • Kavathatzopoulos, Iordanis; Björk, Ingrid

    How ethical robots process information, communicate and act

    Part of 1st TRANSOR Workshop, 2015.

  • Nivre, Joakim

    Om datorer och språkförståelse

    Part of Årsbok 2015, p. 75-82, 2015.

  • Pettersson, Eva; Nivre, Joakim

    Improving Verb Phrase Extraction from Historical Text by Use of Verb Valency Frames

    Part of Proceedings of the 20th Nordic Conference of Computational Linguistics, p. 153-161, 2015.

  • Seraji, Mojgan; Bohnet, Bernd; Nivre, Joakim

    ParsPer: A Dependency Parser for Persian

    Part of Depling 2015, p. 300-309, 2015.

    Open access
  • Farahmand, Meghdad; Smith, Aaron; Nivre, Joakim

    A Multiword Expression Data Set: Annotating Non-Compositionality and Conventionalization for English Noun Compounds

    Part of Proceedings of the 11th Workshop on Multiword Expressions, p. 29-33, 2015.

  • Björkelund, Anders; Nivre, Joakim

    Non-Deterministic Oracles for Unrestricted Non-Projective Transition-Based Dependency Parsing

    Part of Proceedings of the 14th International Conference on Parsing Technologies, p. 76-86, 2015.

  • Farahmand, Meghdad; Nivre, Joakim

    Modeling the Statistical Idiosyncrasy of Multiword Expressions

    Part of Proceedings of the 11th Workshop on Multiword Expressions, p. 34-38, 2015.

  • Dubremetz, Marie; Nivre, Joakim

    Extraction of Nominal Multiword Expressions in French

    Part of Proceedings of the 10th Workshop on Multiword Expressions (MWE), p. 72-76, 2014.

  • Pettersson, Eva; Megyesi, Beáta; Nivre, Joakim

    A Multilingual Evaluation of Three Spelling Normalization Methods for Historical Text

    Part of Workshop on Language Technology for Cultural Heritage, Social Sciences and Humanities, LaTeCH 2014, 2014.

  • Tengstrand, Lisa; Megyesi, Beáta; Henriksson, Aron; Duneld, Martin et al.

    EACL - Expansion of Abbreviations in CLinical text

    Part of Workshop on Predicting and Improving Text Readability for Target Reader Populations, PITR 2014, 2014.

  • Tan, Liling; Zampieri, Marcos; Ljubešić, Nikola; Tiedemann, Jörg

    Merging Comparable Data Sources for the Discrimination of Similar Languages: The DSL Corpus Collection

    Part of LREC 2014 - Ninth International Conference On Language Resources And Evaluation, 2014.

  • Hardmeier, Christian; Tiedemann, Jörg; Nivre, Joakim

    Translating Pronouns with Latent Anaphora Resolution

    2014.

    Open access
  • Hardmeier, Christian

    A Dependency Projection Model for Phrase-Based SMT

    2014.

    Open access