Multi-word Expressions

Room: 23.32.01.48
Time: Tuesdays, 16:30–18:00

Course description

Multi-word expressions (MWE) such as German ins Gras beißen (literally “to bite into the grass”, i.e. “to die”) are complex syntactic units whose properties, according to common assumptions, cannot be derived in a regular way from the properties of their components. In other words, MWE are idiosyncratic and “surprising” in some respect, but this only becomes apparent in the interplay of particular construction types, lexemes, and word forms. It is therefore no surprise that many regard MWE as a “pain in the neck”. In this seminar, we will use relevant and recent texts to examine this range of phenomena and the computational-linguistic models and methods that target it.

The seminar is designed as a reading seminar and will be accompanied by student presentations. It is aimed at Master's students and advanced Bachelor's students who already have some knowledge of syntax and grammar formalisms.

Schedule (preliminary)

DATE | TOPIC | LITERATURE | SLIDES
10.10. | Preliminary meeting and seminar overview | | [pdf]
17.10. | MWE and NLP: a classic overview | Sag et al. (2002) | [pdf]
24.10. | MWE revisited: further aspects | Baldwin & Kim (2010) |
31.10. | PUBLIC HOLIDAY | |
07.11. | MWE in Generative Grammar | Newmeyer (1974) |
14.11. | MWE in TAG and CxG | Abeillé & Schabes (1996); Lichte & Kallmeyer (2016) |
21.11. | MWE in HPSG | Richter & Sailer (2009); Müller (2010) |
28.11. | MWE in lexicography | Francopoulo et al. (2006) + Odijk (2013); Lichte et al. (To appear) | (Pötschke); (Dreschers, Kornienko)
05.12. | MWE identification | Fazly et al. (2009): types; Fazly et al. (2009): tokens |
12.12. | MWE identification | Sporleder & Li (2009) + Li & Sporleder (2010) |
19.12. | MWE identification | Daniel & Last (2016); Salehi et al. (2014) |
09.01. | MWE parsing | Waszczuk et al. (2016; 2017) | (Hense)
16.01. | MWE in machine translation | Cap et al. (2015) | (v.d. Kohle)
23.01. | MWE in psycholinguistics | Libben & Titone (2008) | (Conen)
30.01. | Summary | |

Suggested reading

Empirical data and basic concepts

Baldwin, Timothy & Su Nam Kim. 2010. Multiword expressions. In Nitin Indurkhya & Fred J. Damerau (eds.), Handbook of Natural Language Processing. 2nd edn. CRC Press.

Nunberg, Geoffrey, Ivan A. Sag & Thomas Wasow. 1994. Idioms. Language 70(3). 491–538.

Sag, Ivan A., Timothy Baldwin, Francis Bond, Ann Copestake & Dan Flickinger. 2002. Multiword expressions: A pain in the neck for NLP. In Alexander Gelbukh (ed.), Computational Linguistics and Intelligent Text Processing (Lecture Notes in Computer Science 2276), 1–15. Berlin: Springer.

MWE in Generative Grammar

Fraser, Bruce. 1970. Idioms within a Transformational Grammar. Foundations of Language 6(1). 22–42.

Newmeyer, Frederick J. 1974. The regularity of idiom behavior. Lingua 34(4). 327–342.

Nunberg, Geoffrey, Ivan A. Sag & Thomas Wasow. 1994. Idioms. Language 70(3). 491–538.

MWE in TAG and CxG

Abeillé, Anne & Yves Schabes. 1996. Non-compositional discontinuous constituents in Tree Adjoining Grammar. In Harry Bunt & Arthur van Horck (eds.), Discontinuous Constituency, 279–306. Berlin, Germany: Mouton de Gruyter.

Lichte, Timm & Laura Kallmeyer. 2016. Same syntax, different semantics: A compositional approach to idiomaticity in multi-word expressions. In Christopher Piñón (ed.), Empirical Issues in Syntax and Semantics 11, 111–140. http://www.cssp.cnrs.fr/eiss11/.

MWE in HPSG

Müller, Stefan. 2010. Persian Complex Predicates and the Limits of Inheritance-Based Analyses. Journal of Linguistics 46(3). 601–655.

Richter, Frank & Manfred Sailer. 2009. Phraseological Clauses in Constructional HPSG. In Stefan Müller (ed.), Proceedings of the 16th International Conference on Head-Driven Phrase Structure Grammar, University of Göttingen, Germany, 297–317. Stanford, CA: CSLI Publications.

Sailer, Manfred & Frank Richter. 2002. Not for Love or Money: Collocations! In Gerhard Jäger, Paola Monachesi, Gerald Penn & Shuly Wintner (eds.), Proceedings of Formal Grammar, 149–160.

Soehn, Jan-Philipp. 2006. On Idiom Parts and their Contexts. Linguistik Online 27(2).

MWE in LFG

Findlay, Jamie Y. 2017. Multiword expressions and lexicalism: The view from LFG. In Proceedings of the 13th Workshop on Multiword Expressions (MWE 2017), 73–79. Valencia, Spain: Association for Computational Linguistics.

MWE in lexicography

Daelemans, Walter & Erik-Jan van der Linden. 1992. Evaluation of lexical representation formalisms. https://pure.uvt.nl/portal/files/5362743/DWLEJ5613490.pdf.

Dormeyer, Ricarda, Ingrid Fischer & Martina Keil. 1998. A Database for Verbal Idioms. In Th. Fontenelle, Ph. Hiligsmann, A. Michiels, A. Moulin & S. Theissen (eds.), Euralex '98 Proceedings, 99–109. Université de Liège, Département d'anglais et de néerlandais, Belgium.

Francopoulo, Gil, Monte George, Nicoletta Calzolari, Monica Monachini, Nuria Bel, Mandy Pet & Claudia Soria. 2006. Lexical Markup Framework (LMF). In Proceedings of the International Conference on Language Resources and Evaluation (LREC 2006), 233–236.

Lichte, Timm, Simon Petitjean, Agata Savary & Jakub Waszczuk. To appear. Lexical encoding formats for multi-word expressions: The challenge of “irregular” regularities. In Yannick Parmentier & Jakub Waszczuk (eds.), Representation and parsing of multiword expressions: Current trends (Phraseology and Multiword Expressions 1). Berlin: Language Science Press.

Odijk, Jan. 2013. DUELME: Dutch Electronic Lexicon of Multiword Expressions. In Gil Francopoulo (ed.), LMF: Lexical Markup Framework, 133–144. Hoboken, NJ: John Wiley & Sons. doi:10.1002/9781118712696.ch9.

MWE identification

Daniel, Tal & Mark Last. 2016. Exploring Long-Term Temporal Trends in the Use of Multiword Expressions. In Proceedings of the 12th Workshop on Multiword Expressions, 11–20. Berlin, Germany.

Fazly, Afsaneh, Paul Cook & Suzanne Stevenson. 2009. Unsupervised type and token identification of idiomatic expressions. Computational Linguistics 35(1). 61–103.

Katz, Graham & Eugenie Giesbrecht. 2006. Automatic Identification of Non-Compositional Multi-Word Expressions using Latent Semantic Analysis. In Proceedings of the Workshop on Multiword Expressions: Identifying and Exploiting Underlying Properties, 12–19. Sydney, Australia.

Li, Linlin & Caroline Sporleder. 2010. Linguistic cues for distinguishing literal and non-literal usages. In Proceedings of the 23rd International Conference on Computational Linguistics (COLING): Posters, 683–691.

Salehi, Bahar, Paul Cook & Timothy Baldwin. 2014. Using Distributional Similarity of Multi-way Translations to Predict Multiword Expression Compositionality. In Proceedings of the 14th conference of the European Chapter of the Association for Computational Linguistics (EACL), 472–481. Gothenburg, Sweden: Association for Computational Linguistics. http://www.aclweb.org/anthology/E14-1050.

Schulte im Walde, Sabine, Stefan Müller & Stephen Roller. 2013. Exploring vector space models to predict the compositionality of German noun-noun compounds. In Mona Diab, Timothy Baldwin & Marco Baroni (eds.), Proceedings of the 2nd Joint Conference on Lexical and Computational Semantics (*SEM), Volume 1: Proceedings of the Main Conference and the Shared Task, 255–265.

Sporleder, Caroline & Linlin Li. 2009. Unsupervised recognition of literal and non-literal use of idiomatic expressions. In Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics (EACL), 754–762.

MWE in parsing

Waszczuk, Jakub, Agata Savary & Yannick Parmentier. 2016. Promoting multiword expressions in A* TAG parsing. In 26th International Conference on Computational Linguistics (COLING 2016). Osaka, Japan. https://hal.archives-ouvertes.fr/hal-01378903.

Waszczuk, Jakub, Agata Savary & Yannick Parmentier. 2017. Multiword expression-aware A* TAG parsing revisited. In 13th International Workshop on Tree-Adjoining Grammar and Related Formalisms. Umeå, Sweden.

MWE in machine translation

Cap, Fabienne, Manju Nirmal, Marion Weller & Sabine Schulte im Walde. 2015. How to Account for Idiomatic German Support Verb Constructions in Statistical Machine Translation. In Proceedings of the 11th Workshop on Multiword Expressions, 19–28. Denver, Colorado. http://www.aclweb.org/anthology/W15-0903.

MWE in psycholinguistics

Libben, Maya R. & Debra A. Titone. 2008. The multidetermined nature of idiom processing. Memory & Cognition 36(6). 1103–1121.

Gibbs, Raymond W., Jr., Nandini P. Nayak & Cooper Cutting. 1989. How to kick the bucket and not decompose: Analyzability and idiom processing. Journal of Memory and Language 28(5). 576–593.