Seminar Natural Language Processing I/II im Wintersemester 2020/21

Allgemeine Hinweise

Wenn Sie an einem Seminar bei uns interessiert sind, dann wenden Sie sich bitte an eine:n unserer Mitarbeiter:innen. Eigene Themenvorschläge im Rahmen unserer Forschungsschwerpunkte sind erwünscht. Bitte teilen Sie uns Ihren Studiengang, Ihr Semester und bereits an der Professur belegte Lehrveranstaltungen mit, um die Themenfindung zu erleichtern.

Wichtige Termine

Anmeldung: bis 16.10.2020 per Mail an eine:n unserer Mitarbeiter:innen
Themenvergabe: bis 44. KW
Paperbesprechung: 48. KW (fristgerecht und eigenständig Termin mit Betreuer:in vereinbaren)
- Ziel: in der Lage sein, das Problem und die beschriebenen Ansätze zu skizzieren und auf Nachfragen zu reagieren
- Fragen an Betreuer:in müssen konkret formuliert werden
Vorabversion Vortragsmaterialen: bis 03.01.2021 an Betreuer:in senden
Besprechung Vortragsmaterialien: 1. KW 2021 (fristgerecht und eigenständig Termin mit Betreuer:in vereinbaren)

Zielgruppen

Engagierte Studenten folgender Studiengänge:

Bachelor Informatik: Module INF-B-510 (Proseminar), INF-B-520 (Proseminar), INF-B-610 (Proseminar)
Master Informatik: Modul INF-AQUA (Seminar)
Diplom Informatik: Module INF-D-520 (Proseminar), INF-D-940 (Seminar)
Diplom Informatik (Studienordnung 2004 oder älter): Hauptseminar, Fachgebiet Theorie der Programmierung

Voraussetzungen für die Teilnahme

Bachelor Informatik: Modul INF-B-290 (Theoretische Informatik und Logik)
Master Informatik: keine; Grundlagenwissen über die Themengebiete wird empfohlen
Diplom Informatik: Modul INF-D-330 (Theoretische Informatik und Logik)
Diplom Informatik (Studienordnung 2004 oder älter): Grundlagen der Theoretischen Informatik

Voraussetzungen für das Bestehen

fristgerecht und eigenständig Termine mit den Betreuer:innen verabreden (mindestens 1 Woche im Voraus) sowie die geforderten Materialien abgeben
(nicht für Proseminar) Seminararbeit von 12–15 Seiten, mit Titel, Autor, Einleitung (mind. 1 Seite), vollständigen Referenzen, begrifflich und notationell in sich abgeschlossen, Inhalte mit Beispielen und Bildern anschaulich gemacht; davon eine Vorabversion: inhaltlich vollständig, aber in der Darstellung noch rudimentär
(nur für Proseminar) Handout 1–2 Seiten (nur ein Blatt!)
(für alle) Vortrag von 30–35 Minuten, durch Medien geeignet unterstützt: Beamer-Folien, Tafel, physische Folien, Handout etc.; von allem eine Vorabversion: inhaltlich vollständig, aber in der Darstellung noch rudimentär
Anwesenheit bei allen Vorträgen, aktive Teilnahme an den Diskussionen
bei Einbringen in eine mündliche Prüfung: Übersichtswissen über die Seminarbeiträge (Kernthesen)

Themen

Die Themen wurden zu Beginn des Seminars kurz eingeführt.

Thema	Student:in	Betreuer:in
Tetra-Tagging: Word-Synchronous Parsing with Linear-Time Inference [Kitaev and Klein 2020]	Tim Rieß	Luisa Herrmann
Span-based discontinuous constituency parsing: a family of exact chart-based algorithms with time complexities from O(n6) down to O(n3) [Corro 2020]	Yash Khandelwal	Richard Mörbitz
Discontinuous Constituency Parsing with a Stack-Free Transition System and a Dynamic Oracle [Coavoux and Cohen 2019]	Jonas Knobloch	Thomas Ruprecht
Discontinuous Constituent Parsing as Sequence Labeling [Vilares and Gómez-Rodríguez 2020]	Jakob Klinger	Thomas Ruprecht
Conditional random fields: Probabilistic models for segmenting and labeling sequence data [Lafferty et al. 2001]	Erik Nikulski	Richard Mörbitz
Towards Probabilistic Acceptors and Transducers for Feature Structures [Quernheim and Knight 2012]	Teodora Ivoniciu	Luisa Herrmann
The problem with probabilistic DAG automata for semantic graphs [Vasiljeva et al. 2019]	Jan Sommer	Luisa Herrmann

Referenzen

Coavoux, M. and Cohen, S.B. 2019. Discontinuous Constituency Parsing with a Stack-Free Transition System and a Dynamic Oracle. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Association for Computational Linguistics, 204–217. [doi, url]
Corro, C. 2020. Span-based discontinuous constituency parsing: a family of exact chart-based algorithms with time complexities from O(n^6) down to O(n^3). https://arxiv.org/abs/2003.13785. [url, arXiv]
Kitaev, N. and Klein, D. 2020. Tetra-Tagging: Word-Synchronous Parsing with Linear-Time Inference. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, 6255–6261. [doi, url]
Lafferty, J., McCallum, A., and Pereira, F.C.N. 2001. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. . [url]
Quernheim, D. and Knight, K. 2012. Towards Probabilistic Acceptors and Transducers for Feature Structures. Proceedings of the Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Association for Computational Linguistics, 76–85. [url]
Stanojević, M. and Steedman, M. 2020. Span-Based LCFRS-2 Parsing. Proceedings of the 16th International Conference on Parsing Technologies and the IWPT 2020 Shared Task on Parsing into Enhanced Universal Dependencies, Association for Computational Linguistics, 111–121. [doi, url]
Vasiljeva, I., Gilroy, S., and Lopez, A. 2019. The problem with probabilistic DAG automata for semantic graphs. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Association for Computational Linguistics, 902–911. [doi, url]
Vilares, D. and Gómez-Rodríguez, C. 2020. Discontinuous Constituent Parsing as Sequence Labeling. https://arxiv.org/abs/2010.00633. [url, arXiv]

Kontakt

Prof. Dr.-Ing. habil. Dr. h.c./Univ. Szeged
Heiko Vogler
Tel.: +49 (0) 351 463-38232
Dr. rer. nat. Luisa Herrmann
Tel.: +49 (0) 351 463-38487
Dr. rer. nat. Richard Mörbitz
Tel.: +49 (0) 351 463-38487
Thomas Ruprecht, M.Sc.
Tel.: +49 (0) 351 463-38469

Stand: 26.10.2020 12:01 Uhr