Information Extraction - Seminar (WS 2019-2020)

Summary

Information extraction (IE) is concerned with the automatic extraction of information from full texts. Applications range from supporting Internet search engines to the automatic construction of specialized databases. The methods range from natural language analysis and automatic term recognition to machine learning, and include symbolic, statistical, and hybrid approaches. Complex information structures can be represented with so-called templates (information patterns). The course looks at various applications and methods for a range of application domains.
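As a rough illustration of the template idea, here is a minimal Python sketch of a symbolic (rule-based) extractor that fills a template from one line of text. It is not taken from the course materials: the class SeminarTemplate, the pattern SLOT_PATTERN, and the function fill_template are hypothetical names chosen for this example, and the pattern only covers lines shaped like the schedule entries on this page.

```python
import re
from dataclasses import dataclass
from typing import Optional


@dataclass
class SeminarTemplate:
    """Hypothetical template: one slot per piece of information to extract."""
    weekday: Optional[str] = None
    time: Optional[str] = None
    room: Optional[str] = None


# Illustrative hand-written rule for lines such as "Mon: 16:00 c.t., Room 061".
# A real system would need many more rules or a learned model.
SLOT_PATTERN = re.compile(
    r"(?P<weekday>[A-Z][a-z]+):\s*"
    r"(?P<time>\d{1,2}:\d{2})\s*(?:c\.t\.|s\.t\.)?,\s*"
    r"Room\s+(?P<room>\w+)"
)


def fill_template(text: str) -> Optional[SeminarTemplate]:
    """Return a filled template, or None if the rule does not match."""
    match = SLOT_PATTERN.search(text)
    if match is None:
        return None
    # The named groups map one-to-one onto the template slots.
    return SeminarTemplate(**match.groupdict())


if __name__ == "__main__":
    print(fill_template("Mon: 16:00 c.t., Room 061"))
```

A statistical or hybrid system would replace or complement the hand-written pattern with a trained model, while the template itself could stay the same.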

Contents:

The seminar covers approaches, methods, and tools for information extraction, with a particular focus on recognizing named entities and domain-specific information. Questions concerning the evaluation of such methods will also be discussed.

Learning objectives:

Participants should learn how to evaluate resources for IE systems. They should also become able to contribute to the development, deployment, and evaluation of IE systems.

Here is a link to the Lecture

Instructor

Alexander Fraser

Email Address: SubstituteMyLastName@cis.uni-muenchen.de

CIS, LMU Munich

Tutor: Tobias Eder

Email Address: tobias.eder@in.tum.de

Schedule

There are *two separate seminars*. You EITHER go on Mondays, OR you go on Thursdays, NOT BOTH!

Mon: 16:00 c.t., Room 061

Thurs: 10:00 c.t., Room 061

For a LaTeX template for the Hausarbeit, click here.

If this web page does not seem to be up to date, use the refresh button in your browser.

Date Topic Materials

October 14th and October 17th Introduction, Information on Participants

October 21st and October 24th Referat Topics Alexander Fraser, Viktor Hangya, Simon Riess, Matthias Huck, Jindrich Libovicky, Dario Stojanovski, Alexandra Chronopoulou

October 28th and October 31st Cancelled (Vorlesung on October 30th is not cancelled!)

November 4th and November 7th First Exercise (in Gobi) IE_exercise1.tar.xz IE_exercise1_notes.txt

December 2nd and December 5th Second Exercise (in Gobi) IE_exercise2.tgz

February 3rd and February 6th (Feb 6th starting early at 9:45am!) Third Exercise (in Gobi) IE_exercise3.tar.gz

News

Moodle

MONDAY Referatsthemen (name: topic)

Date Topic Language Materials Hausarbeit Received

11.11 (AF) Gege Ruan: NER for Twitter EN

11.11 (VH) Kateryna Chernysh: IE from CVs EN

18.11 (MH) Sahar Zahidi: Structural Segmentation of Email EN

18.11 (MH) Luise Schulz: Automatic Intent Identification in Emails EN

18.11 (AC) Suteera Seeha: Named Entity Recognition using Joint Word- and Character-level Embeddings EN

25.11 (SR) Claudia Müller: QA Systems DE

25.11 (SR) Dianora Herashchenko: Hate Speech Detection DE

25.11 (SR) Markus Pfaffl: Propaganda Detection DE

09.12 (VH) Mochamet Machmout: User Opinion Extraction EN

09.12 (JL) Huy Nguyen: Word attributes from large amount of text EN

09.12 (JL) Laura Lehmann: Depression and Self-Harm Risk Assessment in Online Forums EN

16.12 (DS) Haotian Ye: Cross-lingual NER EN

16.12 (DS) Paulina Zahlbaum: Hyperpartisan News Detection EN

16.12 (JL) Alina Fastowski: Extractive and Abstractive Text Summarization EN

13.01 (JL) Nadja Seeberg: Fake Reviews Detection EN

13.01 (AC) Jessica Schleiermacher: Relation extraction from clinical texts EN

20.01 (DS) Thang Pham: Automatically Labeled Data for Event Extraction EN

20.01 (AC) Andrej Vershynin: Event Detection using Recurrent Neural Networks EN

27.01 (MH) Halyna Tonkoshkura: Semantic Role Labeling DE

27.01 (MH) Natalia Amelina: Neural Open IE DE

06.02 at 9:45 s.t. (AF) Khanh-Van Zenz: Chatbots DE

THURSDAY Referatsthemen (name: topic)

Date Topic Language Materials Hausarbeit Received

14.11 (AF) Alexandru Porfir: History of IE EN

14.11 (AF) Vlora Murselji: Rule-based vs. Statistical EN

14.11 (AF) Leonard Kassow: NER German EN

21.11 (VH) Julia Eppler: IE from CVs EN

21.11 (SR) Sinem Demiraslan: Question Generation EN

28.11 (AF) Anna Pugina: NER for Twitter DE

28.11 (SR) Merve Erkoc: QA systems DE

28.11 (SR) Marcella Valente: Hate Speech Detection DE

12.12 (SR) Benno Krojer: Fake News Detection EN

12.12 (AC) Aylin Dedek: Fake News - FEVER EN

19.12 (MH) Dominik Kisiala: Event Historical DE

19.12 (MH) Olha Syplyvets: Semantic Role Labeling DE

09.01 (AF) Laurin Gerhardt: Coreference EN

09.01 (VH) Thomas Grieb: Clinical Temporal Relation Extraction with Neural Networks EN

16.01 (JL) Anna Kohler: Word attributes from large amount of text EN

16.01 (JL) Nam Hoang: Unsupervised Text Summarization EN

16.01 (MH) Leonhard Wabro: Automatic Intent Identification in Emails EN

21.01 at 11:00 s.t. in C105 (AC) Michael Anzer: Event Detection using RNNs EN

23.01 (AF) Frank Pöhlmann: Disaster Events in Social Media EN

23.01 (AF) Ann-Kathrin Fochler: Creating Training Data with Weak Supervision for Relation Extraction EN

23.01 (JL) Andrea Augustin: Fake Reviews Detection EN
