Information extraction (IE) is the automatic extraction of information from full text. Applications range from supporting Internet search engines to the automatic construction of domain-specific databases. Methods range from natural language analysis through automatic term recognition to machine learning, using symbolic, statistical, and hybrid approaches. Complex information structures can be represented with so-called templates (information patterns). The course examines a variety of applications and methods across different application domains.
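To make the notion of a template concrete, here is a minimal sketch in Python. It assumes a hypothetical "appointment" template with three slots and a single hand-written pattern that fills them. The class name, the slot names, and the regular expression are illustrative assumptions, not material from the seminar.

```python
import re
from dataclasses import dataclass
from typing import Optional


@dataclass
class AppointmentTemplate:
    """Hypothetical template: one slot per piece of information to extract."""
    person: str
    position: str
    organization: str


# Hand-written (symbolic) pattern; an assumption made for illustration only.
# Each named group corresponds to one slot of the template above.
APPOINTMENT_PATTERN = re.compile(
    r"(?P<person>[A-Z][a-z]+(?: [A-Z][a-z]+)+) was appointed "
    r"(?P<position>[a-z ]+?) of "
    r"(?P<organization>[A-Z][A-Za-z]+(?: [A-Z][A-Za-z]+)*)"
)


def extract_appointment(text: str) -> Optional[AppointmentTemplate]:
    """Fill the template from the first matching sentence, or return None."""
    match = APPOINTMENT_PATTERN.search(text)
    if match is None:
        return None
    return AppointmentTemplate(**match.groupdict())


if __name__ == "__main__":
    sentence = "Jane Smith was appointed chief executive of Acme Corp on Monday."
    print(extract_appointment(sentence))
```

A single pattern like this is brittle. Statistical or hybrid systems would typically learn how to fill the slots from annotated text instead of relying on one rule.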
Contents:
The seminar covers approaches, methods, and tools for information extraction, with a particular focus on recognizing named entities and domain-specific information. Questions concerning the evaluation of such methods will also be discussed.
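As an illustration of what evaluating such a method can look like, the following sketch computes precision, recall, and F1 for named entity recognition using exact-match scoring. It assumes each entity is a `(start, end, type)` tuple and that a prediction only counts as correct if all three values agree with the gold annotation. The function name and the example spans are hypothetical, and other scoring schemes (for example partial matches) exist.

```python
from typing import Iterable, Tuple

Entity = Tuple[int, int, str]  # (start token, end token, entity type); assumed format


def evaluate_entities(gold: Iterable[Entity], predicted: Iterable[Entity]):
    """Return (precision, recall, f1) under exact-match scoring."""
    gold_set, pred_set = set(gold), set(predicted)
    true_positives = len(gold_set & pred_set)
    # Guard against division by zero when either set is empty.
    precision = true_positives / len(pred_set) if pred_set else 0.0
    recall = true_positives / len(gold_set) if gold_set else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


if __name__ == "__main__":
    # Made-up annotations for illustration only.
    gold = [(0, 2, "PER"), (5, 7, "ORG"), (9, 10, "LOC")]
    predicted = [(0, 2, "PER"), (5, 7, "LOC"), (9, 10, "LOC"), (12, 13, "PER")]
    print(evaluate_entities(gold, predicted))
```

In this made-up example the span `(5, 7)` is found but labeled with the wrong type, so under exact matching it counts as both a false positive and a false negative.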
Learning objectives:
Participants should learn how to assess resources for IE systems. They should also become able to contribute to the development, deployment, and evaluation of IE systems.
Here is a link to the Lecture
Email Address: SubstituteMyLastName@cis.uni-muenchen.de
Tutor: Tobias Eder
Email Address: tobias.eder@in.tum.de
There are *two separate seminars*. You EITHER go on Wednesdays, OR you go on Thursdays, NOT BOTH!
Wed: 10:00 c.t., Room M203 (main building!)
Thurs: 10:00 c.t., Room 061 (Oettingenstr)
Fabian Dreer created a LaTeX template for the Hausarbeit (term paper); click here.
If this web page does not seem to be up to date, use the refresh button in your browser.
Date | Topic | Materials |
October 18th and October 19th | Introduction, Information on Participants | |
October 25th and October 26th | Presentation (Referat) Topics | alex_fraser.pdf fabienne_braune.pdf matthias_huck.pdf dario_stojanovski.pdf |
November 1st and November 2nd | Holiday/Cancelled | |
November 8th and November 9th | Exercise in ***Gobi computer lab*** | IE_exercise1.tgz |
February 7th and February 8th | 2nd exercise in ***Gobi computer lab*** | IE_exercise2.tgz sklearn ubuntu |
WEDNESDAY presentation (Referat) topics (name: topic)
Date | Topic | Materials | Hausarbeit Received |
November 15th | (AF) Oksana Budurova: History of IE | slides | yes |
November 22nd | (AF) Sofia Chaves: Fine-Grained Named Entities | slides | yes |
November 29th | (FB) Luca Ionescu: Crowd-sourcing | slides | yes |
November 29th | (AF) Lina Garcia: Rule-based vs. Statistical | slides | yes |
December 6th | (AF) Katja Bertholdt: NER Twitter | slides | yes |
December 6th | (FB) David Ewald: Domain Adaptation | slides | yes |
December 13th | (FB) Erika Worm: NN architecture for bilingual sentence extraction | slides | yes |
December 20th | (MH) Mohamed Mehdi Abdelkefi: Joint Entity Recognition and Disambiguation | slides | yes |
January 10th | (DS) Karolina Spiel: Social Media Sentiment Analysis | slides | yes |
January 17th | (MH) Leonie Lanfrit: Biomedical Knowledge Extraction | slides | yes |
January 17th | (AF) Monica Zanetta: Disasters in Social Media | slides | yes |
January 24th | (MH) Ziad Elsayes: NER with NN | slides | yes |
January 24th | (DS) Fabian Drach: Clinical TempEval | slides | yes |
January 31st | (MH) Carolina Lang: Event detection in Social Media | slides | yes |
January 31st | (MH) Bettina Winkler: Open-Domain Question Answering | slides | yes |
THURSDAY presentation (Referat) topics (name: topic)
Date | Topic | Materials | Hausarbeit Received |
November 16th | (AF) Andrey Zubarev: History of IE | slides | yes |
November 16th | (DS) Diana Dutka: Unsupervised Named Entity Recognition | slides | yes |
November 23rd | (FB) Yang He: Crowd-sourcing | slides | yes |
November 23rd | (AF) Vadim Ott: Rule-based vs. Statistical | slides | yes |
November 30th | (AF) NER Twitter | | |
November 30th | (FB) CNN for Named Entity Recognition | | |
December 7th | (FB) Julia Khobotova: LSTM for Named Entity Recognition | | |
December 7th | (MH) Mariia Lobushkova: NER with NN | slides | yes |
December 14th | (FB) Armela Meleqi: Bilingual Sentence Extraction | slides | yes |
December 14th | (FB) Valentyna Walner: NN architecture for Bilingual Sentence Extraction | slides | yes |
December 21st | (AF) Veronika Hintzen: Coreference | slides | yes |
December 21st | (FB) Anna Lobushkova: Weakly Supervised NER | slides | yes |
January 11th | (MH) Meltem Sagir: Credibility Prediction | slides | yes |
January 18th | (AF) Christoph Weber: Disasters in Social Media | slides | yes |
January 18th | (DS) Aurelien Levecq: Automatically labeled data for event extraction | slides | yes |
January 25th | (MH) Sophia Antonin: Event detection in Social Media | slides | yes |
January 25th | (AF) Sarah Bosch: Keyword Extraction | slides | yes |
February 1st | (MH) Constantin Rebouskos: Extracting Structured Information from Conversations | slides | yes |
February 1st | (AF) Yannick Kaiser: NER Twitter | slides | yes |
February 8th | (AF) Ivana Daskalovska: Bilingual Sentence Extraction | slides | yes |