Information extraction (IE) is concerned with the automatic extraction of information from full texts. Applications range from supporting Internet search engines to the automatic construction of domain-specific databases. The methods range from natural language analysis through automatic term recognition to machine learning, and they include symbolic, statistical, and hybrid approaches. Complex information structures can be represented with so-called templates (information patterns). The course examines a range of applications and methods across various application domains.
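To make the idea of a template concrete, here is a minimal sketch in Python of a symbolic (hand-written rule) extractor that fills a three-slot template from a line of text. The class `SeminarTemplate`, its slot names, the pattern `PATTERN`, and the function `fill_template` are all hypothetical names chosen for illustration; they are not taken from the course materials, and real IE systems use far richer rules or learned models.

```python
import re
from dataclasses import dataclass
from typing import Optional


@dataclass
class SeminarTemplate:
    """Hypothetical template: one slot per piece of information to extract."""
    weekday: Optional[str] = None
    time: Optional[str] = None
    room: Optional[str] = None


# Hand-written rule (an assumption for this sketch): a weekday label, a colon,
# a clock time, an optional "c.t." marker, a comma, and a room identifier.
PATTERN = re.compile(
    r"(?P<weekday>\w+):\s*(?P<time>\d{1,2}:\d{2})(?:\s*c\.t\.)?,\s*Room\s+(?P<room>\w+)"
)


def fill_template(text: str) -> Optional[SeminarTemplate]:
    """Return a filled template if the rule matches, otherwise None."""
    match = PATTERN.search(text)
    if match is None:
        return None
    # The named groups of the pattern map one-to-one onto the template slots.
    return SeminarTemplate(**match.groupdict())


if __name__ == "__main__":
    print(fill_template("Wed: 12:00 c.t., Room U127"))
```

A rule like this is precise but brittle: any text that deviates from the expected pattern yields no template at all, which is one reason statistical and hybrid methods are used alongside symbolic ones.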
Contents:
The seminar covers approaches, methods, and tools for information extraction, with a particular focus on the recognition of proper names and of domain-specific information. Questions concerning the evaluation of such methods will also be discussed.
Learning objectives:
Participants should learn how to assess resources for IE systems. They should also become able to contribute to the development, deployment, and evaluation of IE systems.
Here is a link to the Lecture
Email Address: SubstituteMyLastName@cis.uni-muenchen.de
Tutor: Fabian Dreer
Email Address: SubstituteFabiansLastName@cip.ifi.lmu.de
There are *two separate seminars*. You EITHER go on Wednesdays, OR you go on Thursdays, NOT BOTH!
Wed: 12:00 c.t., Room U127
Thurs: 10:00 c.t., Room U127
Fabian created a LaTeX template for the Hausarbeit; click here.
If this web page does not seem to be up to date, use the refresh button in your browser.
Date | Topic | Materials |
October 19th and October 20th | Introduction, Information on Participants | |
October 26th and October 27th | Presentation of Referatsthemen; registration for topics by email (to Fraser) on Wednesday, November 2nd at 14:00 | main.pdf okita.pdf braune_huck.pdf |
November 2nd and 3rd | Practical Exercise (Manual Rules) *** in Gobi *** | tar.gz (see included slides) |
December 7th and December 8th | Machine Learning Exercise *** in Gobi *** | tar.gz (See included slides) |
January 25th and January 26th | NO MEETING | |
February 1st and February 2nd | Sequence Learning Exercise *** in Gobi *** | tar.gz (See included slides) |
WEDNESDAY Referatsthemen (name: topic)
Date | Topic | Materials | Hausarbeit Received |
November 9th | Klug: History of IE | slides | yes |
November 16th | Flemmig: Training data for NER | slides | yes |
November 16th | Serjogin: Mechanical Turk | slides | yes |
November 23rd | Kryvosheya: Rule-based vs. Statistical IE | slides | yes |
November 23rd | Malih: NER Twitter | slides | yes |
November 30th | Birkner: Weakly supervised NER | slides | yes |
November 30th | Stalpouskaya: Automatic Extraction of Agendas for Action from News Coverage of Violent Conflict | NA | |
December 14th | Pongratz: NER of Biological Entities | slides | yes |
December 14th | Atanasova: NER OpenNLP | slides | yes |
December 21st | Guldener: Neural Networks for NER | slides | yes |
January 11th | Okisheva: Event Detection | slides | yes |
January 11th | Datcenko: Natural Language Comprehension with Neural Networks | slides | yes |
January 18th | Smith: IE for Question Answering | slides | yes |
THURSDAY Referatsthemen (name: topic)
Date | Topic | Materials | Hausarbeit Received |
November 10th | Alberteris: History of IE | slides | yes |
November 10th | Eder: Focused Web Crawling | slides | yes |
November 17th | Sharab: Annotated Data for Medical NER | slides | yes |
November 17th | Pham: Mechanical Turk | slides | yes |
November 24th | Vordermaier: Rule-based vs. Statistical IE | slides | yes |
November 24th | Weißweiler: NER Stanford | slides | yes |
December 1st | Röhrer: NER Twitter | slides | yes |
December 15th | Strohmayer: NER OpenNLP | slides | yes |
December 15th | Ebert: Parsing Citations | slides | yes |
December 22nd | Bespala: Coreference | slides | yes |
December 22nd | Schönl: Neural Networks for NER | slides | yes |
January 12th | Ramoser: Event Detection | slides | yes |
January 12th | Smirnov: Natural Language Comprehension with Neural Networks | slides | yes |
January 19th | Klyueva: IE for Question Answering | slides | yes |
January 19th | Steidl: Sentiment Analysis | slides | yes |