Current Topics in Natural Language Processing (WS 2016)

Summary

Deep Learning is an interesting new branch of machine learning where neural networks consisting of multiple layers have shown new generalization capabilities. The seminar will look at advances in both general deep learning approaches, and at the specific case of Neural Machine Translation (NMT). NMT is a new paradigm in data-driven machine translation. In Neural Machine Translation, the entire translation process is posed as an end-to-end supervised classification problem, where the training data is pairs of sentences and the full sequence to sequence task is handled in one model.

Here is a link to last semester's seminar.

There is a Munich interest group for Deep Learning, which has an associated mailing list (initially organized by David Kaumanns). See the link here: http://www.cis.uni-muenchen.de/~davidk/deep-munich/

Instructors

Alexander Fraser

Email Address: SubstituteLastName@cis.uni-muenchen.de

CIS, LMU Munich

Hinrich Schütze

CIS, LMU Munich

Schedule

Thursdays 14:30 s.t., location is C105 (CIS Besprechungsraum).

Click here for directions to CIS.

If this page appears to be out of date, use the refresh button of your browser

Date Paper Links Discussion Leader

Thursday, October 6th Dan Gillick, Cliff Brunk, Oriol Vinyals, Amarnag Subramanya (2016). Multilingual Language Processing From Bytes. HLT-NAACL 2016. paper Hinrich Schütze

Thursday, October 13th Zhaopeng Tu, Zhengdong Lu, Yang Liu, Xiaohua Liu, Hang Li (2016). Modeling Coverage for Neural Machine Translation. ACL 2016 paper Tsuyoshi Okita

Thursday, October 20th Ronan Collobert, Jason Weston, Leon Bottou, Michael Karlen, Koray Kavukcuoglu, Pavel Kuksa (2011). Natural Language Processing (Almost) from Scratch. Journal of Machine Learning Research 2011. paper, focus on sections 1,3,7,8 Hinrich Schütze

Thursday, October 27th Yonghui Wu (and many others). Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation. arXiv. paper Matthias Huck

Thursday, November 10th Ivan Vulić and Anna Korhonen (2016). On the Role of Seed Lexicons in Learning Bilingual Word Embeddings. ACL 2016 paper Viktor Hangya

Thursday, November 17th Marco Tulio Ribeiro, Sameer Singh, Carlos Guestrin (2016). "Why Should I Trust You?": Explaining the Predictions of Any Classifier. Knowledge Discovery and Data Mining (KDD) 2016. paper Hinrich Schütze

Thursday, November 24th Melvin Johnson, Mike Schuster et al. (2016). Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation. arXiv. paper Matthias Huck

Thursday, December 1st Yftah Ziser and Roi Reichart (2016). Neural Structural Correspondence Learning for Domain Adaptation. arXiv. *AND* Ira Leviant and Roi Reichart (2015). Separated by an Un-common Language: Towards Judgment Language Informed Vector Space Modeling. arXiv. paper paper2 Roi Reichart

Thursday, December 8th Nal Kalchbrenner, Lasse Espeholt, Karen Simonyan, Aaron van den Oord, Alex Graves, Koray Kavukcuoglu (2016). Neural Machine Translation in Linear Time. arXiv. paper Valentin Deyringer

Thursday, December 15th Maxima Hashimoto and Yoshimasa Tsuruoka (2016). Adaptive Joint Learning of Compositional and Non-Compositional Phrase Embeddings. ACL 2016. paper Eva Maria Vecchi

Thursday, December 22nd Xilun Chen, Yu Sun, Ben Athiwaratkun, Claire Cardie, Kilian Weinberger (2016). Adversarial Deep Averaging Networks for Cross-Lingual Sentiment Classification. arXiv. paper Fabienne Braune

Thursday, January 12th Daniel Andor, Chris Alberti, David Weiss, Aliaksei Severyn, Alessandro Presta, Kuzman Ganchev, Slav Petrov, Michael Collins (2016). Globally Normalized Transition-Based Neural Networks. ACL 2016. paper Heike Adel

Thursday, January 19th Jason Lee, Kyunghyun Cho, Thomas Hofmann (2016). Fully Character-Level Neural Machine Translation without Explicit Segmentation. arXiv. paper Alex Fraser

Thursday, January 26th NO MEETING

Thursday, February 2nd Junyoung Chung, Sungjin Ahn, Yoshua Bengio (2016). Hierarchical Multiscale Recurrent Neural Networks. arXiv. paper Also: ICLR 2017 reviews Helmut Schmid

Thursday, February 9th Yann N. Dauphin, Angela Fan, Michael Auli, David Grangier (2016). Language Modeling with Gated Convolutional Networks. arXiv. paper Ben Roth

Thursday, February 16th Maria Nadejde, Siva Reddy, Rico Sennrich, Tomasz Dwojak, Marcin Junczys-Dowmunt, Philipp Koehn, Alexandra Birch (2017). Syntax-aware Neural Machine Translation Using CCG. arXiv. paper Matthias Huck

Thursday, February 23rd W. James Murdoch, Arthur Szlam (2017). Automatic Rule Extraction from Long Short Term Memory Networks. arXiv. paper Heike Adel

Thursday, March 2nd Angeliki Lazaridou, Georgiana Dinu, Marco Baroni (2015). Hubness and Pollution: Delving into Cross-Space Mapping for Zero-Shot Learning. ACL 2015. paper Fabienne Braune

Thursday, March 16th Minh-Thang Luong, Quoc V. Le, Ilya Sutskever, Oriol Vinyals, Lukasz Kaiser (2016). Multi-task Sequence to Sequence Learning. ICLR 2016. paper Matthias Huck

Thursday, March 30th Shonosuke Ishiwatari, Nobuhiro Kaji, Naoki Yoshinaga, Masashi Toyoda, Masaru Kitsuregawa (2015). Accurate Cross-lingual Projection between Count-based Word Vectors by Exploiting Translatable Context Pairs. CoNLL 2015
*and* Shonosuke Ishiwatari, Naoki Yoshinaga, Masashi Toyoda and Masaru Kitsuregawa (2016) Instant Translation Model Adaptation by Translating Unseen Words in Continuous Vector Space. CICLing 2016 paper1
paper2 Fabienne Braune

Thursday, April 20th Katharina Kann and Yadollah Yaghoobzadeh will present ongoing work (for paper, contact alex fraser)

Further literature:

Please click here for an NMT reading list, but also see the more general RNN reading list here (scroll down).

Date	Paper	Links	Discussion Leader
Thursday, October 6th	Dan Gillick, Cliff Brunk, Oriol Vinyals, Amarnag Subramanya (2016). Multilingual Language Processing From Bytes. HLT-NAACL 2016.	paper	Hinrich Schütze
Thursday, October 13th	Zhaopeng Tu, Zhengdong Lu, Yang Liu, Xiaohua Liu, Hang Li (2016). Modeling Coverage for Neural Machine Translation. ACL 2016	paper	Tsuyoshi Okita
Thursday, October 20th	Ronan Collobert, Jason Weston, Leon Bottou, Michael Karlen, Koray Kavukcuoglu, Pavel Kuksa (2011). Natural Language Processing (Almost) from Scratch. Journal of Machine Learning Research 2011.	paper, focus on sections 1,3,7,8	Hinrich Schütze
Thursday, October 27th	Yonghui Wu (and many others). Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation. arXiv.	paper	Matthias Huck
Thursday, November 10th	Ivan Vulić and Anna Korhonen (2016). On the Role of Seed Lexicons in Learning Bilingual Word Embeddings. ACL 2016	paper	Viktor Hangya
Thursday, November 17th	Marco Tulio Ribeiro, Sameer Singh, Carlos Guestrin (2016). "Why Should I Trust You?": Explaining the Predictions of Any Classifier. Knowledge Discovery and Data Mining (KDD) 2016.	paper	Hinrich Schütze
Thursday, November 24th	Melvin Johnson, Mike Schuster et al. (2016). Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation. arXiv.	paper	Matthias Huck
Thursday, December 1st	Yftah Ziser and Roi Reichart (2016). Neural Structural Correspondence Learning for Domain Adaptation. arXiv. AND Ira Leviant and Roi Reichart (2015). Separated by an Un-common Language: Towards Judgment Language Informed Vector Space Modeling. arXiv.	paper paper2	Roi Reichart
Thursday, December 8th	Nal Kalchbrenner, Lasse Espeholt, Karen Simonyan, Aaron van den Oord, Alex Graves, Koray Kavukcuoglu (2016). Neural Machine Translation in Linear Time. arXiv.	paper	Valentin Deyringer
Thursday, December 15th	Maxima Hashimoto and Yoshimasa Tsuruoka (2016). Adaptive Joint Learning of Compositional and Non-Compositional Phrase Embeddings. ACL 2016.	paper	Eva Maria Vecchi
Thursday, December 22nd	Xilun Chen, Yu Sun, Ben Athiwaratkun, Claire Cardie, Kilian Weinberger (2016). Adversarial Deep Averaging Networks for Cross-Lingual Sentiment Classification. arXiv.	paper	Fabienne Braune
Thursday, January 12th	Daniel Andor, Chris Alberti, David Weiss, Aliaksei Severyn, Alessandro Presta, Kuzman Ganchev, Slav Petrov, Michael Collins (2016). Globally Normalized Transition-Based Neural Networks. ACL 2016.	paper	Heike Adel
Thursday, January 19th	Jason Lee, Kyunghyun Cho, Thomas Hofmann (2016). Fully Character-Level Neural Machine Translation without Explicit Segmentation. arXiv.	paper	Alex Fraser
Thursday, January 26th	NO MEETING
Thursday, February 2nd	Junyoung Chung, Sungjin Ahn, Yoshua Bengio (2016). Hierarchical Multiscale Recurrent Neural Networks. arXiv.	paper Also: ICLR 2017 reviews	Helmut Schmid
Thursday, February 9th	Yann N. Dauphin, Angela Fan, Michael Auli, David Grangier (2016). Language Modeling with Gated Convolutional Networks. arXiv.	paper	Ben Roth
Thursday, February 16th	Maria Nadejde, Siva Reddy, Rico Sennrich, Tomasz Dwojak, Marcin Junczys-Dowmunt, Philipp Koehn, Alexandra Birch (2017). Syntax-aware Neural Machine Translation Using CCG. arXiv.	paper	Matthias Huck
Thursday, February 23rd	W. James Murdoch, Arthur Szlam (2017). Automatic Rule Extraction from Long Short Term Memory Networks. arXiv.	paper	Heike Adel
Thursday, March 2nd	Angeliki Lazaridou, Georgiana Dinu, Marco Baroni (2015). Hubness and Pollution: Delving into Cross-Space Mapping for Zero-Shot Learning. ACL 2015.	paper	Fabienne Braune
Thursday, March 16th	Minh-Thang Luong, Quoc V. Le, Ilya Sutskever, Oriol Vinyals, Lukasz Kaiser (2016). Multi-task Sequence to Sequence Learning. ICLR 2016.	paper	Matthias Huck
Thursday, March 30th	Shonosuke Ishiwatari, Nobuhiro Kaji, Naoki Yoshinaga, Masashi Toyoda, Masaru Kitsuregawa (2015). Accurate Cross-lingual Projection between Count-based Word Vectors by Exploiting Translatable Context Pairs. CoNLL 2015 and Shonosuke Ishiwatari, Naoki Yoshinaga, Masashi Toyoda and Masaru Kitsuregawa (2016) Instant Translation Model Adaptation by Translating Unseen Words in Continuous Vector Space. CICLing 2016	paper1 paper2	Fabienne Braune
Thursday, April 20th	Katharina Kann and Yadollah Yaghoobzadeh will present ongoing work	(for paper, contact alex fraser)