Photograph of Michael Strube

I lead the Natural Language Processing (NLP) Group at HITS gGmbH, Heidelberg, Germany. There, I am involved in NLP-related projects, work with the computational linguists at HITS, and supervise PhD students. In addition, I am an Honorarprofessor (honorary professor) in the Computational Linguistics Department at the University of Heidelberg.

 

news

Long paper accepted at EMNLP '16: Daraksha Parveen, Mohsen Mesgar and Michael Strube: Generating Coherent Summaries of Scientific Articles Using Coherence Patterns.

Long paper accepted as oral presentation at ACL '16: Nafise Sadat Moosavi and Michael Strube: Which Coreference Evaluation Metric Do You Trust? A Proposal for a Link-based Entity Aware Metric.

Two papers accepted at NAACL '16: Mohsen Mesgar and Michael Strube: Lexical Coherence Graph Modeling by Word Embeddings (long paper) and Nafise Moosavi and Michael Strube: Search Space Pruning: A Simple Solution for Better Coreference Resolvers (short paper).

I am (co-)area chair for (believe it or not, I'm serious) Summarization, Generation, Discourse, and Dialogue at EMNLP '16 held in Austin, Texas, November 2-6, 2016.

I was PC Co-Chair at the ACL's 2015 flagship conference ACL-IJCNLP '15 held in Beijing, China, July 26-31, 2015.

Paper accepted in TACL: Sebastian Martschat and Michael Strube: Latent Structures for Coreference Resolution (PDF).

I am a member of a new research training group funded by DFG: AIPHES (Adaptive Information Processing from Heterogeneous Sources). This is a collaboration between the CS Department at the Technical University of Darmstadt, the Institute for Computational Linguistics at the University of Heidelberg and the NLP Group at HITS. I'll focus on concept disambiguation in relation to discourse processing and sentiment analysis.

Recent invited talks: At the LSDsem '15 and the CORBON '16 workshops: "The (Non-)Utility of Semantics for Coreference Resolution" (LSDsem, CORBON Remix). At the University of Mannheim, School of Business Informatics and Mathematics (and a few other places ...): "The Dark Side of NLP: An Introduction into Natural Language Processing for Spooks, Stalkers, and other Scoundrels" (PDF, Announcement). At the University of Edinburgh, Institute for Language, Cognition and Computation: "Issues I Don't Understand About Coreference and Coherence".

Teaching winter term 2016: The Dark Side of NLP: Gefahren automatischer Sprachverarbeitung (Dangers of Automatic Language Processing) in the CL Department at the University of Heidelberg.

(This isn't really news ...) According to Google Scholar the AAAI '06 and AAAI '07 papers Simone Paolo Ponzetto and I co-authored are the most cited papers of the top conference in AI (Google Scholar query for AAAI 2006 and AAAI 2007)!

 

research interests

Linguistics:

  • Text and Dialogue
  • Pragmatics

Computational Linguistics:

  • Anaphora and Coreference Resolution
  • Generation of Referring Expressions
  • Modeling Local (and maybe also Global) Coherence
  • Discourse and Dialogue Structure (though I don't believe in it)

Natural Language Processing:

  • Automatic Summarization
  • Concept Disambiguation, Entity Linking, Cross-document Coreference Resolution
  • Information Extraction
  • Knowledge Acquisition, Ontology Learning
  • Natural Language Generation Systems

 

publications

Publications in Journals and Books, Conference Proceedings, Workshop Proceedings, and Complete List of Publications

See my publications at Google Scholar.

Send me an email if you want a copy of a paper not linked on these pages (in those cases we had to transfer the copyright to the respective publishers; maybe linguists should follow the good example of JAIR and start to publish in open access journals).


A Few Recent Publications


  • Parveen, Daraksha; Mesgar, Mohsen; Strube, Michael (2016).
    Generating Coherent Summaries of Scientific Articles Using Coherence Patterns.
    In: EMNLP '16 (long paper), to appear.
  • Moosavi, Nafise Sadat; Strube, Michael (2016).
    Which Coreference Evaluation Metric Do You Trust? A Proposal for a Link-based Entity Aware Metric.
    In: ACL '16 (long paper), pp.632-644. (PDF)
  • Mesgar, Mohsen; Strube, Michael (2016).
    Lexical Coherence Graph Modeling by Word Embeddings.
    In: NAACL '16 (long paper), pp.1414-1423. (PDF)
  • Martschat, Sebastian; Strube, Michael (2015).
    Latent Structures for Coreference Resolution.
    In: TACL, 3, pp.405-418. (PDF)
  • Pradhan, Sameer; Luo, Xiaoqiang; Recasens, Marta; Hovy, Eduard; Ng, Vincent; Strube, Michael (2014).
    Scoring coreference partitions of predicted mentions: A reference implementation.
    In: ACL '14 (Short Papers), pp.30-35. (PDF)
  • Nastase, Vivi; Strube, Michael (2013).
    Transforming Wikipedia into a Large Scale Multilingual Concept Network.
    In: Artificial Intelligence, 194, pp.62-85. (DOI) (PDF)
  • Zirn, Cäcilia; Niepert, Mathias; Stuckenschmidt, Heiner; Strube, Michael (2011).
    Fine-grained Sentiment Analysis with Structural Features.
    In: IJCNLP '11, pp.336-344. (PDF)
    Best Paper Award!
  • Ponzetto, Simone Paolo; Strube, Michael (2011).
    Taxonomy Induction Based on a Collaboratively Built Knowledge Repository.
    In: Artificial Intelligence, 175 (9/10), pp.1737-1756. (PDF)
  • Ponzetto, Simone Paolo; Strube, Michael (2007).
    Knowledge Derived from Wikipedia for Computing Semantic Relatedness.
    In: Journal of Artificial Intelligence Research 30, pp.181-212. (PDF)
    Honorable Mention for the IJCAI-JAIR best paper prize 2010 (awarded to an outstanding paper published in JAIR in the preceding five calendar years).
 

short biography

I received my Ph.D. from the (now defunct) Computational Linguistics Department at the University of Freiburg, Germany, in December 1996 under the supervision of Udo Hahn. Between 1997 and 1999 I was a postdoctoral fellow at the Institute for Research in Cognitive Science at the University of Pennsylvania, Philadelphia, PA. In 2000 I joined the European Media Lab in Heidelberg, Germany, as a researcher. Since 2001 I have been group leader of the Natural Language Processing (NLP) Group of HITS, an institute which rapidly underwent several transformations before (hopefully) arriving at its final destination:

European Media Lab → EML Research → HITS.

Anyway, it has been a lot of fun to be here ...

In 2010 I was appointed Honorarprofessor in the Computational Linguistics Department at the University of Heidelberg.

In 2015/16 I serve as deputy scientific director at HITS.

In 2015 I was PC Co-Chair of the ACL's flagship conference ACL-IJCNLP '15 held in Beijing, China, July 26-31, 2015. We created about 3000 pages of printed paper as output.

 

some addictions

Photograph of a sign: Tourists are not permitted beyond this point

Literature

What is Jazz?

Running

Photography