Photograph of Michael Strube

I lead the Natural Language Processing (NLP) Group at HITS gGmbH, Heidelberg, Germany. There, I am involved in NLP related projects, work with the computational linguists at HITS, and supervise PhD. students. In addition, I am Honorarprofessor in the Computational Linguistics Department at the University of Heidelberg.

 

news

The paper Fine-grained sentiment analysis with structural features (PDF) co-authored by Cäcilia Zirn, Mathias Niepert, Heiner Stuckenschmidt and myself received the Best Paper Award at IJCNLP 2011!

The results for the 2011 shared tasks were very encouraging for us. The HITS cross-lingual link discovery system (led by Angela Fahrni) won the NTCIR 9 shared task on Cross-lingual Link Discovery (linking English source documents to Korean, Chinese and Japanese target documents) hands down. We led 11 out of 18 evaluation metrics, winning all Wikipedia ground truth evaluation measures and even 4 of the manual assessment results. The HITS coreference resolution system (led by Jie Cai) made it to second place in the open setting of the CoNLL 2011 Shared Task on Coreference Resolution (ranked fourth over all settings, not significantly different from second and third).

I am tutorial chair at the 50th Annual Meeting of the ACL held on Jeju Island, Korea, in July 2012.

I am area chair for the area Discourse, Dialogue, and Pragmatics at the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2012) held in Avignon, France, in April 2012.

Together with Anette Frank, Stefan Riezler and Sebastian Padó I lead the PhD Colloquium in the CL Department at the University of Heidelberg. At the same time slot we also do the Computational Linguistics Colloquium.

I teach a class on Sentiment Analysis in the CL Department at the University of Heidelberg in the winter term 2011.

(This isn't really news ...) According to Google Scholar the AAAI '06 and AAAI '07 papers Simone Paolo Ponzetto and I co-authored are the most cited papers of the top conference in AI (Google Scholar query for AAAI 2006 and AAAI 2007)!

 

research interests

Linguistics:

  • Text and Dialogue
  • Pragmatics

Computational Linguistics:

  • Coreference Resolution
  • Anaphora and Deixis in Spoken Dialogue
  • Generation of Referring Expressions
  • Models of Attentional State
  • Discourse and Dialogue Structure (though I don't believe in it)
  • Lexical Semantics
  • Hedge Detection

Natural Language Processing:

  • Information Extraction
  • Knowledge Acquisition, Ontology Learning
  • Automatic Summarization
  • Detecting Vague and Manipulative Language
  • Natural Language Generation Systems
  • Multi-modal Dialogue Systems

 

publications

Publications in Journals and Books, Conference Proceedings, Workshop Proceedings, and Complete List of Publications

See my publications at Google Scholar.

Send me email if you want to get a copy of a paper not linked on these pages (in those cases we had to transfer the copyright to the respective publishers; maybe linguists should follow the good example of JAIR and start to publish in open access journals).


A Few Recent Publications


  • Zirn, Cäcilia; Niepert, Mathias; Stuckenschmidt, Heiner; Strube, Michael (2011).
    Fine-grained Sentiment Analysis with Structural Features.
    In: IJCNLP '11, pp.336-344. (PDF)
    Best Paper Award!
  • Ponzetto, Simone Paolo; Strube, Michael (2011).
    Taxonomy Induction Based on a Collaboratively Built Knowledge Repository.
    In: Artificial Intelligence, 175 (9/10), pp.1737-1756. (DOI)
  • Cai, Jie; Strube, Michael (2010).
    End-to-End Coreference Resolution via Hypergraph Partitioning.
    In: COLING '10, pp.143-151. (PDF)
  • Filippova, Katja; Strube, Michael (2008).
    Sentence Fusion via Dependency Graph Compression.
    In: EMNLP '08, pp.177-185. (PDF)
  • Nastase, Vivi; Strube, Michael (2008).
    Decoding Wikipedia Categories for Knowledge Acquisition.
    In: AAAI '08, pp.1219-1224. (PDF)
  • Ponzetto, Simone Paolo; Strube, Michael (2007).
    Knowledge Derived from Wikipedia for Computing Semantic Relatedness.
    In: Journal of Artificial Intelligence Research 30, pp.181-212. (PDF).
    Honorable Mention for the IJCAI-JAIR best paper prize 2010 (awarded to an outstanding paper published in JAIR in the preceding five calendar years).
  • Filippova, Katja; Strube, Michael (2007).
    The German Vorfeld and Local Coherence.
    In: Journal of Logic, Language and Information, 16(4), pp.465-485.
    (locked up at Springer's page, accessible maybe to you, but not to me)
  • Ponzetto, Simone Paolo; Strube, Michael (2007).
    Deriving a Large Scale Taxonomy from Wikipedia.
    In: AAAI '07, pp.1440-1445. (PDF)
  • Strube, Michael; Ponzetto, Simone Paolo (2006).
    WikiRelate! Computing Semantic Relatedness Using Wikipedia.
    In: AAAI '06, pp.1419-1424. (PDF)
    This paper did not receive the best paper award (though it was nominated for it), but appears to be the most cited paper of the 2006 conference (out of 236 published papers)!
 

short biography

I received my Ph.D. from the (now defunct) Computational Linguistics Department at the University of Freiburg, Germany, in December 1996 under the supervision of Udo Hahn. Between 1997 and 1999 I was a postdoctoral fellow at the Institute for Research in Cognitive Science at the University of Pennsylvania, Philadelphia, PA. In 2000 I joined the European Media Lab in Heidelberg, Germany, as a researcher. Since 2001 I am group leader of the Natural Language Processing (NLP) Group of HITS, an institute which rapidly underwent several transformations before (hopefully) arriving at its final destination:

European Media Lab → EML Research → HITS.

Anyway, it has been a lot of fun to be here ...

In 2010 I got appointed Honorarprofessor in the Computational Linguistics Department at the University of Heidelberg.

 

some addictions

Photograph of a sign: Tourists are not permitted beyond this point

Literature

What is Jazz?

Running

Photography