Exploring the subtopic-based relationship map strategy for multi-document summarization

Authors

  • Rafael Ribaldo Universidade de São Paulo
  • Paula Christina Figueira Cardoso
  • Thiago Alexandre Salgueiro Pardo

DOI:

https://doi.org/10.22456/2175-2745.59104

Abstract

In this paper we adapt and explore strategies for generating multi-document summaries based on relationship maps, which represent texts as graphs (maps) of interrelated segments and apply different traversing techniques for producing the summaries. In particular, we focus on the Segmented Bushy Path, a sophisticated method which tries to represent in a summary the main subtopics from source texts while keeping its informativeness. In addition, we also investigate some well-known subtopic segmentation and clustering techniques in order to correctly select the most relevant information to compose the final summary. We show that this subtopic-based method outperforms other methods for multi-document summarization and that achieves state of the art results, competing with the most sophisticated deep summarization methods in the area.

Downloads

Download data is not yet available.

Author Biography

Rafael Ribaldo, Universidade de São Paulo

Núcleo Interinstitucional de Linguística Computacional (NILC) - Instituto de Ciências Matemáticas e de Computação (ICMC)

Downloads

Published

2016-05-29

How to Cite

Ribaldo, R., Christina Figueira Cardoso, P., & Alexandre Salgueiro Pardo, T. (2016). Exploring the subtopic-based relationship map strategy for multi-document summarization. Revista De Informática Teórica E Aplicada, 23(1), 183–211. https://doi.org/10.22456/2175-2745.59104

Issue

Section

Regular Papers