EXTRACTIVE SUMMARIZATION USING SENTENCE EMBEDDINGS: Automatic summarization of news articles at Blendle
Publication date
Authors
DOI
Document Type
Master Thesis
Metadata
Show full item recordCollections
License
CC-BY-NC-ND
Abstract
In this thesis, we investigate embedding-based extractive summarization techniques, in order to automatically summarize news articles at Blendle. The thesis is comprised of three studies. In the first two studies, we compare existing methods and explore the added value of substituting sentence embeddings. In the third study, we propose a summarization method based on a recurrent neural network (RNN) architecture. This model is an adaptation of the model by Cheng and Lapata (2016). In order to make the RNN training more flexible, we further propose a semi-supervised training framework for this RNN architecture by using unsupervised methods for pre- or co-training the RNN summarizer.
Keywords
extractive summarization, recurrent neural network, neural network, sequence2sequence, sentence embeddings, embeddings, word2vec, Blendle