pith. sign in

arxiv: 1906.02126 · v1 · pith:Z5YFCFIVnew · submitted 2019-05-15 · 💻 cs.CL · cs.AI

Extractive Summarization via Weighted Dissimilarity and Importance Aligned Key Iterative Algorithm

classification 💻 cs.CL cs.AI
keywords algorithmdissimilarityimportancesentencesiterativesummarizationweightedalgorithms
0
0 comments X
read the original abstract

We present importance aligned key iterative algorithm for extractive summarization that is faster than conventional algorithms keeping its accuracy. The computational complexity of our algorithm is O($SNlogN$) to summarize original $N$ sentences into final $S$ sentences. Our algorithm maximizes the weighted dissimilarity defined by the product of importance and cosine dissimilarity so that the summary represents the document and at the same time the sentences of the summary are not similar to each other. The weighted dissimilarity is heuristically maximized by iterative greedy search and binary search to the sentences ordered by importance. We finally show a benchmark score based on summarization of customer reviews of products, which highlights the quality of our algorithm comparable to human and existing algorithms. We provide the source code of our algorithm on github https://github.com/qhapaq-49/imakita .

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.