pith. sign in

arxiv: 1304.8016 · v1 · pith:6P36Y74Onew · submitted 2013-04-23 · 💻 cs.DS · cs.CL

On Semantic Word Cloud Representation

classification 💻 cs.DS cs.CL
keywords wordalgorithmsproblemrectanglerelatedsemanticallyseveralvariants
0
0 comments X
read the original abstract

We study the problem of computing semantic-preserving word clouds in which semantically related words are close to each other. While several heuristic approaches have been described in the literature, we formalize the underlying geometric algorithm problem: Word Rectangle Adjacency Contact (WRAC). In this model each word is associated with rectangle with fixed dimensions, and the goal is to represent semantically related words by ensuring that the two corresponding rectangles touch. We design and analyze efficient polynomial-time algorithms for some variants of the WRAC problem, show that several general variants are NP-hard, and describe a number of approximation algorithms. Finally, we experimentally demonstrate that our theoretically-sound algorithms outperform the early heuristics.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.