pith. sign in

arxiv: 1305.1422 · v4 · pith:7RJGZBWRnew · submitted 2013-05-07 · 💻 cs.DC · cs.MS· cs.NE

Somoclu: An Efficient Parallel Library for Self-Organizing Maps

classification 💻 cs.DC cs.MScs.NE
keywords mapstrainingdataexecutionlargeparallelself-organizingsomoclu
0
0 comments X
read the original abstract

Somoclu is a massively parallel tool for training self-organizing maps on large data sets written in C++. It builds on OpenMP for multicore execution, and on MPI for distributing the workload across the nodes in a cluster. It is also able to boost training by using CUDA if graphics processing units are available. A sparse kernel is included, which is useful for high-dimensional but sparse data, such as the vector spaces common in text mining workflows. Python, R and MATLAB interfaces facilitate interactive use. Apart from fast execution, memory use is highly optimized, enabling training large emergent maps even on a single computer.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.