pith. sign in

arxiv: 1804.11287 · v3 · pith:CHM5N5WPnew · submitted 2018-04-30 · 💻 cs.CG

Computing Approximate Statistical Discrepancy

classification 💻 cs.CG
keywords rangeconsiderdefineddiscrepancymanyproblemvaluesalgorithms
0
0 comments X
read the original abstract

Consider a geometric range space $(X,\c{A})$ where each data point $x \in X$ has two or more values (say $r(x)$ and $b(x)$). Also consider a function $\Phi(A)$ defined on any subset $A \in (X,\c{A})$ on the sum of values in that range e.g., $r_A = \sum_{x \in A} r(x)$ and $b_A = \sum_{x \in A} b(x)$. The $\Phi$-maximum range is $A^* = \arg \max_{A \in (X,\c{A})} \Phi(A)$. Our goal is to find some $\hat{A}$ such that $|\Phi(\hat{A}) - \Phi(A^*)| \leq \varepsilon.$ We develop algorithms for this problem for range spaces with bounded VC-dimension, as well as significant improvements for those defined by balls, halfspaces, and axis-aligned rectangles. This problem has many applications in many areas including discrepancy evaluation, classification, and spatial scan statistics.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.