Graph-Based Two-Sample Tests for Data with Repeated Observations

Hao Chen; Jingru Zhang

arxiv: 1711.04349 · v2 · pith:LSIP26D2new · submitted 2017-11-12 · 📊 stat.ME

Graph-Based Two-Sample Tests for Data with Repeated Observations

Jingru Zhang , Hao Chen This is my paper

classification 📊 stat.ME

keywords testsgraph-basedobservationsdataextendedgraphrepeatedsimilarity

0 comments

read the original abstract

In the regime of two-sample comparison, tests based on a graph constructed on observations by utilizing similarity information among them is gaining attention due to their flexibility and good performances for high-dimensional/non-Euclidean data. However, when there are repeated observations, these graph-based tests could be problematic as they are versatile to the choice of the similarity graph. We propose extended graph-based test statistics to resolve this problem. The analytic p-value approximations to these extended graph-based tests are derived to facilitate the application of these tests to large datasets. The new tests are illustrated in the analysis of a phone-call network dataset. All tests are implemented in an R package gTests.

This paper has not been read by Pith yet.

Graph-Based Two-Sample Tests for Data with Repeated Observations

discussion (0)