Interpreting subboundary statistics

A subboundary is a set of connected Boundary Elements (BEs). The set of subboundaries found for a data set or data sets make up the boundary. There are two alternative hypotheses in subboundary statistics, either large-scale boundaries or boundary fragmentation .

Under a boundary-generating process, we would expect a contiguous boundary with few subboundaries (Ns), few singletons (N1), high subboundary length (L, both mean and max), high subboundary diameter (D, both mean and max), and low subboundary branchiness (diameter to length ratio, D/L).

Under boundary fragmentation, we would expect lots of singleton subboundaries (high Ns and N1), low subboundary length, low diameter, and high branchiness.

The following table summarizes the predictions of each alternative hypothesis.

Statistic	Meaning	Boundaries (Ha1)	Fragmentation (Ha2)
NS	number of subboundaries	low (lower tail of distribution is significant)	high (upper tail of distribution is significant)
N1	number of singleton Boundary Elements	low	high
Lmax	maximum subboundary length (number of linked BEs)	high	low
Lmean	mean subboundary length	high	low
Dmax	maximum subboundary diameter	high	low
Dmean	mean subboundary diameter	high	low
D/L	mean diameter-to-length ratio (indicates branchiness)	low	high

You can use Monte Carlo randomization to determine whether the observed value of a test statistic is either significantly high or low. BoundarySeer will present the p-values for the upper and lower tails of the Monte Carlo distribution. Use the table above to determine which tail to evaluate for which alternative hypothesis. To evaluate whether a test statistic is unusually low, examine the lower tail p-value (from the lower end of the distribution). To evaluate whether a test statistic is unusually high, examine the upper tail p-value (from the upper end of the distribution).

See also:

Examples:

Leukemia tutorial
Songbirds tutorial