Skip to contents

This function generates a dataset consisting of multiple circular clusters together with a single Gaussian cluster in a \(p\)-dimensional space. The curvycycle are placed concentrically at the origin with varying scales, while the Gaussian cluster serves as an additional background or center cluster.

Usage

make_gaucurvycycle(
  n = c(200, 100, 100),
  p = 4,
  num_curvycycle = 2,
  scale_curvycycle = c(1, 2)
)

Arguments

n

An integer vector of length num_curvycycle + 1, giving the number of points in each cluster (curvycycle first, followed by the Gaussian). Default is c(200, 100, 100).

p

Integer, the dimensionality of the embedding space. Must be at least 3. Default is 4.

num_curvycycle

Integer, the number of circular clusters to generate. Default is 2.

scale_curvycycle

Numeric vector of length num_curvycycle, giving the scale (radius) of each circular cluster. Default is c(1, 2).

Value

A data frame (or tibble, depending on gen_multicluster()) containing the generated dataset with cluster assignments.

Examples

# Two curvycycle (radii 1 and 2) plus one Gaussian cluster in 4-D
gaucurvycycle <- make_gaucurvycycle()
#>  Data generation completed successfully!!!
#>  Data generation completed successfully!!!
#>  Data generation completed successfully!!!
#>  Multiple clusters generation completed successfully!!!