1. Data preprocessing

Here, we’ll walk through preprocessing 2D embedding data so that it can be binned into regular hexagons.

library(quollr)
library(dplyr)

First, you’ll need 2D embedding data generated for your training data. For our example, we’ll use a 3-$d$ S-curve dataset with four additional noise dimensions. We’ve used UMAP as our non-linear dimension reduction technique to generate embeddings for the S-curve data.

scaled_umap <- gen_scaled_data(data = s_curve_noise_umap)

glimpse(scaled_umap)
#> List of 3
#>  $ scaled_nldr: tibble [3,750 × 3] (S3: tbl_df/tbl/data.frame)
#>   ..$ emb1: num [1:3750] 0.27 0.788 0.771 0.306 0.549 ...
#>   ..$ emb2: num [1:3750] 0.839 0.466 0.319 0.542 0.806 ...
#>   ..$ ID  : int [1:3750] 1 2 3 5 6 7 9 10 11 12 ...
#>  $ lim1       : num [1:2] -13.9 13.4
#>  $ lim2       : num [1:2] -12.8 11.4

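The gen_scaled_data() function rescales the 2D embedding so that it can later be binned into regular hexagons. As the output above shows, it returns a list of three elements: the scaled embedding (scaled_nldr, with columns emb1, emb2 and ID) and two length-2 vectors (lim1 and lim2), which appear to hold the ranges of the two embedding axes before scaling.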
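
To give a feel for this kind of rescaling, here is a minimal base R sketch. It is not quollr's implementation: scale_embedding_sketch() is a hypothetical helper, and the rule it uses (min-max scale the first axis to [0, 1], then divide the second axis by the same width so the aspect ratio is preserved) is an assumption for illustration. The exact range that gen_scaled_data() gives the second axis is not shown on this page.

```r
# Hypothetical helper (not part of quollr): aspect-preserving min-max scaling.
# Assumes `data` has numeric columns emb1 and emb2 plus an ID column.
scale_embedding_sketch <- function(data) {
  # Record the original range of each embedding axis
  lim1 <- range(data$emb1)
  lim2 <- range(data$emb2)

  # Width of the first axis; used as the common scaling factor
  width <- diff(lim1)

  scaled <- data.frame(
    emb1 = (data$emb1 - lim1[1]) / width,  # mapped onto [0, 1]
    emb2 = (data$emb2 - lim2[1]) / width,  # same factor, so shapes are not distorted
    ID   = data$ID
  )

  # Mirror the list layout seen in the glimpse() output above
  list(scaled_nldr = scaled, lim1 = lim1, lim2 = lim2)
}

# Toy embedding, invented purely for illustration
toy <- data.frame(emb1 = c(-2, 0, 2), emb2 = c(-1, 0, 1), ID = 1:3)
out <- scale_embedding_sketch(toy)
str(out)
```

Using one scaling factor for both axes matters for hexagonal binning: if each axis were stretched independently to [0, 1], a grid of regular hexagons in the scaled space would no longer correspond to regular hexagons in the original embedding.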