split_spxy — SPXY Splitter

Group: Splitters · Binding: n4m.sklearn.SPXYSplitter · C ABI: n4m_split_spxy_*

Description

SPXY (Sample set Partitioning based on X and Y) train/test split.

Parameters

Name       Type   Default
test_size  float  0.25

Explanations

Bibliographic source

Standard spectroscopic operator — see the nirs4all preprocessing / augmentation handbook and the cited literature within the binding docstring.

Mathematical principle

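SPXY (Sample set Partitioning based on X and Y) produces a train/test split. In its usual formulation it extends the Kennard-Stone selection by measuring sample dissimilarity in both the predictor space X and the response space y. For each pair of samples (p, q), the joint distance is d_xy(p, q) = d_x(p, q) / max d_x + d_y(p, q) / max d_y, where d_x and d_y are Euclidean distances and each maximum is taken over all pairs. Selection starts from the two most distant samples and then repeatedly adds the sample whose smallest joint distance to the already selected set is largest. The selected samples form the training set and the remainder forms the test set.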
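The selection rule SPXY is usually described with can be sketched in plain NumPy. This is an illustrative reference only, not the libn4m kernel: the function names `pairwise_euclidean` and `spxy_split` are hypothetical, and the way `test_size` is rounded to a sample count is an assumption rather than documented behaviour.

```python
import numpy as np


def pairwise_euclidean(A):
    """Return the (n, n) matrix of Euclidean distances between rows of A."""
    # Broadcasting builds an (n, n, p) array, which is fine for small inputs.
    diff = A[:, None, :] - A[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))


def spxy_split(X, y, test_size=0.25):
    """Hypothetical SPXY sketch: return (train_idx, test_idx)."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float).reshape(len(X), -1)
    n = X.shape[0]
    # Assumption: the test count is test_size * n rounded to the nearest integer.
    n_train = n - int(round(test_size * n))
    if n_train < 2:
        raise ValueError("SPXY needs at least two training samples")

    # Joint distance: X and y distances, each scaled by its own maximum.
    dx = pairwise_euclidean(X)
    dy = pairwise_euclidean(y)
    d = dx / dx.max() + dy / dy.max()

    # Seed the training set with the two most distant samples.
    i, j = np.unravel_index(np.argmax(d), d.shape)
    train = [int(i), int(j)]
    is_selected = np.zeros(n, dtype=bool)
    is_selected[train] = True

    # min_dist[k] = distance from sample k to its nearest selected sample.
    min_dist = np.minimum(d[i], d[j])
    while len(train) < n_train:
        # Max-min rule: pick the unselected sample farthest from the selected set.
        k = int(np.argmax(np.where(is_selected, -np.inf, min_dist)))
        train.append(k)
        is_selected[k] = True
        min_dist = np.minimum(min_dist, d[k])

    return np.array(train), np.flatnonzero(~is_selected)
```

Training indices come back in selection order, so the first two entries are always the most distant pair. The sketch does not guard against constant X or constant y, where the corresponding maximum distance is zero.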

Implementation

C ABI n4m_split_spxy_* in libn4m (create / apply / destroy lifecycle), wrapped by n4m.sklearn.SPXYSplitter. The same numerical kernel backs every language binding.

Usage

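import numpy as np
from n4m.sklearn import SPXYSplitter
# Placeholder data: 50 samples x 250 features.
X = np.random.default_rng(0).normal(size=(50, 250))
op = SPXYSplitter()
X_transformed = op.fit_transform(X)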

Benchmarks

Adaptive wall-clock per cell measured against full_matrix.csv. Only backends that implement this method are listed; libraries without the method are omitted.

Verdict  ·  ✓ ref / ≈ ref / ~ shape mark a reference-gate pass at strict / relaxed / qualitative tolerance  ·  ✓ bind = pls4all binding agrees with the C++ baseline  ·  ⇄ cross-check = documented by-design selector/RNG/model, noncanonical API/facade convention, or secondary oracle  ·  ✗ divergent  ·  ⚠ error  ·  — not run. The fastest backend per column is marked 🏆.

Reference gate: strict — numeric equivalence (rmse_rel_tol 1e-12).
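The exact formula behind `rmse_rel_tol` is not stated on this page. As a rough illustration of what a strict numeric-equivalence gate could look like, the sketch below assumes the metric is the RMSE of the difference divided by the RMS of the reference output. The function name `passes_strict_gate` and that normalisation are assumptions, not the benchmark harness's actual code.

```python
import numpy as np


def passes_strict_gate(candidate, reference, rmse_rel_tol=1e-12):
    """Hypothetical check: relative RMSE against a reference output."""
    candidate = np.asarray(candidate, dtype=float)
    reference = np.asarray(reference, dtype=float)
    if candidate.shape != reference.shape:
        return False
    rmse = np.sqrt(np.mean((candidate - reference) ** 2))
    # Assumption: normalise by the RMS magnitude of the reference.
    scale = np.sqrt(np.mean(reference ** 2))
    if scale == 0.0:
        # An all-zero reference can only be matched exactly.
        return bool(rmse == 0.0)
    return bool(rmse / scale <= rmse_rel_tol)
```

At a tolerance of 1e-12 the gate accepts only differences close to floating-point round-off, so any change in which samples are selected would fail it.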

Backend               Parity  50×250 (ms)  250×50 (ms)
C++ native · libn4m
pls4all.cpp.blas      ✓ ref   0.28
pls4all.cpp.blas+omp  ✓ ref   0.28         1.76
pls4all.cpp.omp       ✓ ref   0.28
pls4all.cpp.ref       ✓ ref   0.16         1.49
Python · pls4all
pls4all.python        ✓ bind  0.12 🏆      0.71 🏆
pls4all.sklearn       ✓ bind  0.13         0.73
Python · external
nirs4all              source  0.40         3.42
