Splinets – Orthogonal Splines for Functional Data Analysis

This study introduces an efficient workflow for functional data analysis in classification problems, utilizing advanced orthogonal spline bases. The methodology is based on the flexible Splinets package, featuring a novel spline representation designed for enhanced data efficiency. The focus here is to show that the novel features make the package a powerful and efficient tool for advanced functional data analysis. Two main aspects of spline implemented in the package are behind this effectiveness: 1) Utilization of Orthonormal Spline Bases – the workflow incorporates orthonormal spline bases, known as splinets, ensuring a robust foundation for data representation; 2) Consideration of Spline Support Sets – the implemented spline object representation accounts for spline support sets, which refines the accuracy of sparse data representation. Particularly noteworthy are the improvements achieved in scenarios where data sparsity and dimension reduction are critical factors. The computational engine of the package is the dyadic orthonormalization of B-splines that leads the so-called splinets – the efficient orthonormal basis of splines spanned over arbitrarily distributed knots. Importantly, the locality of \(B\)-splines concerning support sets is preserved in the corresponding splinet. This allows for the mathematical elegance of the data representation in an orthogonal basis. However, if one wishes to traditionally use the \(B\)-splines it is equally easy and efficient because all the computational burden is then carried in the background by the splinets. Using the locality of the orthogonal splinet, along with implemented algorithms, the functional data classification workflow is presented in a case study in which the classic Fashion MINST dataset is used. Significant efficiency gains obtained by utilization of the package are highlighted including functional data representation through stable and efficient computations of the functional principal components. Several examples based on classical functional data sets, such as the wine data set, showing the convenience and elegance of working with Splinets are included as well.

Rani Basna (Department of Clinical Sciences) , Hiba Nassar (Cognitive Systems, Department of Applied Mathematics and Computer Science) , Krzysztof Podgórski (Department of Statistics)
2025-07-14

0.1 Supplementary materials

Supplementary materials are available in addition to this article. It can be downloaded at RJ-2024-034.zip

0.2 CRAN packages used

Splinets

0.3 CRAN Task Views implied by cited packages

M. Bader. Space-filling curves: An introduction with applications in scientific computing. Springer Science & Business Media, 2012.
R. Basna, X. Liu, H. Nassar, K. Podgórski and H. Wu. Splinets: Functional data analysis using splines and orthogonal spline bases. 2025. URL https://CRAN.R-project.org/package=Splinets. R package version 1.5.1.
R. Basna, H. Nassar and K. Podgórski. Data driven orthogonal basis selection for functional data analysis. Journal of Multivariate Analysis, 189: 104868, 2022.
R. Basna, H. Nassar and K. Podgórski. Spline-based methods for functional data on multivariate domains. Journal of Mathematics in Industry, 14(1): 13, 2024.
N. Benoudjit, E. Cools, M. Meurens and M. Verleysen. Chemometric calibration of infrared spectrometers: Selection and validation of variables by non-linear models. Chemometrics and intelligent laboratory systems, 70(1): 47–53, 2004.
C. de Boor. A practical guide to splines. In Applied mathematical sciences, 1978.
O. Cho and M. J. Lai. A class of compactly supported orthonormal B-spline wavelets. Splines and Wavelets, 123–151, 2005.
T. N. Goodman. A class of orthogonal refinable functions and wavelets. Constructive approximation, 19(4): 525–540, 2003.
X. Liu, H. Nassar and K. Podgórski. Dyadic diagonalization of positive definite band matrices and efficient B-spline orthogonalization. Journal of Computational and Applied Mathematics, 414: 114444, 2022.
J. Mason, G. Rodriguez and S. Seatzu. Orthogonal splines based on B-splines – with applications to least squares, smoothing and regularisation problems. Numerical Algorithms, 5(1): 25–40, 1993.
H. Nassar and K. Podgórski. Periodic splinets. Communications in Statistics - Simulation and Computation, 1–16, 2025.
T. Nguyen. Construction of spline type orthogonal scaling functions and wavelets. Paper 19. Illinois Wesleyan University. 2015.
A. Perperoglou, W. Sauerbrei, M. Abrahamowicz and M. Schmid. A review of spline function procedures in R. BMC Medical Research Methodology, 19(1): 46, 2019.
E. F. Petricoin, A. M. Ardekani, B. A. Hitt, P. J. Levine, V. A. Fusaro, S. M. Steinberg, G. B. Mills, C. Simone, D. A. Fishman, E. C. Kohn, et al. Use of proteomic patterns in serum to identify ovarian cancer. Lancet, 359(9306): 572–577, 2002.
K. Podgórski. Splinets – splines through the Taylor expansion, their support sets and orthogonal bases. 2021. URL https://arxiv.org/abs/2102.00733.
K. Qin. General matrix representations for B-splines. Vis. Comput., 16: 177–186, 2000.
A. Redd. A comment on the orthogonalization of B-spline basis functions and their derivatives. Stat. Comput, 22: 251–257, 2012.
A. Redd. Orthogonalsplinebasis: Orthogonal b-spline basis functions. 2022. URL https://CRAN.R-project.org/package=orthogonalsplinebasis. R package version 0.1.7.
L. Schumaker. Spline functions: Basic theory. Cambridge University Press, 2007.
A. C. Valdez. Gghilbertstrings: A fast ’ggplot2’-based implementation of hilbert curves. 2021. URL https://CRAN.R-project.org/package=gghilbertstrings. R package version 0.3.3.
H. Xiao, K. Rasul and R. Vollgraf. Fashion-mnist: A novel image dataset for benchmarking machine learning algorithms. 2017. URL https://arxiv.org/abs/1708.07747.
L. Zhou, J. Z. Huang and R. J. Carroll. Joint modeling of paired sparse functional data using principal components. Biometrika, 95: 601–619, 2008.

References

Reuse

Text and figures are licensed under Creative Commons Attribution CC BY 4.0. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".

Citation

For attribution, please cite this work as

Basna, et al., "Splinets -- Orthogonal Splines for Functional Data Analysis", The R Journal, 2025

BibTeX citation

@article{RJ-2024-034,
  author = {Basna, Rani and Nassar, Hiba and Podgórski, Krzysztof},
  title = {Splinets -- Orthogonal Splines for Functional Data Analysis},
  journal = {The R Journal},
  year = {2025},
  note = {https://doi.org/10.32614/RJ-2024-034},
  doi = {10.32614/RJ-2024-034},
  volume = {16},
  issue = {4},
  issn = {2073-4859},
  pages = {42-61}
}