History of research
 S. Kaczmarz, Bulletin international de l’Acadámie polonaise des sciences et des lettres. Classe des sciences mathématiques et naturelles. Série A, Sciences mathématiques, 35:355357, 1937.
Introduction of iterative method for solution of system of linear algebraic equations by sequential projecting an approximation from one hyper plane to another. The
method is very effective when involved equations form near orthogonal hype plane system.
 V.I. Arnold, Doklady Akademii Nauk SSSR 114(4):679681 (1957).
Representations of the functions with three variables by the sum of univariate funcions.
 A.N. Kolmogorov, Doklady Akademii Nauk SSSR 114(5):953956 (1957).
Generalization of the previous concept to the case of any continuous multivariate function.
 G.G. Lorentz, Am. Math. Mon. 69(6): 469485 (1962).
Introduced different adequate form of KolmogorovArnold representation.
 D.A. Sprecher, Trans. Am. Math. Soc. 115(3):340355 (1965).
Also introduced different from Lorentz adequate form of KolmogorovArnold representation.
 B.L. Fridman, Doklady Akademii Nauk SSSR 177(5):10191022 (1967).
Proven that inner functions of KolmogorovArnold representation can be chosen as satisfying Lipschitz conditions and are independent from the given multivariate function.
 D.A. Sprecher, J. Math. Anal. Appl. 38(1):208213 (1972).
Proven that inner functions can be chosen monotonic.
 V.V. Krylov, Avtomatika i Telemehanika 40(5):8288 (1979).
Researched properties of discrete Urysohn operators. These operators are building blocks of KolmorogorovArnold representation.
 K.H. Meyn, Numerische Mathematik, 42(2):161172 (1983).
Convergence of nonlinear extension of Kaczmarz method is proven.
 R. HechtNielsen, in Proceedings of the International Conference on Neural Networks, pp. 1114 (1987).
First clear justification that KolmogorovArnold representation is a neural network equivalent.
 T.J. Hastie, R.J. Tibshirani, Generalized Additive Models 1(3):297318 (1986).
The approximate GAM model was constructed.
 V. Kůrková, Neural Networks 5(3):501506 (1992).
Proposed the first neural network based on KolmogorovArnold representation with two hidden layers.
 M. Nees, Journal of Computational and Applied Mathematics 54(2):239250 (1994).
Provided constructive proof of Kolmogorov superposition theorem.
 D.A. Sprecher, Neural Networks 9(5):765772 (1996).
The paper gives an explicit numerical implementation of the hidden layer that also enables the implementation of the output layer.
 D.A. Sprecher, Neural Networks 10(3):447457 (1997).
Paper presents a numerical algorithm for the parallel computations of the outer functions in Kolmogorov superpositions.
 M. Köppen, in Artificial Neural Networks – ICANN 2002, pp. 474479 (2002).
Customization of Sprecher algorithm for resampling of image function.
 B. Igelnik, N. Parikh, IEEE Transactions on Neural Networks 14(4):725733 (2003).
First implementation of the cubic splines for approximation of KolmogorovArnold representation.
 M. Coppejans, Journal of Econometrics 123(1):131 (2004).
Another application of cubic splines for slightly different form of nested functions tree.
 D. Bryant, PhD thesis, University of Central Florida (2008).
Analysis of Kolmogorov's Superposition theorem and its implementation in applications with low and high dimensional data.
 J. Braun, M. Griebel, Constructive Approximation 30(3):653675 (2009).
Theoretical research of the properties of the inner functions.
 P.E. Leni, PhD thesis, Université de Bourgogne (2010).
Using KA model for image processing.
 X. Liu, PhD thesis, Imperial College London (2015).
Using KA model for image processing.
 J. Actor, MA thesis, Rice University (2018).
Particular algorithm for KA approximation using monotonic properties of inner functions.
 M. Poluektov, A. Polar, Journal of the Franklin Institute 357:38653892 (2020).
Kaczmarz algorithm for identification of discrete Urysohn operators which are building blocks of KA representation.
 H. Montanelli, H. Yang, Neural Networks 129:16 (2020).
Using deep ReLU networks for approximation of KA representation.
 A. Polar, M. Poluektov, Engineering Applications of Artificial Intelligence, 99:104137 (2021).
Using Kaczmarz algorithm and piecewise linear functions for training of KA networks.
 J. SchmidtHieber, Neural Networks 137:119126 (2021).
Some improvements of using deep ReLU networks for KA representation.
 H. van Deventer, P. J. van Rensburg et al., arXiv:2205.06376 (2022).
Another example of using cubic splines for KA approximation.
 M. Poluektov, A. Polar, arXiv:2305.08194 (2023).
Using Kaczmarz algorithm with arbitrary basis functions and proof of local convergence.
 A. Ismayilova, V.E. Ismailov, Neural Networks 176:106333 (2024).
Theoretical research on the properties of KAN functions. Suggestion that used functions can be discontinuous.
 Z. Liu, Y. Wang et al., arXiv:2404.19756 (2024).
Using Broyden algorithm for training KAN, pruning, using KAN for numerical solution of partial derivative equations.

