PyTorch implementation for convolutional Kolmogorov-Arnold Networks research
Top 61.8% on sourcepulse
This repository provides a comprehensive collection of Convolutional Kolmogorov-Arnold Networks (CKANs) for researchers and practitioners exploring novel neural network architectures. It offers implementations of various CKAN variants, model architectures like ResNet and DenseNet adapted with CKANs, and training scripts for benchmarking on standard datasets, aiming to advance the state-of-the-art in deep learning.
How It Works
The core innovation lies in replacing traditional convolutional kernels (weight matrices) with learnable univariate functions (phi functions), inspired by the Kolmogorov-Arnold representation theorem. This approach allows for more flexible and potentially more efficient feature extraction by learning non-linear transformations directly within the convolution operation, rather than relying solely on subsequent activation functions.
Quick Start & Requirements
pip install -r requirements.txt
python mnist_conv.py
accelerate launch <script_name>.py
(e.g., cifar.py
, tiny_imagenet.py
)Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The README notes that results are preliminary and model architectures are not exhaustively explored. ChebyKAN-based convolutions exhibit stability issues. Performance on CIFAR-10/100 is reported to be significantly lower than classical convolutions, indicating a need for further architectural research for these datasets.
8 months ago
1 day