Efficient Large Dimensional Self-Organising Maps with PyTorch | by Mathieu d’Aquin | Dec, 2024

Because it’s fun to self-organise

Self-organising maps (or Kohonen maps) are an interesting kind of neural networks: they don’t follow the same kind of architecture and are definitely trained differently from the usual backpropagation methods. There is a good reason for this: they are meant to be used for unsupervised learning. They are to the usual multi-layer neural networks what K-Means is to SVM. They create clusters; they discretise the data space. But they have one thing that makes them different from other clustering methods: The clusters that they create form a map of the data (a grid of clusters) where the distance between clusters in that map represents the distance that exists between the average members of those clusters in the data space.

Because they are slightly atypical, there has not been as much work done on creating efficient implementations of self-organising maps (SOMs) as for other forms of neural networks, in particular with respect to enabling them to handle highly dimensional data on GPUs (i.e., they are typically used on data with not more than a few dozen features). Too bad, since that is exactly what I needed for a project: fast SOM training on data with thousands of features. I had tried existing libraries, including those based on PyTorch, and was not quite satisfied, so I made my own: ksom (admittedly also because it is fun to do, especially as a way to get better at using PyTorch).

Efficient Large Dimensional Self-Organising Maps with PyTorch | by Mathieu d’Aquin | Dec, 2024

Because it’s fun to self-organise

Recent Articles

Advanced Time Intelligence in DAX with Performance in Mind

North Korean Hackers Target Freelance Developers in Job Scam to Deploy Malware

Nvidia’s Big Plan to Beat Back RTX 5090 Scalpers Is the Same as Everybody Else

Why Data Scientists Should Care about Containers — and Stand Out with This Knowledge

KGGen: Advancing Knowledge Graph Extraction with Language Models and Clustering Techniques

Related Stories

Leave A Reply Cancel reply