Categorical Deep Learning

The application of deep learning techniques to analyze and predict categorical data: discrete, typically non-numeric values that represent categories or classes.

Categorical deep learning is central to tasks where data naturally falls into discrete categories, such as image recognition, natural language processing, and classification problems generally. The approach typically transforms categorical variables into a form suitable for neural networks, either through one-hot encoding or through embedding layers, which map each category to a dense vector. Learned embeddings capture the relationships and semantics among categories more effectively than sparse encodings. The network is then trained to recognize patterns and make predictions from these representations, optimizing its weights through backpropagation.
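The two encodings mentioned above can be sketched in plain Python. This is a minimal illustration, not a full model: the category names and the embedding table values are invented for the example, and in a real network the embedding vectors would be trainable parameters updated by backpropagation rather than fixed constants.

```python
# Hypothetical categorical feature "color" with three possible values.
categories = ["red", "green", "blue"]
index = {c: i for i, c in enumerate(categories)}

def one_hot(value):
    """One-hot encoding: a sparse vector with a single 1 at the category's index."""
    vec = [0.0] * len(categories)
    vec[index[value]] = 1.0
    return vec

# Embedding layer: each category maps to a dense vector. The values below are
# fixed for illustration; a real model would learn them during training.
embedding_table = [
    [0.10, -0.30],   # red
    [0.25,  0.40],   # green
    [-0.15, 0.05],   # blue
]

def embed(value):
    """Embedding lookup: return the dense vector for a category."""
    return embedding_table[index[value]]

print(one_hot("green"))  # [0.0, 1.0, 0.0]
print(embed("blue"))     # [-0.15, 0.05]
```

One-hot vectors grow with the number of categories and treat all categories as equally dissimilar; embeddings keep the dimension fixed and let training place related categories near each other in the vector space.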

Historical Overview: The concept of processing categorical data through neural networks has been around since the 1980s but saw significant advancements in the 2010s with the rise of deep learning. Techniques such as embedding layers became popular around 2013, particularly highlighted by their use in Google’s Word2Vec for processing words (which can be seen as categories in natural language).

Key Contributors: Key developments in this area have often been tied to broader advances in neural network research. Notable figures include Geoffrey Hinton, Yann LeCun, and Yoshua Bengio, who have broadly contributed to the methods that underpin categorical deep learning. Google’s Tomas Mikolov was instrumental in the development of Word2Vec, a technique integral to the handling of categorical data in natural language processing.