Contextual HyperNetworks for Novel Feature Adaptation

Angus Lamb; Camilla Longden; Cheng Zhang; Evgeny Saveliev; Jos\'e Miguel Hern\'andez-Lobato; Pashmina Cameron; Richard E. Turner; Sebastian Tschiatschek; Simon Woodhead; Yingzhen Li

arxiv: 2104.05860 · v1 · pith:2SH6I3YFnew · submitted 2021-04-12 · 💻 cs.LG

Contextual HyperNetworks for Novel Feature Adaptation

Angus Lamb , Evgeny Saveliev , Yingzhen Li , Sebastian Tschiatschek , Camilla Longden , Simon Woodhead , Jos\'e Miguel Hern\'andez-Lobato , Richard E. Turner

show 2 more authors

Pashmina Cameron Cheng Zhang

This is my paper

classification 💻 cs.LG

keywords featuresneuralfeaturelearningmodelnoveloutputadaptation

0 comments

read the original abstract

While deep learning has obtained state-of-the-art results in many applications, the adaptation of neural network architectures to incorporate new output features remains a challenge, as neural networks are commonly trained to produce a fixed output dimension. This issue is particularly severe in online learning settings, where new output features, such as items in a recommender system, are added continually with few or no associated observations. As such, methods for adapting neural networks to novel features which are both time and data-efficient are desired. To address this, we propose the Contextual HyperNetwork (CHN), an auxiliary model which generates parameters for extending the base model to a new feature, by utilizing both existing data as well as any observations and/or metadata associated with the new feature. At prediction time, the CHN requires only a single forward pass through a neural network, yielding a significant speed-up when compared to re-training and fine-tuning approaches. To assess the performance of CHNs, we use a CHN to augment a partial variational autoencoder (P-VAE), a deep generative model which can impute the values of missing features in sparsely-observed data. We show that this system obtains improved few-shot learning performance for novel features over existing imputation and meta-learning baselines across recommender systems, e-learning, and healthcare tasks.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

TabICL: A Tabular Foundation Model for In-Context Learning on Large Data
cs.LG 2025-02 unverdicted novelty 6.0

TabICL scales in-context learning to large tabular data via column-then-row attention for row embeddings followed by a transformer, matching TabPFNv2 speed and performance while outperforming it and CatBoost on datase...