Unlocking the Transferability of Tokens in Deep Models for Tabular Data

التفاصيل البيبلوغرافية
العنوان:	Unlocking the Transferability of Tokens in Deep Models for Tabular Data
المؤلفون:	Zhou, Qi-Le, Ye, Han-Jia, Wang, Le-Ye, Zhan, De-Chuan
سنة النشر:	2023
المجموعة:	Computer Science
مصطلحات موضوعية:	Computer Science - Machine Learning
الوصف:	Fine-tuning a pre-trained deep neural network has become a successful paradigm in various machine learning tasks. However, such a paradigm becomes particularly challenging with tabular data when there are discrepancies between the feature sets of pre-trained models and the target tasks. In this paper, we propose TabToken, a method aims at enhancing the quality of feature tokens (i.e., embeddings of tabular features). TabToken allows for the utilization of pre-trained models when the upstream and downstream tasks share overlapping features, facilitating model fine-tuning even with limited training examples. Specifically, we introduce a contrastive objective that regularizes the tokens, capturing the semantics within and across features. During the pre-training stage, the tokens are learned jointly with top-layer deep models such as transformer. In the downstream task, tokens of the shared features are kept fixed while TabToken efficiently fine-tunes the remaining parts of the model. TabToken not only enables knowledge transfer from a pre-trained model to tasks with heterogeneous features, but also enhances the discriminative ability of deep tabular models in standard classification and regression tasks.
نوع الوثيقة:	Working Paper
الوصول الحر:	http://arxiv.org/abs/2310.15149Test
رقم الانضمام:	edsarx.2310.15149
قاعدة البيانات:	arXiv

View record in Arxiv

الوصف
الوصف غير متاح.