تقرير
Fair Interpretable Learning via Correction Vectors
العنوان: | Fair Interpretable Learning via Correction Vectors |
---|---|
المؤلفون: | Cerrato, Mattia, Köppel, Marius, Segner, Alexander, Kramer, Stefan |
سنة النشر: | 2022 |
المجموعة: | Computer Science Statistics |
مصطلحات موضوعية: | Computer Science - Machine Learning, Statistics - Machine Learning |
الوصف: | Neural network architectures have been extensively employed in the fair representation learning setting, where the objective is to learn a new representation for a given vector which is independent of sensitive information. Various "representation debiasing" techniques have been proposed in the literature. However, as neural networks are inherently opaque, these methods are hard to comprehend, which limits their usefulness. We propose a new framework for fair representation learning which is centered around the learning of "correction vectors", which have the same dimensionality as the given data vectors. The corrections are then simply summed up to the original features, and can therefore be analyzed as an explicit penalty or bonus to each feature. We show experimentally that a fair representation learning problem constrained in such a way does not impact performance. Comment: ICLR-21 Workshop on Responsible AI |
نوع الوثيقة: | Working Paper |
الوصول الحر: | http://arxiv.org/abs/2201.06343Test |
رقم الانضمام: | edsarx.2201.06343 |
قاعدة البيانات: | arXiv |
الوصف غير متاح. |