Skip navigation
Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp01s4655k321
Title: An Exploration of Multiclass Multidomain Image Classification
Authors: Nair, Prem
Advisors: Russakovsky, Olga
Department: Computer Science
Class Year: 2018
Abstract: Convolutional neural networks (CNNs) are state of the art in image multiclass classification tasks. Often, these tasks involves more complex scenarios involving multiple attributes we want to either classify or reduce bias against. We call non-primary attributes in these tasks "domains." A problem arises where performance and fairness of our classifiers deteriorate as a result of these additional requirements, often in an unexpectedly large way. First, we examine the intersection of this problem and the area of multi-task learning. We construct two multi-task image classification tasks: (1) predicting coarse and fine-grained labels on the CIFAR-100 dataset, and (2) predicting dataset domain origin and digit classification from the combination of the SVHN and MNIST datasets. We demonstrate the failure of an automatic method to balance performance across the classification tasks and consider explanations for this behavior. Next, we construct a novel dataset from CIFAR-10 that provides a framework for studying, benchmarking, and mitigating bias in a multidomain setting. While the classification task appears deceptively simple, we show experimental results of novel CNN training and inference procedures that demonstrate some success toward the challenge of bias mitigation. Finally, we apply the mitigation strategies we have developed on an activity classification task on the imSitu dataset, and reveal real-world improvements on gender bias.
URI: http://arks.princeton.edu/ark:/88435/dsp01s4655k321
Type of Material: Princeton University Senior Theses
Language: en
Appears in Collections:Computer Science, 1988-2020

Files in This Item:
File Description SizeFormat 
NAIR-PREM-THESIS.pdf589.39 kBAdobe PDF    Request a copy


Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.