Table of Contents

Competition

The competition is hosted on Kaggle at this link.

The teams will be composed of 2 people.

We provide you with a Starter Code which demonstrates how to read the data, train a network and make a submission. You are encouraged to start your work from this notebook.

Description

Traditional image classification models heavily rely on accurately labeled data for training, but in real-world scenarios, acquiring large quantities of labeled images can be costly and time-consuming.

Additionally, label noise or mislabeling can further complicate the training process, leading to decreased model performance.

In this challenge, we provide you with a dataset that poses both obstacles: a significant portion of the training data remains unlabeled, and an unknown percentage of the labeled data contains mislabeled instances.

Your task is to develop innovative deep learning algorithms and techniques to overcome these challenges and build a robust image classification model.

To succeed in this competition, participants are encouraged to explore semi-supervised learning methods that leverage the unlabeled data to improve the model's performance. Developing strategies to mitigate the impact of mislabeled instances and enhance the model's ability to generalize effectively will be crucial. We encourage creative ideas.

Data

Deadline

You can make submissions until 3:55 PM Friday (12 July).

Rules

  • Using pretrained models is not allowed.
  • Using additional training data apart from the data provided is not allowed.
  • Searching on the internet for the clean dataset or labels for the test set is not allowed.

Slides

  • Max 2 slides
  • Use Google Slides
  • Submit the link to your slides here