This is the repository for the homework projects of the Artificial Neural Networks and Deep Learning course, academic year 2024/2025, at Politecnico di Milano.
Subject: 054307 - Artificial Neural Networks And Deep Learning
Professors: Boracchi Giacomo and Matteucci Matteo
Academic Year: 2024/2025
The homework consists of two projects, each tackling a different task with neural networks:
- First Homework: Image Classification
- Second Homework: Semantic Segmentation
In this assignment, you will classify 96x96 RGB images of blood cells. These images are categorized into eight classes, each representing a particular cell state. This is a multi-class classification problem, so your goal is to assign the correct class label to each RGB image.
To enlarge the dataset and make the proposed neural network robust to variations in the data, data augmentation was applied.
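As an illustration, a minimal Keras augmentation sketch (the specific transforms and magnitudes here are assumptions; the exact pipeline is described in the report):

```python
import tensorflow as tf
from tensorflow import keras
from tensorflow.keras import layers

# Illustrative augmentation pipeline (transform choices are assumptions,
# not the project's exact configuration).
augmentation = keras.Sequential([
    layers.RandomFlip("horizontal_and_vertical"),
    layers.RandomRotation(0.1),
    layers.RandomZoom(0.1),
    layers.RandomTranslation(0.1, 0.1),
])

# Applied on the fly during training, e.g. inside a tf.data pipeline:
# train_ds = train_ds.map(lambda x, y: (augmentation(x, training=True), y))
```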
The implemented network is a ConvNeXtBase-based transfer learning model for multi-class image classification, featuring a pretrained backbone with custom fully connected layers to enhance feature representation and regularization.
ConvNeXtBase:
- Pretrained on ImageNet, used as a feature extractor.
- `include_top=False` excludes the original classification head.
- Frozen weights to preserve learned low- and mid-level features.

Global Average Pooling:
- Reduces spatial dimensions while retaining global feature information.

Batch Normalization:
- Stabilizes training and accelerates convergence.

Dropout Layers:
- Two stages of dropout (0.2 and 0.25) reduce overfitting.

Fully Connected (Dense) Layers:
- Four layers of decreasing size: 512 → 256 → 128 → 64 neurons.
- ReLU activation introduces non-linearity.
- Optional BatchNormalization (commented out) for further stabilization.

Dense Layer with Softmax:
- Produces class probabilities for `NUM_CLASSES`.

Loss Function:
- Categorical cross-entropy with label smoothing (0.1) to improve generalization.

Optimizer:
- AdamW with Cosine Decay Restarts for adaptive learning rate scheduling.

Metrics:
- Accuracy, Precision, and Recall for comprehensive evaluation.
- Pretrained ConvNeXtBase backbone: leverages rich, hierarchical features from ImageNet.
- Custom top layers: provide flexibility for different classification tasks.
- Dropout and BatchNorm: reduce overfitting and improve training stability.
- Mixed precision training: speeds up computation and reduces memory usage.
This combination makes the architecture robust, efficient, and suitable for high-performance image classification tasks.
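A minimal Keras sketch of this setup (exact layer placement, dropout positions, and schedule parameters are assumptions; the report has the exact configuration):

```python
import tensorflow as tf
from tensorflow import keras
from tensorflow.keras import layers

NUM_CLASSES = 8  # eight blood-cell classes

# Mixed precision can speed up training on supported GPUs:
# keras.mixed_precision.set_global_policy("mixed_float16")

# Frozen ConvNeXtBase backbone used as a feature extractor.
backbone = keras.applications.ConvNeXtBase(
    include_top=False,   # drop the original ImageNet classification head
    weights="imagenet",
    input_shape=(96, 96, 3),
)
backbone.trainable = False

inputs = keras.Input(shape=(96, 96, 3))
x = backbone(inputs, training=False)
x = layers.GlobalAveragePooling2D()(x)
x = layers.BatchNormalization()(x)
x = layers.Dropout(0.2)(x)
# Fully connected head of decreasing width.
for units in (512, 256, 128, 64):
    x = layers.Dense(units, activation="relu")(x)
x = layers.Dropout(0.25)(x)
outputs = layers.Dense(NUM_CLASSES, activation="softmax")(x)
model = keras.Model(inputs, outputs)

# AdamW with Cosine Decay Restarts; initial LR and decay steps are assumptions.
schedule = keras.optimizers.schedules.CosineDecayRestarts(
    initial_learning_rate=1e-3, first_decay_steps=1000
)
model.compile(
    optimizer=keras.optimizers.AdamW(learning_rate=schedule),
    loss=keras.losses.CategoricalCrossentropy(label_smoothing=0.1),
    metrics=["accuracy", keras.metrics.Precision(), keras.metrics.Recall()],
)
```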
The model was first trained on the dataset as is; transfer learning with fine tuning was then performed by unfreezing layers of the network, as sketched below.
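A sketch of the fine-tuning step, assuming the backbone is unfrozen and the model recompiled with a much lower learning rate (`train_ds` and `val_ds` are hypothetical dataset objects):

```python
# Unfreeze the backbone for fine tuning (whether all or only the top layers
# were unfrozen is an assumption; see the report for the exact setup).
backbone.trainable = True

# Recompile with a lower learning rate so the pretrained features are preserved.
model.compile(
    optimizer=keras.optimizers.AdamW(learning_rate=1e-5),
    loss=keras.losses.CategoricalCrossentropy(label_smoothing=0.1),
    metrics=["accuracy"],
)
model.fit(train_ds, validation_data=val_ds, epochs=10)
```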
A validation accuracy of 85.7% was obtained after fine tuning. Test-Time Augmentation (TTA) was also used to improve performance, as in the sketch below.
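A minimal TTA sketch, averaging softmax predictions over several augmented copies of each batch (the number of passes and the reuse of the training-time `augmentation` pipeline are assumptions):

```python
import numpy as np

def predict_with_tta(model, images, n_aug=8):
    """Average softmax predictions over randomly augmented copies of the batch."""
    preds = [model.predict(images, verbose=0)]  # prediction on the original images
    for _ in range(n_aug - 1):
        augmented = augmentation(images, training=True)  # reuse the augmentation pipeline
        preds.append(model.predict(augmented, verbose=0))
    return np.mean(preds, axis=0)
```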
For further details, check the report.
Some results of the trained networks for comparison:
| Model | Accuracy | Precision | Recall |
|---|---|---|---|
| MobileNet | 0.5769 | 0.689 | 0.121 |
| InceptionV3 | 0.658 | 0.7848 | 0.169 |
| EfficientNet | 0.7272 | 0.9765 | 0.253 |
| ConvNeXtBase | 0.857 | 0.9879 | 0.2776 |
| ConvNeXtLarge | 0.8248 | 0.9926 | 0.2643 |
In this assignment, you will receive real 64x128 grayscale images of Mars terrain. Pixels in these images are categorized into five classes, each representing a particular type of terrain. This is a semantic segmentation problem, so your goal is to assign the correct class label to each pixel of the mask.
Pretrained models are forbidden.
We used an Albumentations augmentation pipeline: the final dataset is the union of smaller datasets created through augmentation, which increases both the variability and the number of samples.
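A minimal Albumentations sketch (the listed transforms are illustrative assumptions; `image` and `mask` are a hypothetical sample pair). For segmentation, the same spatial transform must be applied to the image and its mask so the per-pixel labels stay aligned:

```python
import albumentations as A

# Example pipeline (illustrative transforms, not the project's exact configuration).
transform = A.Compose([
    A.HorizontalFlip(p=0.5),
    A.VerticalFlip(p=0.5),
    A.Rotate(limit=15, p=0.5),
    A.RandomBrightnessContrast(p=0.3),
])

# Albumentations applies the same spatial transform to image and mask together.
augmented = transform(image=image, mask=mask)
aug_image, aug_mask = augmented["image"], augmented["mask"]
```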
The implemented network is an enhanced U-Net for semantic segmentation, integrating several advanced deep learning modules to improve feature extraction, multi-scale context understanding, and attention.
- Conv Blocks & ResNet Blocks:
  - Four stages of convolutional processing with residual connections.
  - Each block increases feature depth (32 → 64 → 128 → 256).
  - Residual shortcuts stabilize training and allow deeper representation learning.
- Squeeze-and-Excitation (SE) Block:
  - Recalibrates channel-wise feature responses by modeling interdependencies between channels.
  - Helps the network focus on the most informative features.
- Multi-Scale ASPP:
  - Atrous Spatial Pyramid Pooling (ASPP) applied at multiple scales (`filters`, `filters/2`, `filters/4`).
  - Captures both local fine details and global context by combining dilated convolutions with different rates.
- Concatenation with skip features:
  - Deeper features are fused with intermediate representations (e.g., from `d3` and `d4`) for stronger context.
- 3×3 Conv Layer:
  - Further processing with 512 filters to unify features before decoding.
- Attention Gates:
  - Applied at each skip connection to highlight relevant features and suppress irrelevant background noise.
  - Gating signals come from ASPP-enhanced encoder outputs.
- Conv2DTranspose Layers (Deconvolutions):
  - Used for upsampling at each stage (256 → 128 → 64 → 32).
- Weighted Fusion:
  - Combines skip connections and upsampled features adaptively with learned importance weights.
- ResNet Blocks:
  - Each upsampling stage includes a residual block to refine features.
- Fusion Block:
  - Upsamples and unifies multi-scale outputs (`u1`, `u2`, `u3`, `u4`) into a single feature representation.
  - Uses weighted union to balance contributions from different scales.
- Output Layer:
  - Final 1×1 Conv2D layer with Softmax activation for multi-class classification (5 classes).
- ResNet Blocks: enable deep feature learning with stable gradients.
- Squeeze-and-Excitation: channel attention mechanism to boost informative filters.
- ASPP & Multi-Scale ASPP: powerful context aggregation at multiple dilation rates.
- Attention Gates: dynamic feature selection at skip connections.
- Fusion Block: combines multi-scale decoder outputs into a unified representation.
This combination makes the architecture robust, context-aware, and highly discriminative, ideal for structured visual tasks.
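Minimal Keras sketches of three of these modules, the SE block, a multi-rate ASPP, and an attention gate (layer sizes, dilation rates, and function names are illustrative assumptions, not the project's exact implementation):

```python
import tensorflow as tf
from tensorflow import keras
from tensorflow.keras import layers

def se_block(x, ratio=16):
    """Squeeze-and-Excitation: recalibrate channels via a small gating MLP."""
    channels = x.shape[-1]
    s = layers.GlobalAveragePooling2D()(x)               # squeeze: global channel statistics
    s = layers.Dense(channels // ratio, activation="relu")(s)
    s = layers.Dense(channels, activation="sigmoid")(s)  # excitation: per-channel weights
    s = layers.Reshape((1, 1, channels))(s)
    return layers.Multiply()([x, s])                     # rescale the feature map

def aspp(x, filters, rates=(1, 6, 12)):
    """Atrous Spatial Pyramid Pooling: parallel dilated convs, concatenated."""
    branches = [
        layers.Conv2D(filters, 3, padding="same", dilation_rate=r,
                      activation="relu")(x)
        for r in rates
    ]
    return layers.Conv2D(filters, 1, activation="relu")(layers.Concatenate()(branches))

def attention_gate(skip, gating, inter_channels):
    """Attention gate: use the gating signal to weight the skip features."""
    theta = layers.Conv2D(inter_channels, 1)(skip)
    phi = layers.Conv2D(inter_channels, 1)(gating)
    # Spatial sizes of skip and gating are assumed equal here for simplicity.
    a = layers.Activation("relu")(layers.Add()([theta, phi]))
    a = layers.Conv2D(1, 1, activation="sigmoid")(a)     # per-pixel attention coefficients
    return layers.Multiply()([skip, a])
```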
We used the mean IoU (Intersection over Union) as the metric to evaluate the results.
Categorical Crossentropy is the loss function.
Class weights are applied in the loss to give more importance to under-represented classes, as sketched below.
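A sketch of one way to implement the class-weighted loss, assuming per-class weights are multiplied into the per-pixel cross-entropy (the weight values and exact scheme are assumptions):

```python
import tensorflow as tf

def weighted_categorical_crossentropy(class_weights):
    """Per-pixel cross-entropy scaled by the weight of each pixel's true class."""
    weights = tf.constant(class_weights, dtype=tf.float32)

    def loss(y_true, y_pred):
        # y_true: one-hot masks (batch, H, W, 5); y_pred: softmax outputs.
        y_pred = tf.clip_by_value(y_pred, 1e-7, 1.0)
        per_pixel = -tf.reduce_sum(y_true * tf.math.log(y_pred), axis=-1)
        pixel_weights = tf.reduce_sum(y_true * weights, axis=-1)
        return tf.reduce_mean(per_pixel * pixel_weights)

    return loss

# Example usage with assumed weights (higher weight for rarer classes):
# model.compile(optimizer="adam",
#               loss=weighted_categorical_crossentropy([1.0, 2.0, 1.5, 3.0, 2.5]),
#               metrics=[tf.keras.metrics.OneHotMeanIoU(num_classes=5)])
```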
The final validation mean IoU is 72.95%.
We also used Test-Time Augmentation to improve the final result.
For further details, check the report.
Some results for different loss functions:
| Loss Function | mean IoU | Accuracy |
|---|---|---|
| Dice Loss | 0.6467 | 0.653 |
| Tversky Loss | 0.6555 | 0.6354 |
| Focal Loss | 0.682 | 0.674 |
| Categorical Cross-entropy | 0.7295 | 0.6959 |
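For reference, a minimal sketch of one of the compared losses, the soft Dice loss (the smoothing constant is an assumption):

```python
import tensorflow as tf

def dice_loss(y_true, y_pred, smooth=1e-6):
    """Soft Dice loss over one-hot masks: 1 - mean per-class Dice coefficient."""
    axes = (0, 1, 2)  # sum over batch and spatial dimensions, keep classes
    intersection = tf.reduce_sum(y_true * y_pred, axis=axes)
    union = tf.reduce_sum(y_true, axis=axes) + tf.reduce_sum(y_pred, axis=axes)
    dice = (2.0 * intersection + smooth) / (union + smooth)
    return 1.0 - tf.reduce_mean(dice)
```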
First Homework Final Position:
- Private Leaderboard (Final phase): Top 20 out of 700+ participants
Second Homework Final Position:
- Public Leaderboard (intermediate position): 63/197 groups
- Private Leaderboard (Final Position): 9/197
This project was developed by:
- Corradina Davide @CorraPiano
- De Introna Federico @federicodeintrona
- Di Giore Francesco @Digioref