Files
COGMOD-HWI/Project/tmp/writeup.md
T
2025-04-19 16:52:03 -04:00

5.3 KiB
Raw Blame History

Investigating the Impact of Training Algorithms and Data Augmentation on Network Generalization and Robustness

Authors: Itamar Oren-Naftalovich, Annabelle Choi

Date: [TODO lmao]


Abstract

In this paper we will look at the impact of various training algorithms and data augmentation techniques on the generalization and robustness of deep neural networks (DNNs). With a simple convolutional neural network (CNN) model trained on CIFAR-10, experimentally we compared the performance of two optimizers (SGD and Adam) under three augmentation strategies (none, standard, and aggressive). Strong main effects of both training algorithms and augmentation techniques were confirmed by our results but no significant interaction between the factors. These findings emphasize important considerations for optimizing network training in cognitive modeling and real-world applications.


1. Introduction

1.1 Background

Deep Neural Networks (DNNs) are the recent emphasis of cognitive process modeling due to their ability to learn high-level data representations. Standard training algorithms like Stochastic Gradient Descent (SGD) and Adam yield diverse impacts on learning efficacy, while data augmentation techniques are aimed at improving network generalization by artificially increasing dataset diversity.

1.2 Motivation

Understanding the impact of training algorithm choice and data augmentation methods on robustness (resistance to input perturbations) and generalization (novel data performance) is similar to basic questions in cognitive science regarding human learning and adaptability.

1.3 Research Question

"What are the impacts of modifying a neural network's training algorithm or data augmentation rule on its robustness and generalization abilities?"

1.4 Objectives

  • Compare convergence and robustness of different training algorithms.
  • Quantify the impact of various data augmentation methods on generalization.
  • Identify the optimal combinations for maximizing robustness and generalization.

2. Methods

2.1 Experimental Setup

Dataset

We used the CIFAR-10 dataset, which consists of 60,000 32×32 color images in 10 classes, a standard benchmark to evaluate model generalization and robustness.

Model Architecture

We employed a straightforward CNN architecture with two convolutional layers followed by pooling and fully-connected layers, appropriate for basic cognitive modeling and initial robustness testing.

2.2 Training Algorithms

We contrasted:

  • SGD: Stochastic Gradient Descent with momentum (0.9).
  • Adam: Adaptive moment estimation.

Both optimizers had a learning rate of 0.01.

2.3 Data Augmentation Strategies

We contrasted three augmentation regimes:

  • None: No augmentation.
  • Standard: Horizontal flips and random crops.
  • Aggressive: Baseline augmentations with rotation and color jitter.

2.4 Experimental Design

2 (optimizer) × 3 (augmentation) factorial design with three replicates per condition (random seeds: 42, 123, 999). Robustness was tested using Gaussian noise perturbations.

2.5 Implementation Environment

Experiments were run in Python with PyTorch and torchvision. Analyses were done with pandas, matplotlib, and statsmodels.


3. Results

3.1 Training Performance

SGD consistently had better test accuracies than Adam in augmentation conditions (see attached figures).

3.2 Robustness Analysis

Those trained with SGD were more resistant to varying noise levels than Adam, obviously under strong augmentation.

3.3 Statistical Analysis (ANOVA)

Two-way ANOVA:

  • Optimizer: Significant effect, F(1,12)=230.19, p<0.0001.
  • Augmentation: Significant effect, F(2,12)=12.46, p=0.0012.
  • Interaction: Not significant, F(2,12)=2.42, p=0.1305.

4. Discussion

4.1 Interpretation of Results

Optimizer choice had the greatest effect on model stability and accuracy, with SGD significantly outperforming Adam. Augmentation also had a significant effect on performance, affirming its application in improving generalization, but the lack of significant interaction suggests that augmentation gains are robust across optimizers.

4.2 Comparison with Literature

Our findings are in line with existing machine learning research, affirming that vanilla SGD with momentum generally outperforms adaptive methods like Adam in image classification. The clear benefit of augmentation also aligns with cognitive modeling views about considering varied exposure to improve generalization.

4.3 Limitations

Having fewer replicates per condition (3 seeds) can reduce statistical power to detect weak interactions. Future work should include more extensive replication as well as other forms of augmentation.

4.4 Future Directions

It would be desirable in future research to explore more complex models, additional datasets, and cognitive modeling-specific scenarios. Additionally, integrating adversarial robustness testing could add further insight.


5. Conclusion

We rigorously analyzed the impact of training algorithms and augmentation methods on CNN robustness and generalization comprehensively. Results indicate unambiguously that optimizer and augmentation choices significantly impact network performance, and this has significant implications for cognitive modeling and real-world deep learning deployments.