Abstract

Deep learning has achieved unprecedented success in tasks ranging from natural language processing and computer vision to playing strategic games. Nevertheless, deep learning research is largely guided by empirical observation, and the successful deployment of deep learning technology often requires various heuristics and extensive hyperparameter tuning. In this project, we aim to develop rigorous theories to understand (and potentially address) various aspects of deep learning, including the trainability, generalization, and robustness of neural networks.

Publications

Guiding Neural Collapse: Optimising Towards the Nearest Simplex Equiangular Tight Frame.
Evan Markou, Thalaiyasingam Ajanthan, and Stephen Gould.
Neural Information Processing Systems (NeurIPS), December 2024.
[to appear] [bib]

Bidirectional Self-Normalizing Neural Networks.
Yao Lu, Stephen Gould, and Thalaiyasingam Ajanthan.
Neural Networks, August 2023.
[pdf] [arxiv] [talk] [bib]