Learning and generalization of one-hidden-layer neural networks, going beyond standard Gaussian data

Li, Hongkang; Zhang, Shuai; Wang, Meng

Computer Science > Machine Learning

arXiv:2207.03615 (cs)

[Submitted on 7 Jul 2022 (v1), last revised 25 Jan 2023 (this version, v2)]

Title:Learning and generalization of one-hidden-layer neural networks, going beyond standard Gaussian data

Authors:Hongkang Li, Shuai Zhang, Meng Wang

View PDF

Abstract:This paper analyzes the convergence and generalization of training a one-hidden-layer neural network when the input features follow the Gaussian mixture model consisting of a finite number of Gaussian distributions. Assuming the labels are generated from a teacher model with an unknown ground truth weight, the learning problem is to estimate the underlying teacher model by minimizing a non-convex risk function over a student neural network. With a finite number of training samples, referred to the sample complexity, the iterations are proved to converge linearly to a critical point with guaranteed generalization error. In addition, for the first time, this paper characterizes the impact of the input distributions on the sample complexity and the learning rate.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2207.03615 [cs.LG]
	(or arXiv:2207.03615v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2207.03615

Submission history

From: Hongkang Li [view email]
[v1] Thu, 7 Jul 2022 23:27:44 UTC (91 KB)
[v2] Wed, 25 Jan 2023 20:28:09 UTC (10,076 KB)

Computer Science > Machine Learning

Title:Learning and generalization of one-hidden-layer neural networks, going beyond standard Gaussian data

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Computer Science > Machine Learning

Title:Learning and generalization of one-hidden-layer neural networks, going beyond standard Gaussian data

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.