当前位置:网站首页>Deep learning | three concepts: epoch, batch, iteration
Deep learning | three concepts: epoch, batch, iteration
2022-07-01 23:35:00 【Rihe Princess】

- Epoch( period ):
When a complete data set passes through the neural network once and returns once , This process is called a time >epoch.( in other words , All training samples In the neural network all the A forward propagation and A back propagation )
A little more general , One Epoch Namely Train all training samples once The process of .However , When one Epoch The sample of ( That's all the training samples ) The number may be too large ( For computers ), You need to break it into small pieces , That is, share Multiple Batch To train .**
Batch( batch / A batch of samples ):
Divide the whole training sample into several Batch.Batch_Size( Batch size ):
The size of each batch of samples .Iteration( One iteration ):
Train one Batch Just once Iteration( This concept is similar to iterators in programming languages ).
- Why use more than one epoch?
It is not enough to pass a complete data set in a neural network at one time , And we need to pass the complete data set many times in the same neural network . But remember , We use a limited set of data , And we use an iterative process called gradient descent to optimize the learning process . As shown in the figure below . So just update it once or use one epoch It's not enough. .
With epoch increase in numbers , The number of updates of weights in neural networks is also increasing , The curve changes from under fitting to over fitting .
that , The problem is coming. , How many? epoch That's the right thing to do ?
Unfortunately , There is no right answer to this question . For different data sets , The answer is different . But the diversity of data can affect the right epoch The number of . such as , Only the black cat dataset , And data sets of cats in all colors .
Conversion relation :

actually , gradient descent The fundamental difference between the above methods lies in the Batch_Size Different .

for instance :
边栏推荐
- 写给当前及未来博士研究生一些建议整理分享
- 为什么PHP叫超文本预处理器
- cookie、session、tooken
- Distance measurement - Hamming distance
- 硅谷产品实战学习感触
- sql 优化
- algolia 搜索需求,做的快自闭了...
- 2021 RoboCom 世界机器人开发者大赛-本科组初赛
- from pip._internal.cli.main import main ModuleNotFoundError: No module named ‘pip‘
- The difference between timer and scheduledthreadpoolexecutor
猜你喜欢

Matplotlib common charts

Current situation and future development trend of Internet of things
![[must] bm41 output the right view of the binary tree [medium +]](/img/a5/00b2f0df5ab448665a2b062d145e52.png)
[must] bm41 output the right view of the binary tree [medium +]

安全协议重点

物联网现状及未来发展趋势

2021 RoboCom 世界机器人开发者大赛-本科组初赛

神经网络物联网的发展趋势和未来方向

【.Net Core】程序相关各种全局文件

2022 examination questions and online simulation examination for safety management personnel of hazardous chemical business units

Write some suggestions to current and future doctoral students to sort out and share
随机推荐
【C#】依赖注入及Autofac
excel如何打开100万行以上的csv文件
mysql:insert ignore、insert和replace区别
2021 robocom world robot developer competition - preliminary competition of undergraduate group
ShanDong Multi-University Training #3
Daily three questions 6.28
TS初次使用、ts类型
每日三题 6.30(2)
Distance measurement - Hamming distance
使用uni-simple-router,动态传参 TypeError: Cannot convert undefined or null to object
Redis RDB快照
SWT/ANR问题--SWT 导致 low memory killer(LMK)
Paramètres communs de matplotlib
Development trend and future direction of neural network Internet of things
MT manager test skiing Adventure
The essence of software architecture
【ES实战】ES上的安全性运行方式
云信小课堂 | IM及音视频中常见的认知误区
Practical application and extension of plain framework
jpa手写sql,用自定义实体类接收

