Deep learning | three concepts: epoch, batch, iteration
2022-07-01 23:35:00 【Rihe Princess】

- Epoch:
One epoch is the process in which the complete data set passes forward through the neural network once and back once. (In other words, every training sample has gone through one forward propagation and one back propagation in the network.)
Put more simply, one Epoch is the process of training on all training samples once. However, the samples in one Epoch (that is, all the training samples) may be too many for the computer to handle at once, so they need to be broken into smaller pieces, that is, split into multiple Batches for training.
- Batch (a batch of samples):
The whole training set is divided into several Batches.
- Batch_Size (batch size):
The number of samples in each batch.
- Iteration (one iteration):
Training on one Batch is one Iteration (the concept is similar to an iterator in a programming language).
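The relationship between the three terms can be sketched in a few lines of plain Python. This is only an illustration, not code from the original post: the helper names `iterate_batches` and `count_iterations`, the 10-sample data set, and the batch size of 4 are all assumptions made for the example, and the actual training step is left as a comment.

```python
def iterate_batches(samples, batch_size):
    """Yield consecutive Batches; the last one may be smaller than batch_size."""
    for start in range(0, len(samples), batch_size):
        yield samples[start:start + batch_size]


def count_iterations(samples, batch_size, num_epochs):
    """Count how many Iterations (one per Batch) happen over num_epochs Epochs."""
    iterations = 0
    for _epoch in range(num_epochs):          # one Epoch = one pass over all samples
        for _batch in iterate_batches(samples, batch_size):
            # A real training step (forward pass, backward pass, weight update)
            # would go here; this sketch only counts the steps.
            iterations += 1                   # training on one Batch = one Iteration
    return iterations


samples = list(range(10))                     # hypothetical data set of 10 samples
batch_lengths = [len(b) for b in iterate_batches(samples, batch_size=4)]
total = count_iterations(samples, batch_size=4, num_epochs=3)
print(batch_lengths, total)
```

With these assumed numbers, each Epoch is split into three Batches (the last one holds only 2 samples), so three Epochs amount to nine Iterations.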
- Why use more than one epoch?
Passing the complete data set through a neural network once is not enough; we need to pass the complete data set through the same network many times. Keep in mind that we are working with a limited data set, and that we optimize learning with gradient descent, which is an iterative process. So updating the weights once, or using just one epoch, is not enough.
As the number of epochs increases, the number of weight updates in the neural network also increases, and the model moves from underfitting toward overfitting.
So the question is: how many epochs is the right number?
Unfortunately, there is no single right answer to this question. The answer differs from one data set to another, but the diversity of the data does affect the appropriate number of epochs. Compare, for example, a data set containing only black cats with a data set containing cats of all colors.
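Conversion relation: the number of Iterations in one Epoch equals the number of training samples divided by Batch_Size (rounded up when the division is not exact).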

In fact, the fundamental difference between gradient descent methods lies in their Batch_Size.
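As a rough sketch of that point, the three commonly described variants (batch, mini-batch, and stochastic gradient descent) can be expressed as nothing more than different choices of Batch_Size. The variant labels, the helper names, the default mini-batch size of 32, and the sample count of 1000 are assumptions for illustration and do not come from the original post.

```python
import math


def batch_size_for(variant, num_samples, mini_batch_size=32):
    """Return the Batch_Size implied by each variant (labels are illustrative)."""
    if variant == "batch":
        return num_samples        # the whole training set in a single Batch
    if variant == "mini-batch":
        return mini_batch_size    # a fixed number of samples per Batch
    if variant == "stochastic":
        return 1                  # one sample per Batch
    raise ValueError(f"unknown variant: {variant}")


def iterations_per_epoch(num_samples, batch_size):
    """Number of Batches (and therefore Iterations) needed to cover all samples once."""
    return math.ceil(num_samples / batch_size)


num_samples = 1000                # hypothetical training set size
summary = {}
for variant in ("batch", "mini-batch", "stochastic"):
    size = batch_size_for(variant, num_samples)
    summary[variant] = (size, iterations_per_epoch(num_samples, size))
    print(variant, "Batch_Size =", size, "Iterations per Epoch =", summary[variant][1])
```

Under these assumptions, the larger the Batch_Size, the fewer weight updates take place in each Epoch.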

For instance:
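A worked example with made-up numbers (they are assumptions, not figures from the original post): suppose the training set has N = 2000 samples, the Batch_Size is B = 500, and training runs for 10 Epochs.

```latex
\text{Iterations per Epoch} = \left\lceil \frac{N}{B} \right\rceil
                            = \left\lceil \frac{2000}{500} \right\rceil = 4,
\qquad
\text{total Iterations} = 10 \times 4 = 40
```

So each Epoch consists of 4 Batches of 500 samples, and the weights are updated 40 times over the whole run.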