当前位置:网站首页>Deep learning | three concepts: epoch, batch, iteration
Deep learning | three concepts: epoch, batch, iteration
2022-07-01 23:35:00 【Rihe Princess】

- Epoch( period ):
When a complete data set passes through the neural network once and returns once , This process is called a time >epoch.( in other words , All training samples In the neural network all the A forward propagation and A back propagation )
A little more general , One Epoch Namely Train all training samples once The process of .However , When one Epoch The sample of ( That's all the training samples ) The number may be too large ( For computers ), You need to break it into small pieces , That is, share Multiple Batch To train .**
Batch( batch / A batch of samples ):
Divide the whole training sample into several Batch.Batch_Size( Batch size ):
The size of each batch of samples .Iteration( One iteration ):
Train one Batch Just once Iteration( This concept is similar to iterators in programming languages ).
- Why use more than one epoch?
It is not enough to pass a complete data set in a neural network at one time , And we need to pass the complete data set many times in the same neural network . But remember , We use a limited set of data , And we use an iterative process called gradient descent to optimize the learning process . As shown in the figure below . So just update it once or use one epoch It's not enough. .
With epoch increase in numbers , The number of updates of weights in neural networks is also increasing , The curve changes from under fitting to over fitting .
that , The problem is coming. , How many? epoch That's the right thing to do ?
Unfortunately , There is no right answer to this question . For different data sets , The answer is different . But the diversity of data can affect the right epoch The number of . such as , Only the black cat dataset , And data sets of cats in all colors .
Conversion relation :

actually , gradient descent The fundamental difference between the above methods lies in the Batch_Size Different .

for instance :
边栏推荐
- Notes on problems - /usr/bin/perl is needed by mysql-server-5.1.73-1 glibc23.x86_ sixty-four
- What are the common types of points mall games?
- [swoole Series 1] what will you learn in the world of swoole?
- SQL optimization
- Three development trends of enterprise application from the perspective of the third technological revolution
- 第六章 数据流建模
- 物联网现状及未来发展趋势
- Airserver latest win64 bit personal screen projection software
- Redis AOF日志
- ARP message header format and request flow
猜你喜欢

Write some suggestions to current and future doctoral students to sort out and share

2021 robocom world robot developer competition - semi finals of higher vocational group

kubernetes资源对象介绍及常用命令(三)

Stm32f030f4 drives tim1637 nixie tube chip

Redis 主从同步

MySQL binlog cleanup

TS初次使用、ts类型

notBlank 和 notEmpty

Matplotlib常用设置

物联网技术应用属于什么专业分类
随机推荐
Redis RDB snapshot
上海炒股开户选择手机办理安全吗?
How to display real-time 2D map after rviz is opened
2021 robocom world robot developer competition - preliminary competition of undergraduate group
Redis数据类型和应用场景
Depth first search and breadth first search of graph traversal
What professional classification does the application of Internet of things technology belong to
2022年最佳智能家居开源系统:Alexa、Home Assistant、HomeKit生态系统介绍
Linux基础 —— CentOS7 离线安装 MySQL
Future trend and development of neural network Internet of things
openwrt 开启KV漫游
[LeetCode] 最后一个单词的长度【58】
Material Design组件 - 使用BottomSheet展现扩展内容(一)
共享电商的背后: 共创、共生、共享、共富,共赢的共富精神
Linux foundation - centos7 offline installation of MySQL
JPA handwritten SQL, received with user-defined entity classes
深度学习 | 三个概念:Epoch, Batch, Iteration
股票开户哪个证券公司最好,有安全保障吗
每日三题 6.28
Notblank and notempty

