当前位置:网站首页>深度学习Course2第一周Practical aspects of Deep Learning习题整理
深度学习Course2第一周Practical aspects of Deep Learning习题整理
2022-08-01 21:59:00 【l8947943】
Practical aspects of Deep Learning
- If you have 10,000,000 examples, how would you split the train/dev/test set?
- 33% train. 33% dev. 33% test
- 60% train. 20% dev. 20% test
- 98% train. 1% dev. 1% test
- When designing a neural network to detect if a house cat is present in the picture, 500,000 pictures of cats were taken by their owners. These are used to make the training, dev and test sets. It is decided that to increase the size of the test set, 10,000 new images of cats taken from security cameras are going to be used in the test set. Which of the following is true?
- This will increase the bias of the model so the new images shouldn’t be used.
- This will be harmful to the project since now dev and test sets have different distributions.
- This will reduce the bias of the model and help improve it.
- If your Neural Network model seems to have high variance, what of the following would be promising things to try?
- Make the Neural Network deeper
- Get more training data
- Add regularization
- Get more test data
- Increase the number of units in each hidden layer
- You are working on an automated check-out kiosk for a supermarket, and are building a classifier for apples, bananas and oranges. Suppose your classifier obtains a training set error of 0.5%, and a dev set error of 7%. Which of the following are promising things to try to improve your classifier? (Check all that apply.)
- Increase the regularization parameter lambda
- Decrease the regularization parameter lambda
- Get more training data
- Use a bigger neural network
- In every case it is a good practice to use dropout when training a deep neural network because it can help to prevent overfitting. True/False?
- True
- False
- The regularization hyperparameter must be set to zero during testing to avoid getting random results. True/False?
- True
- False
- With the inverted dropout technique, at test time:
- You apply dropout (randomly eliminating units) but keep the 1/keep_prob factor in the calculations used in training.
- You do not apply dropout (do not randomly eliminate units), but keep the 1/keep_prob factor in the calculations used in training.
- You apply dropout (randomly eliminating units) and do not keep the 1/keep_prob factor in the calculations used in training
- You do not apply dropout (do not randomly eliminate units) and do not keep the 1/keep_prob factor in the calculations used in training
- Increasing the parameter keep_prob from (say) 0.5 to 0.6 will likely cause the following: (Check the two that apply)
- Increasing the regularization effect
- Reducing the regularization effect
- Causing the neural network to end up with a higher training set error
- Causing the neural network to end up with a lower training set error
- Which of the following actions increase the regularization of a model? (Check all that apply)
- Decrease the value of the hyperparameter lambda.
- Decrease the value of keep_prob in dropout.
Correct. When decreasing the keep_prob value, the probability that a node gets discarded during training is higher, thus reducing the regularization effect. - Increase the value of the hyperparameter lambda.
Correct. When increasing the hyperparameter lambda, we increase the effect of the L_2 penalization. - Increase the value of keep_prob in dropout.
- Use Xavier initialization.
- Which of the following is the correct expression to normalize the input x ? \mathbf{x}? x?
- x = x − μ σ x = \frac{x-\mu }{\sigma } x=σx−μ
边栏推荐
猜你喜欢
用户体验 | 如何度量用户体验?
教你VSCode如何快速对齐代码、格式化代码
【开源】Sentinel高性能高可用集群限流解决方案
365 days challenge LeetCode1000 questions - Day 046 Generate a string with odd number of each character + add two numbers + valid parentheses
Flutter基础学习(一)Dart语言入门
高等代数_证明_矩阵的行列式为特征值之积, 矩阵的迹为特征值之和
迁移学习——Discriminative Transfer Subspace Learning via Low-Rank and Sparse Representation
Based on php Xiangxi tourism website management system acquisition (php graduation design)
The difference between groupByKey and reduceBykey
如何防范 DAO 中的治理攻击?
随机推荐
【牛客刷题-SQL大厂面试真题】NO4.出行场景(某滴打车)
Analysis of the development trend of game metaverse
统计单词数
Lecture 3: Several common table field data types in MySQL database
小程序中的多表联合查询
【开源】Sentinel高性能高可用集群限流解决方案
Spark cluster construction
如何理解 new (...args: any[]) => any
Anacoda的用途
Unity Shader 常规光照模型代码整理
教你VSCode如何快速对齐代码、格式化代码
[@synthesize in Objective-C]
HCIP---Architecture of Enterprise Network
Small program -- subcontracting
MySQL related knowledge
自建 Prometheus 采集腾讯云容器服务监控数据最佳实践
dvwa 通关记录1 - 暴力破解 Brute Force
2022-08-01 第八组 曹雨 泛型 枚举
【C语言实现】两种计算平均成绩题型,博主精心整理,值得一读
【C语言实现】求两个整数的较大值