当前位置:网站首页>Deep learning -- CNN clothing image classification, for example, discussed how to evaluate neural network model

Deep learning -- CNN clothing image classification, for example, discussed how to evaluate neural network model

2022-08-04 03:35:00 Peace of mind

  活动地址:CSDN21天学习挑战赛

目录

How to evaluate neural networks

        利用keras中的history对象

history.history字典的键值

根据key值对应的valueDraw and analyze the network model

利用model.evaluate评估

实例

彩色图片分类

数据导入

数据预处理 ——图片归一化

输出数据集大小,Adjust the dataset format

构建网络模型

编译网络模型

训练网络模型

Evaluate network models

Predict the test value

服装分类

数据导入

数据预处理——图像归一化

数据格式调整

构建网络模型

Compile and train the network model 

Evaluate network models

预测测试集


How to evaluate neural networks

利用keras中的history对象

keras中的类History对象有两个属性 history 和 epoch

history.epoch返回一个列表,列表的值是 0 ~ specified during trainingepoch{\color{Red} -1}

history.history返回一个字典,字典有4个键值key,分别是accuracy、val_accuracy、loss和val_loss,每个key的valueis a list,列表的长度=history.epoch的长度=Specified during model trainingepoch数

Here we mainly use history.history的4个键值,下面详细介绍

history.history字典的键值

First, let's introduce the training settrain_set和测试集test_set (There is actually a validation set that is not much usedvalidation set)

The training set is the dataset that participates in the training of the network model

The test set is the dataset on which the performance of the network model is evaluated,Generally do not participate in the training of the network model

The validation set is the dataset used to tune the network hyperparameters(用的不多)

Generally speaking, the ratio of training set and test set in the whole dataset is 7:3,The proportion when there is a validation set is 6:2:2

accuracy和lossBoth describe the training set

  • accuracyeach round of training,训练集的准确度
  • losseach round of training,The degree of loss in the training set

val_accuracy和val_lossBoth describe the validation set

But the validation set is generally not very useful,Usually, the test set is used instead of the validation set,So there is a training modelfit函数内的validation_dataThe parameter passed is the test set,Therefore, these two objects can also represent the test set

  • val_accuracyeach round of training,验证集的准确度
  • val_losseach round of training,The degree of loss on the validation set

根据key值对应的valueDraw and analyze the network model

这4The value corresponding to each key happens to be obtained each trainingloss、accuracy、val_loss、val_accuracy,如下图

obtained in each round of trainingloss、accuracy、val_loss、val_accuracy是通过The calculation method specified when compiling the network model计算的:

 可以利用 matplotlib.pyplot Library draws this4line changes,具体代码如下:

plt.plot(history.history['accuracy'], label='accuracy')  # 训练集准确度
plt.plot(history.history['val_accuracy'], label='val_accuracy ')  # 验证集准确度
plt.plot(history.history['loss'], label='loss')  # The degree of training set loss
plt.plot(history.history['val_loss'], label='val_loss')  # Validation set loss level
plt.xlabel('Epoch')  # 训练轮数
plt.ylabel('value')  # 值
plt.ylim([0, 1])  #  yThe axes represent ranges
plt.legend(loc='lower left')  # 图例位置
plt.show()

下面是示例图:

 Analysis of the overall change trend of the polyline

loss 

val_loss

网络情况
逐渐下降逐渐下降

 网络模型训练正常,这是最好的情况.

逐渐下降趋于不变/逐渐上升

网络过拟合.

Overfitting means that the network model can better represent the data of the training set,But the generalization ability is poor,for other datasets of the same species(比如测试集)效果很差

比如下图,The model trained is the red line,The error sum with the actual data in the training set is very small,

The blue line is another fitted curve,相比误差较大

来自知乎  过拟合(overfitting)with solutions

 但是,for some non-training data,The following picture may exist

 Overfitting is mostly Fewer datasets or more complex models引起的

 解决方法有:

  •  Add dataset data:

(1)Interpolation adds data

(2)The data is created by methods such as manual shooting

(3)Rotate an existing dataset、Processing such as cutting increases the data set

  • Reduce model complexity:Try to be able to determine the complexity with physical mathematical modeling       

1)Reduce the number of feature extraction layers in the neural network,例如CNNNeural networks reduce the number of convolutional layers

(2)Reduce the number of training epochs for the model,Stop training early at a suitable location

(3)正则化:惩罚力度

 weights can be applied、输出、on the bias parameter,Here are the weights imposed on them

  通常采用的方法有L1,L2两种

  L1:E_{in}+\lambda \sum_{j}^{}\left | w_j\right |     L2: E_in + \lambda \sum_{j}^{} w_j^{2}

 代码实现:

# 可以在Dense、Conv2D函数中 加kernel_regularizerRegularizer parameter
layers.Conv2D(32, (3, 3), activation='relu', input_shape=(28, 28, 1),kernel_regularizer=0.01)
layers.Dense(64, activation='relu',kernel_regularizer=0.01)

(4)Drop-out:在训练阶段,A portion of neurons is randomly selected to be unused during each training session,减少复杂度.

        实现:在Flattern层后加Drop-outLayer setting inactivation ratio

layers.Dropout(rate=.数字) # 0~1 1会报错
趋于不变逐渐下降

The dataset has serious problems,It is recommended to reselect the dataset.一般不会出现这种情况.

趋于不变趋于不变

根据数据集大小有以下两种情况:

If the dataset is large,Represents a bottleneck in the learning process,需要减小学习率.Next, consider modification batchsize 大小,在input_shape中修改

data_format=channels_last默认传(batchsize一般不设置,height,width,channles).

If the dataset is very small,训练稳定,不做修改.

逐渐上升逐渐上升

May be the network structure design、训练参数、datasets, etc.This case is the worst case during training,Have to troubleshoot one by one.

利用model.evaluate评估

The function accepts test set data,output the test setloss和accuray值(accuracy只有compile时指定metrics才会有)

实例

彩色图片分类

数据导入

import tensorflow as tf
from keras import datasets, models, layers  # keras
import matplotlib.pyplot as plt
#导入数据集
(train_images, train_labels), (test_images, test_labels) =tf.keras.datasets.cifar10.load_data()  # datasetsThe color image dataset is integrated internally,

数据预处理 ——图片归一化

# 归一化
# 将像素的值标准化至0到1的区间内,rgb像素值 0~255 0为黑 1为白
train_images, test_images = train_images / 255.0, test_images / 255.0

输出数据集大小,Adjust the dataset format

print(train_images.size)  # 153600000
print(test_images.size)  # 30720000
# 换算
# Color description3通道 32*32的图片
train_images = train_images.reshape((-1, 32, 32, 3))
test_images = test_images.reshape((-1, 32, 32, 3))

构建网络模型

经过多次测试,From two-layer convolution pooling to three-layer convolution pooling to two-layer convolution pooling+Dropoutto three layers of convolutional poolingDropout,Evaluate the model multiple times,The best case is convolutional pooling in three layers+Dropout(0.3)

代码如下:

# 构建CNN网络模型
model = models.Sequential([  # 采用Sequential 顺序模型
    layers.Conv2D(32, (3, 3), activation='relu', input_shape=(32, 32, 3)),
    # 卷积层1,卷积核个数32,卷积核3*3*3 relu激活去除负值保留正值,输入是32*32*3
    layers.MaxPooling2D((2, 2)),  # 池化层1,2*2采样

    layers.Conv2D(64, (3, 3), activation='relu'),
    # 卷积层2,卷积核64个,卷积核3*3*3,relu激活去除负值保留正值
    layers.MaxPooling2D((2, 2)),  # 池化层2,2*2采样

    layers.Conv2D(64, (3, 3), activation='relu'),
    # 卷积层2,卷积核64个,卷积核3*3*3,relu激活去除负值保留正值
    layers.MaxPooling2D((2, 2)),  # 池化层2,2*2采样

    layers.Flatten(),  # Flatten层,连接卷积层与全连接层
    layers.Dropout(0.3),# 设置Dropout层
    layers.Dense(64, activation='relu'),  # 全连接层,64张特征图,特征进一步提取
    layers.Dense(10)  # 输出层,输出预期结果
])
# 打印网络结构
model.summary()

编译网络模型

# 编译
model.compile(optimizer='adam',  # 优化器
              loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),
              # 设置损失函数from_logits: 为True时,会将y_pred转化为概率
              metrics=['accuracy'])

训练网络模型

# 训练
# epochs为训练轮数
history = model.fit(train_images, train_labels, epochs=10,
                    validation_data=(test_images, test_labels))

Evaluate network models

绘制loss、val_loss、accuracy、val_accuracy折线图

# 评价
plt.plot(history.history['accuracy'], label='accuracy')  # 精确度
plt.plot(history.history['val_accuracy'], label='val_accuracy ')
plt.plot(history.history['loss'], label='loss')
plt.plot(history.history['val_loss'], label='val_loss')
plt.xlabel('Epoch')  # 循环次数
plt.ylabel('value')
plt.ylim([0, 1.5])
plt.legend(loc='lower right')
plt.show()

test_loss, test_acc = model.evaluate(test_images, test_labels, verbose=1)  # loss函数的值和 metrics值
# 输入数据和标签,Output loss value and selected metric value(如精确度accuracy)
print(test_loss, test_acc)
# 313/313 [==============================] - 1s 2ms/step - loss: 0.8412 - accuracy: 0.7079
# 0.8412009477615356 0.7078999876976013

val_loss与lossgradually decreased,belongs to the best case

Predict the test value

# 绘制测试集图片
class_names = ['airplane', 'automobile', 'bird', 'cat', 'deer', 'dog', 'frog', 'horse', 'ship', 'truck']
plt.figure(figsize=(20, 10))  # 这里只看20张,实际上并不需要可视化图片这一步骤
for i in range(20):
    plt.subplot(5, 10, i + 1)
    plt.xticks([])
    plt.yticks([])
    plt.grid(False)
    plt.imshow(test_images[i], cmap=plt.cm.binary)
    plt.xlabel(class_names[test_labels[i][0]])
plt.show()

# 预测
pre = model.predict(test_images)  # 对所有测试图片进行预测
for x in range(5):
    print(pre[x])
for x in range(5):
    print(class_names[np.array(pre[x]).argmax()])

 运行输出

 

The conclusion is consistent 

服装分类

数据导入

# 导入数据集
(train_images, train_labels), (test_images, test_labels) = tf.keras.datasets.fashion_mnist.load_data()  # datasets内部集成了MNIST数据集

数据预处理——图像归一化

# 归一化
# 将像素的值标准化至0到1的区间内,rgb像素值 0~255 0为黑 1为白
train_images, test_images = train_images / 255.0, test_images / 255.0

数据格式调整

# The format of the picture here is28*28,通道数为1
train_images = train_images.reshape((-1, 28, 28, 1))
test_images = test_images.reshape((-1, 28, 28, 1))

构建网络模型

 Edge padding is used here0,3层卷积池化

# 构建CNN网络模型
model = models.Sequential([  # 采用Sequential 顺序模型
    layers.Conv2D(32, (3, 3), activation='relu', input_shape=(28, 28, 1),padding='same'),
    # 卷积层1,卷积核个数32,卷积核3*3*1 relu激活去除负值保留正值,输入是28*28*1
    layers.MaxPooling2D((2, 2)),  # 池化层1,2*2采样

    layers.Conv2D(64, (3, 3), activation='relu'),
    # 卷积层2,卷积核64个,卷积核3*3,relu激活去除负值保留正值
    layers.MaxPooling2D((2, 2)),  # 池化层2,2*2采样

    layers.Conv2D(64, (3, 3), activation='relu'),
    # 卷积层3,卷积核64个,卷积核3*3,relu激活去除负值保留正值
    layers.MaxPooling2D((2, 2)),  # 池化层2,2*2采样

    layers.Flatten(),  # Flatten层,连接卷积层与全连接层
    layers.Dropout(0.3),
    layers.Dense(64, activation='relu'),  # 全连接层,64张特征图,特征进一步提取
    layers.Dense(10)  # 输出层,输出预期结果
])
# 打印网络结构
model.summary()

Compile and train the network model 

# 编译
model.compile(optimizer='adam',  # 优化器
              loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),
              # 设置损失函数from_logits: 为True时,会将y_pred转化为概率
              metrics=['accuracy'])

# 训练
# epochs为训练轮数
history = model.fit(train_images, train_labels, epochs=10,
                    validation_data=(test_images, test_labels))

Evaluate network models

# 评价
plt.plot(history.history['accuracy'], label='accuracy')  # 训练集准确度
plt.plot(history.history['val_accuracy'], label='val_accuracy ')  # 验证集准确度
plt.plot(history.history['loss'], label='loss')  # The degree of training set loss
plt.plot(history.history['val_loss'], label='val_loss')  # Validation set loss level
plt.xlabel('Epoch')  # 训练轮数
plt.ylabel('value')  # 值
plt.ylim([0, 1])
plt.legend(loc='lower left')  # 图例位置
plt.show()

test_loss, test_acc = model.evaluate(test_images, test_labels, verbose=1)  # loss函数的值和 metrics值
# 输入数据和标签,Output loss value and selected metric value(如精确度accuracy)
print(test_loss, test_acc)

 观察得到,loss和val_lossgradually decreased,是最好的情况

预测测试集

# 绘制测试集图片
class_names = ['T-shirt/top', 'Trouser', 'Pullover', 'Dress', 'Coat',
               'Sandal', 'Shirt', 'Sneaker', 'Bag', 'Ankle boot']
plt.figure(figsize=(20, 10))  # 这里只看20张,实际上并不需要可视化图片这一步骤
for i in range(20):
    plt.subplot(5, 10, i + 1)
    plt.xticks([])
    plt.yticks([])
    plt.grid(False)
    plt.imshow(test_images[i], cmap=plt.cm.binary)
    plt.xlabel(class_names[test_labels[i]])
plt.show()

# 预测
pre = model.predict(test_images)  # 对所有测试图片进行预测
for x in range(5):
    print(pre[x])
for x in range(5):
    print(class_names[np.array(pre[x]).argmax()])

 

原网站

版权声明
本文为[Peace of mind]所创,转载请带上原文链接,感谢
https://yzsam.com/2022/216/202208040309089519.html