Keras (latest): "Building image classification models on small datasets"

Posted by 阿新 • Published: 2017-12-11


Original article: http://blog.keras.io/building-powerful-image-classification-models-using-very-little-data.html

Author of the original article: Francois Chollet

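Following the official article step by step, I ran into several pitfalls. This post works through the implementation details of the code and how the Keras API is actually used.
Many people have translated the article, but some translations leave out the concrete implementation details.
The Keras developer also publishes Jupyter notebooks for his book: Companion Jupyter notebooks for the book "Deep Learning with Python"
In my own run of experiment 3, the converged accuracy did not reach 0.94+; the implementation in that book is a useful reference.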

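The article contains three experiments:
1. The first experiment trains a custom neural network on the dataset (three convolutional layers followed by two fully connected layers), then validates its accuracy.
2. The second experiment uses the VGG16 network. To fit the custom dataset, the fully connected layers of VGG16 are removed (the author calls this "feature extraction"), a custom fully connected classifier is added on top, and that classifier is trained and validated.
3. The third experiment is called "fine-tuning": starting from the model and weights of the second experiment, it retrains the last convolutional block of VGG16 together with the custom fully connected layers, then validates the accuracy.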
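
The difference between experiments 2 and 3 comes down to which layers have their weights updated. The sketch below illustrates that split in plain Python, using the VGG16 layer names that appear in the layer listing of the result section. The helper `trainable_layers` and the name `top_model` (standing for the custom classifier) are illustrative assumptions, not part of Keras.

```python
# Layer names of the VGG16 convolutional base, in index order (0..18).
VGG16_CONV_BASE = [
    "input_1",
    "block1_conv1", "block1_conv2", "block1_pool",
    "block2_conv1", "block2_conv2", "block2_pool",
    "block3_conv1", "block3_conv2", "block3_conv3", "block3_pool",
    "block4_conv1", "block4_conv2", "block4_conv3", "block4_pool",
    "block5_conv1", "block5_conv2", "block5_conv3", "block5_pool",
]


def trainable_layers(experiment):
    """Return the names of the layers left trainable in an experiment.

    Hypothetical helper for illustration only. "top_model" stands for
    the custom Flatten/Dense/Dropout/Dense classifier.
    """
    if experiment == 2:
        # Feature extraction: the whole convolutional base stays fixed.
        frozen = len(VGG16_CONV_BASE)
    elif experiment == 3:
        # Fine-tuning: freeze the first 15 layers (through block4_pool).
        frozen = 15
    else:
        raise ValueError("only experiments 2 and 3 reuse VGG16")
    # block5_pool is listed as trainable but has no weights of its own.
    return VGG16_CONV_BASE[frozen:] + ["top_model"]


print(trainable_layers(2))
print(trainable_layers(3))
```

In experiment 2 only the classifier learns, so the VGG16 outputs can be computed once and cached. In experiment 3 block5 changes on every step, so the images have to pass through the full network during training.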

Code for experiment 2:

'''This script goes along the blog post
"Building powerful image classification models using very little data"
from blog.keras.io.
It uses data that can be downloaded at:
https://www.kaggle.com/c/dogs-vs-cats/data
In our setup, we:
- created a data/ folder
- created train/ and validation/ subfolders inside data/
- created cats/ and dogs/ subfolders inside train/ and validation/
- put the cat pictures index 0-999 in data/train/cats
- put the cat pictures index 1000-1400 in data/validation/cats
- put the dogs pictures index 12500-13499 in data/train/dogs
- put the dog pictures index 13500-13900 in data/validation/dogs
So that we have 1000 training examples for each class, and 400 validation examples for each class.
In summary, this is our directory structure:
```
data/
    train/
        dogs/
            dog001.jpg
            dog002.jpg
            ...
        cats/
            cat001.jpg
            cat002.jpg
            ...
    validation/
        dogs/
            dog001.jpg
            dog002.jpg
            ...
        cats/
            cat001.jpg
            cat002.jpg
            ...
```
'''
import numpy as np
from keras.preprocessing.image import ImageDataGenerator
from keras.models import Sequential
from keras.layers import Dropout, Flatten, Dense
from keras import applications

# dimensions of our images.
img_width, img_height = 150, 150

top_model_weights_path = 'bottleneck_fc_model.h5'

# root of the dataset on the author's machine; change to your own path
data_root = 'M:/dataset/dog_cat/'
train_data_dir = data_root + 'data/train'
validation_data_dir = data_root + 'data/validation'
nb_train_samples = 2000
nb_validation_samples = 800
epochs = 50
batch_size = 16


def save_bottlebeck_features():
    datagen = ImageDataGenerator(rescale=1. / 255)

    # build the VGG16 network
    model = applications.VGG16(include_top=False, weights='imagenet')

    generator = datagen.flow_from_directory(
        train_data_dir,
        target_size=(img_width, img_height),
        batch_size=batch_size,
        class_mode=None,
        shuffle=False)
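    # steps must be nb_train_samples // batch_size (2000 // 16 = 125) so that
    # the number of saved features matches the labels built in train_top_model()
    bottleneck_features_train = model.predict_generator(
        generator, nb_train_samples // batch_size)
    np.save('bottleneck_features_train.npy',
            bottleneck_features_train)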

    generator = datagen.flow_from_directory(
        validation_data_dir,
        target_size=(img_width, img_height),
        batch_size=batch_size,
        class_mode=None,
        shuffle=False)
    bottleneck_features_validation = model.predict_generator(
        generator, nb_validation_samples // batch_size)
    np.save('bottleneck_features_validation.npy',
            bottleneck_features_validation)


def train_top_model():
    # the features were generated with shuffle=False, so the first half
    # belongs to cats (label 0) and the second half to dogs (label 1)
    train_data = np.load('bottleneck_features_train.npy')
    train_labels = np.array([0] * int(nb_train_samples / 2) + [1] * int(nb_train_samples / 2))

    validation_data = np.load('bottleneck_features_validation.npy')
    validation_labels = np.array([0] * int(nb_validation_samples / 2) + [1] * int(nb_validation_samples / 2))

    model = Sequential()
    model.add(Flatten(input_shape=train_data.shape[1:]))
    model.add(Dense(256, activation='relu'))
    model.add(Dropout(0.5))
    model.add(Dense(1, activation='sigmoid'))

    model.compile(optimizer='rmsprop',
                  loss='binary_crossentropy', metrics=['accuracy'])

    model.fit(train_data, train_labels,
              epochs=epochs,
              batch_size=batch_size,
              validation_data=(validation_data, validation_labels))
    model.save_weights(top_model_weights_path)


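# Run save_bottlebeck_features() once to create the .npy files;
# it can stay commented out for later runs.
# save_bottlebeck_features()
train_top_model()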
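
The step count passed to `predict_generator` decides how many bottleneck features get saved, and the hard-coded label arrays only line up when that count equals the number of images. The sketch below shows the arithmetic in plain Python. `bottleneck_labels` is a hypothetical helper, and it assumes `shuffle=False` and that the generator visits class folders in alphabetical order (cats as 0, dogs as 1).

```python
def bottleneck_labels(n_cats, n_dogs, batch_size):
    """Return (steps, labels) for features produced by whole batches only.

    Hypothetical helper, not part of Keras. Assumes unshuffled data with
    all cat images first, followed by all dog images.
    """
    steps = (n_cats + n_dogs) // batch_size
    n_features = steps * batch_size       # images actually passed through VGG16
    kept_cats = min(n_cats, n_features)
    kept_dogs = n_features - kept_cats    # the tail of dogs/ is dropped
    return steps, [0] * kept_cats + [1] * kept_dogs


# Setup used in this post: 2000 images, batch size 16, nothing is dropped.
steps, labels = bottleneck_labels(1000, 1000, 16)
print(steps, len(labels))               # 125 2000

# A batch size that does not divide 2000 drops the last 16 dog images.
steps, labels = bottleneck_labels(1000, 1000, 32)
print(steps, len(labels), sum(labels))  # 62 1984 984
```

With the second setting, a label array built as 1000 zeros plus 1000 ones would no longer match the 1984 saved features, which is the kind of mismatch the step count has to avoid.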

Code for experiment 3. I added some extra API usage that may be useful as a reference later:

'''This script goes along the blog post
"Building powerful image classification models using very little data"
from blog.keras.io.
It uses data that can be downloaded at:
https://www.kaggle.com/c/dogs-vs-cats/data
In our setup, we:
- created a data/ folder
- created train/ and validation/ subfolders inside data/
- created cats/ and dogs/ subfolders inside train/ and validation/
- put the cat pictures index 0-999 in data/train/cats
- put the cat pictures index 1000-1400 in data/validation/cats
- put the dogs pictures index 12500-13499 in data/train/dogs
- put the dog pictures index 13500-13900 in data/validation/dogs
So that we have 1000 training examples for each class, and 400 validation examples for each class.
In summary, this is our directory structure:
```
data/
    train/
        dogs/
            dog001.jpg
            dog002.jpg
            ...
        cats/
            cat001.jpg
            cat002.jpg
            ...
    validation/
        dogs/
            dog001.jpg
            dog002.jpg
            ...
        cats/
            cat001.jpg
            cat002.jpg
            ...
```
'''

# Thanks to http://blog.csdn.net/aggresss/article/details/78588135 for the bug fixes.

from keras import applications
from keras.preprocessing.image import ImageDataGenerator
from keras import optimizers
from keras.models import Sequential
from keras.layers import Dropout, Flatten, Dense
from keras.models import Model
from keras.regularizers import l2

# path to the model weights files.
# (weights_path is unused below: the ImageNet weights are loaded via weights='imagenet')
weights_path = '../keras/examples/vgg16_weights.h5'
top_model_weights_path = 'bottleneck_fc_model.h5'
# dimensions of our images.
img_width, img_height = 150, 150

data_root = 'M:/dataset/dog_cat/'
train_data_dir = data_root + 'data/train'
validation_data_dir = data_root + 'data/validation'

nb_train_samples = 2000
nb_validation_samples = 800
epochs = 50
batch_size = 16

# build the VGG16 network
# input_shape fixes the input size used for training
base_model = applications.VGG16(weights='imagenet', include_top=False, input_shape=(150, 150, 3))
print('Model loaded.')

# build a classifier model to put on top of the convolutional model
top_model = Sequential()
top_model.add(Flatten(input_shape=base_model.output_shape[1:]))
top_model.add(Dense(256, activation='relu', kernel_regularizer=l2(0.001)))
top_model.add(Dropout(0.8))
top_model.add(Dense(1, activation='sigmoid'))

# note that it is necessary to start with a fully-trained
# classifier, including the top classifier,
# in order to successfully do fine-tuning
top_model.load_weights(top_model_weights_path)

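# add the model on top of the convolutional base
# The original script calls model.add(top_model), which fails here because
# applications.VGG16 returns a functional Model with no add() method,
# so the two parts are joined with the functional API instead.
model = Model(inputs=base_model.input, outputs=top_model(base_model.output))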


# set the first 15 layers (up to and including block4_pool)
# to non-trainable (weights will not be updated); the original
# script's index of 25 does not match this model's layer list
for layer in model.layers[:15]:
    layer.trainable = False

# compile the model with a SGD/momentum optimizer
# and a very low learning rate.
model.compile(loss='binary_crossentropy',
              optimizer=optimizers.SGD(lr=1e-4, momentum=0.9),
              metrics=['accuracy'])

# prepare data augmentation configuration
train_datagen = ImageDataGenerator(
    rescale=1. / 255,
    shear_range=0.2,
    zoom_range=0.2,
    horizontal_flip=True)

test_datagen = ImageDataGenerator(rescale=1. / 255)

train_generator = train_datagen.flow_from_directory(
    train_data_dir,
    target_size=(img_height, img_width),
    batch_size=batch_size,
    class_mode='binary')

validation_generator = test_datagen.flow_from_directory(
    validation_data_dir,
    target_size=(img_height, img_width),
    batch_size=batch_size,
    class_mode='binary')

model.summary() # prints a summary representation of your model.
# let's visualize layer names and layer indices to see how many layers
# we should freeze:
for i, layer in enumerate(base_model.layers):
    print(i, layer.name)


from keras.utils import plot_model
# plot_model needs pydot and graphviz to be installed
plot_model(model, to_file='model.png')

from keras.callbacks import History
from keras.callbacks import ModelCheckpoint
import keras
history = History()
model_checkpoint = ModelCheckpoint('temp_model.hdf5', monitor='loss', save_best_only=True)
tb_cb = keras.callbacks.TensorBoard(log_dir='log', write_images=True, histogram_freq=0)
# log_dir sets where the TensorBoard logs are stored; write_images saves the
# network weights as images for display in TensorBoard; histogram_freq is the
# interval (in epochs) at which histograms of weights and layer outputs are
# computed, and 0 turns them off
callbacks = [
    history,
    model_checkpoint,
    tb_cb,
]
# model.fit()


# fine-tune the model
history=model.fit_generator(
    train_generator,
    steps_per_epoch=nb_train_samples // batch_size,
    epochs=epochs,
    callbacks=callbacks,
    validation_data=validation_generator,
    validation_steps=nb_validation_samples // batch_size,
    verbose=2)

model.save('fine_tune_model.h5')
model.save_weights('fine_tune_model_weight')
print(history.history)


from matplotlib import pyplot as plt
# summarize history for accuracy
# (the keys are 'acc'/'val_acc' in the Keras version used here;
# newer releases name them 'accuracy'/'val_accuracy')
plt.plot(history.history['acc'])
plt.plot(history.history['val_acc'])
plt.title('model accuracy')
plt.ylabel('accuracy')
plt.xlabel('epoch')
plt.legend(['train', 'test'], loc='upper left')
plt.show()
# summarize history for loss
plt.plot(history.history['loss'])
plt.plot(history.history['val_loss'])
plt.title('model loss')
plt.ylabel('loss')
plt.xlabel('epoch')
plt.legend(['train', 'test'], loc='upper left')
plt.show()

import numpy as np
accy = history.history['acc']
np_accy = np.array(accy)
np.savetxt('save_acc.txt', np_accy)
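
The plotting and saving code reads the accuracy curves from `history.history` by key name, and that name depends on the Keras release: older versions report `acc`/`val_acc`, newer ones `accuracy`/`val_accuracy`. A small lookup like the one sketched below may make the script less sensitive to this. `accuracy_curves` is a hypothetical helper, and the dictionaries are made-up values used only to show the call.

```python
def accuracy_curves(history_dict):
    """Return (train_acc, val_acc) lists from a Keras-style history dict.

    Hypothetical helper: tries the older 'acc' names first, then the
    newer 'accuracy' names.
    """
    for train_key, val_key in (("acc", "val_acc"),
                               ("accuracy", "val_accuracy")):
        if train_key in history_dict:
            return history_dict[train_key], history_dict.get(val_key, [])
    raise KeyError("no accuracy metric found in history")


# Made-up numbers, only to show the call; not results from this experiment.
old_style = {"acc": [0.5, 0.75], "val_acc": [0.5, 0.625], "loss": [0.7, 0.5]}
new_style = {"accuracy": [0.5, 0.75], "val_accuracy": [0.5, 0.625]}

print(accuracy_curves(old_style) == accuracy_curves(new_style))
```

In the script above this would be called as `train_acc, val_acc = accuracy_curves(history.history)` before plotting.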

Result:

Model loaded.
Found 2000 images belonging to 2 classes.
Found 800 images belonging to 2 classes.
_________________________________________________________________
Layer (type)                 Output Shape              Param #   
=================================================================
input_1 (InputLayer)         (None, 150, 150, 3)       0         
_________________________________________________________________
block1_conv1 (Conv2D)        (None, 150, 150, 64)      1792      
_________________________________________________________________
block1_conv2 (Conv2D)        (None, 150, 150, 64)      36928     
_________________________________________________________________
block1_pool (MaxPooling2D)   (None, 75, 75, 64)        0         
_________________________________________________________________
block2_conv1 (Conv2D)        (None, 75, 75, 128)       73856     
_________________________________________________________________
block2_conv2 (Conv2D)        (None, 75, 75, 128)       147584    
_________________________________________________________________
block2_pool (MaxPooling2D)   (None, 37, 37, 128)       0         
_________________________________________________________________
block3_conv1 (Conv2D)        (None, 37, 37, 256)       295168    
_________________________________________________________________
block3_conv2 (Conv2D)        (None, 37, 37, 256)       590080    
_________________________________________________________________
block3_conv3 (Conv2D)        (None, 37, 37, 256)       590080    
_________________________________________________________________
block3_pool (MaxPooling2D)   (None, 18, 18, 256)       0         
_________________________________________________________________
block4_conv1 (Conv2D)        (None, 18, 18, 512)       1180160   
_________________________________________________________________
block4_conv2 (Conv2D)        (None, 18, 18, 512)       2359808   
_________________________________________________________________
block4_conv3 (Conv2D)        (None, 18, 18, 512)       2359808   
_________________________________________________________________
block4_pool (MaxPooling2D)   (None, 9, 9, 512)         0         
_________________________________________________________________
block5_conv1 (Conv2D)        (None, 9, 9, 512)         2359808   
_________________________________________________________________
block5_conv2 (Conv2D)        (None, 9, 9, 512)         2359808   
_________________________________________________________________
block5_conv3 (Conv2D)        (None, 9, 9, 512)         2359808   
_________________________________________________________________
block5_pool (MaxPooling2D)   (None, 4, 4, 512)         0         
_________________________________________________________________
sequential_1 (Sequential)    (None, 1)                 2097665   
=================================================================
Total params: 16,812,353
Trainable params: 9,177,089
Non-trainable params: 7,635,264
_________________________________________________________________
0 input_1
1 block1_conv1
2 block1_conv2
3 block1_pool
4 block2_conv1
5 block2_conv2
6 block2_pool
7 block3_conv1
8 block3_conv2
9 block3_conv3
10 block3_pool
11 block4_conv1
12 block4_conv2
13 block4_conv3
14 block4_pool
15 block5_conv1
16 block5_conv2
17 block5_conv3
18 block5_pool
Backend TkAgg is interactive backend. Turning interactive mode on.
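
The parameter counts in the summary can be checked by hand, which also confirms that freezing `model.layers[:15]` leaves exactly block5 and the top classifier trainable. The sketch below redoes the arithmetic in plain Python. The helper names are illustrative, and the layer sizes are read off the summary table above (3x3 kernels, channel counts per block, and a 4x4x512 feature map going into the classifier).

```python
def conv2d_params(in_channels, out_channels, kernel=3):
    """Weights plus biases of one Conv2D layer with a square kernel."""
    return kernel * kernel * in_channels * out_channels + out_channels


def dense_params(in_units, out_units):
    """Weights plus biases of one Dense layer."""
    return in_units * out_units + out_units


# block5: three Conv2D layers with 512 input and 512 output channels
block5 = sum(conv2d_params(512, 512) for _ in range(3))

# top model: Flatten(4*4*512) -> Dense(256) -> Dropout -> Dense(1)
top_model = dense_params(4 * 4 * 512, 256) + dense_params(256, 1)

# blocks 1-4, frozen by the loop over model.layers[:15]
frozen = (
    conv2d_params(3, 64) + conv2d_params(64, 64)
    + conv2d_params(64, 128) + conv2d_params(128, 128)
    + conv2d_params(128, 256) + 2 * conv2d_params(256, 256)
    + conv2d_params(256, 512) + 2 * conv2d_params(512, 512)
)

print(top_model)            # 2097665, the sequential_1 row
print(block5 + top_model)   # 9177089, "Trainable params"
print(frozen)               # 7635264, "Non-trainable params"
```

Pooling, Flatten and Dropout layers contribute no parameters, so they do not appear in the sums.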

Reference: part 8, "Training neural networks with Keras", of the series 《顯卡就是開發板》 ("The graphics card is the development board")
