YOLOv3中K-Means聚類出新資料集的Anchor尺寸

阿新 • • 發佈：2020-08-02

參考部落格：

聚類kmeans演算法在yolov3中的應用 https://www.cnblogs.com/sdu20112013/p/10937717.html

這篇部落格寫得非常詳細，也貼出了github程式碼：https://github.com/AlexeyAB/darknet/blob/master/scripts/gen_anchors.py

整體程式碼如下：

  1 '''
  2 Created on Feb 20, 2017
  3 @author: jumabek
  4 '''
  5 from os import listdir
  6 from os.path import isfile, join
 
  7 import argparse
  8 # import cv2
  9 import numpy as np
 10 import sys
 11 import os
 12 import shutil
 13 import random
 14 import math
 15 width_in_cfg_file = 416.
 16 height_in_cfg_file = 416.
 17 
 18 
 19 def IOU(x, centroids):
 20     '''
 21     :param x: 當前gt的w和h
 22     :param centroids: 質心
 
 23     :return:當前gt與每個質心的相似度，np.array形式
 24     '''
 25     similarities = []
 26     k = len(centroids)
 27     for centroid in centroids:
 28         c_w, c_h = centroid
 29         w, h = x
 30         if c_w >= w and c_h >= h:
 31             similarity = w * h / (c_w * c_h)
 32         elif 
 c_w >= w and c_h <= h:
 33             similarity = w * c_h / (w * h + (c_w - w) * c_h)  # 交叉面積/總面積
 34         elif c_w <= w and c_h >= h:
 35             similarity = c_w * h / (w * h + c_w * (c_h - h))
 36         else:  # means both w,h are bigger than c_w and c_h respectively
 37             similarity = (c_w * c_h) / (w * h)
 38         similarities.append(similarity)  # will become (k,) shape
 39     return np.array(similarities)
 40 
 41 
 42 def avg_IOU(X, centroids):
 43     n, d = X.shape
 44     sum = 0.
 45     for i in range(X.shape[0]):
 46         # note IOU() will return array which contains IoU for each centroid and X[i] // slightly ineffective, but I am too lazy
 47         sum += max(IOU(X[i], centroids))
 48     return sum / n
 49 
 50 
 51 def write_anchors_to_file(centroids, X, anchor_file):
 52     f = open(anchor_file, 'w')
 53     anchors = centroids.copy()
 54     print(anchors.shape)
 55     for i in range(anchors.shape[0]):
 56         anchors[i][0] *= width_in_cfg_file  # / 32. YOLOv3不用除以32 # 歸一化後的寬高乘以預設的圖片寬高
 57         anchors[i][1] *= height_in_cfg_file  # / 32.
 58     widths = anchors[:, 0]
 59     sorted_indices = np.argsort(widths)
 60     print('Anchors = ', anchors[sorted_indices])
 61     for i in sorted_indices[:-1]:
 62         # 將前n-1個anchor寫入txt
 63         f.write('%0.2f,%0.2f, ' % (anchors[i, 0], anchors[i, 1]))
 64     # there should not be comma after last anchor, that's why
 65     # 最後一個anchor寫完以後需要換行，所以單獨填寫
 66     f.write('%0.2f,%0.2f\n' % (anchors[sorted_indices[-1:], 0], anchors[sorted_indices[-1:], 1]))
 67     f.write('%f\n' % (avg_IOU(X, centroids)))
 68     print()
 69 
 70 
 71 def kmeans(X, centroids, eps, anchor_file):
 72     '''
 73 
 74     :param X: annotation_dims,所有的標註資訊中的寬和高
 75     :param centroids: 隨機生成的質心
 76     :param eps:
 77     :param anchor_file: 儲存結果的檔案
 78     :return:
 79     '''
 80     N = X.shape[0]
 81     iterations = 0
 82     k, dim = centroids.shape
 83     prev_assignments = np.ones(N) * (-1)
 84     iter = 0
 85     old_D = np.zeros((N, k))
 86     while True:
 87         D = []
 88         iter += 1
 89         for i in range(N):
 90             # 計算gt框與質心之間的距離，相似度越大，說明當前gt越接近於質心，此距離就應該越小
 91             d = 1 - IOU(X[i], centroids)
 92             D.append(d)
 93         D = np.array(D)  # D.shape = (N,k)
 94         print("iter {}: dists = {}".format(iter, np.sum(np.abs(old_D - D))))
 95         # assign samples to centroids
 96         assignments = np.argmin(D, axis=1)  # 返回每一行的最小值的下標.即當前樣本應該歸為k個質心中的哪一個質心.
 97         if (assignments == prev_assignments).all():  # 質心已經不再變化
 98             print("Centroids = ", centroids)
 99             write_anchors_to_file(centroids, X, anchor_file)
100             return
101         # calculate new centroids，更新質心
102         centroid_sums = np.zeros((k, dim), np.float)
103         for i in range(N):
104             centroid_sums[assignments[i]] += X[i]
105         for j in range(k):
106             centroids[j] = centroid_sums[j] / (np.sum(assignments == j))
107         prev_assignments = assignments.copy()
108         old_D = D.copy()
109 
110 
111 def main(argv):
112     parser = argparse.ArgumentParser()
113     parser.add_argument('-filelist', default='F://BaiduNetdiskDownload//trainall_name.txt',
114                         help='path to filelist\n')
115     parser.add_argument('-output_dir', default='F://BaiduNetdiskDownload//generated_anchors//anchors//', type=str,
116                         help='Output anchor directory\n')
117     parser.add_argument('-num_clusters', default=6, type=int,
118                         help='number of clusters\n')
119     args = parser.parse_args()
120     if not os.path.exists(args.output_dir):
121         os.mkdir(args.output_dir)
122     f = open(args.filelist)
123     lines = [line.rstrip('\n') for line in f.readlines()]
124     annotation_dims = []
125     size = np.zeros((1, 1, 3))
126     for line in lines:
127         # 注意路徑問題，通過替換圖片路徑中的Images為labels來找到標籤資訊
128         line = line.replace('Images','labels')
129         # line = line.replace('img1','labels')
130         # line = line.replace('JPEGImages', 'labels')
131         line = line.replace('.jpg', '.txt')
132         line = line.replace('.png', '.txt')
133         print(line)
134 
135         f2 = open(line)
136         for line in f2.readlines():
137             line = line.rstrip('\n')
138             w, h = line.split(' ')[3:]  # 得到標註檔案的寬和高[0 0.83984 0.40700 0.17188 0.47218]
139             # print(w,h)
140             annotation_dims.append(tuple(map(float, (w, h))))
141     annotation_dims = np.array(annotation_dims)
142     eps = 0.005
143     if args.num_clusters == 0:
144         for num_clusters in range(1, 11):  # we make 1 through 10 clusters
145             anchor_file = join(args.output_dir, 'anchors%d.txt' % (num_clusters))
146             indices = [random.randrange(annotation_dims.shape[0]) for i in range(num_clusters)]
147             centroids = annotation_dims[indices]
148             kmeans(annotation_dims, centroids, eps, anchor_file)
149             print('centroids.shape', centroids.shape)
150     else:
151         anchor_file = join(args.output_dir, 'anchors%d.txt' % (args.num_clusters))  # 儲存結果的檔案
152         # 在所有labels數量範圍內隨機生成質心的索引數，生成num_clusters個
153         indices = [random.randrange(annotation_dims.shape[0]) for i in range(args.num_clusters)]
154         # 生成質心
155         centroids = annotation_dims[indices]
156         # 呼叫kmeans
157         kmeans(annotation_dims, centroids, eps, anchor_file)
158         print('centroids.shape', centroids.shape)
159 
160 
161 if __name__ == "__main__":
162     main(sys.argv)

使用生成YOLOv3 anchor時需要注意

anchors[i][0] *= width_in_cfg_file # / 32. YOLOv3不用除以32 # 歸一化後的寬高乘以預設的圖片寬高

最後生成的結果，6個anchors：

7.90,21.48, 18.72,61.61, 34.67,138.55, 65.49,251.30, 104.70,64.11, 144.33,434.60
0.582349

可以看出寬高比都為1：3左右，結合我使用的是行人檢測的資料集，這個比例還算正常。但第5組資料（104.70，64.11）不符合這個寬高比，可能需要手動調整一下

YOLOv3中K-Means聚類出新資料集的Anchor尺寸

參考部落格：聚類kmeans演算法在yolov3中的應用 https://www.cnblogs.com/sdu20112013/p/10937717.html

YOLOV5——使用 k-means 聚類 anchorbox 資料

訓練的標註資料格式如下： [ { \"name\": \"235_2_t20201127123021723_CAM2.jpg\", \"image_height\": 6000,

在Python中使用K-Means聚類和PCA主成分分析進行影象壓縮

在Python中使用K-Means聚類和PCA主成分分析進行影象壓縮各位讀者好，在這片文章中我們嘗試使用sklearn庫比較k-means聚類演算法和主成分分析（PCA）在影象壓縮上的實現和結果。壓縮影象的效果通過佔用的減少比例以及

拓端tecdat|R語言譜聚類、K-means聚類分析非線性環狀資料比較

原文連結：http://tecdat.cn/?p=23276 原文出處：拓端資料部落公眾號有些問題是線性的，但有些問題是非線性的。我假設，你過去的知識是從討論和解決線性問題開始的，這是一個自然的起點。對於非線性問題的解決，往

python基於K-means聚類演算法的影象分割

1 K-means演算法實際上，無論是從演算法思想，還是具體實現上，K-means演算法是一種很簡單的演算法。它屬於無監督分類，通過按照一定的方式度量樣本之間的相似度，通過迭代更新聚類中心，當聚類中心不再移動或移動

python 程式碼實現k-means聚類分析的思路(不使用現成聚類庫)

一、實驗目標　　　　1、使用 K-means 模型進行聚類，嘗試使用不同的類別個數 K，並分析聚類結果。

k-means 聚類演算法與Python實現程式碼

k-means 聚類演算法思想先隨機選擇k個聚類中心，把集合裡的元素與最近的聚類中心聚為一類，得到一次聚類，再把每一個類的均值作為新的聚類中心重新聚類，迭代n次得到最終結果分步解析

簡單的k-means聚類

演算法步驟：在樣本中隨機選取k個樣本點充當各個簇的中心點；計算所有樣本點與各個簇中心之間的距離，然後把樣本點劃入最近的簇中；

Python用K-means聚類演算法進行客戶分群的實現

一、背景 1.專案描述你擁有一個超市(Supermarket Mall)。通過會員卡，你用有一些關於你的客戶的基本資料，如客戶ID，年齡，性別，年收入和消費分數。

MATLAB k-means聚類

聚類演算法，不是分類演算法。分類演算法是給一個數據，然後判斷這個資料屬於已分好的類中的具體哪一類。

matlab實現K-means聚類演算法（轉載）

https://blog.csdn.net/wys7541/article/details/82153844 K-means聚類演算法的一般步驟：初始化。輸入基因表達矩陣作為物件集X，輸入指定聚類類數N，並在X中隨機選取N個物件作為初始聚類中心。設定迭代中止條件，

論文題目：基於K-means聚類的三維點雲分類

論文題目：基於K-means聚類的三維點雲分類論文主要思想：先對原始點雲進行預處理保留關鍵點，密集去冗餘、稀疏進行三角形插值。再通過K-means聚類操作獲取區域性特徵，之後並行同過pointnet進行特徵提取。

關於k-means聚類演算法的原理及解析

一、k-means演算法思想：第一步，從檔案中讀取資料，點用元組表示，點集用列表表示。第二步，初始化聚類中心。首先獲取資料的長度，然後在range(0,length)這個區間上隨機產生k個不同的值,以此為下標提取出資料點，

OpenCV計算機視覺學習（12）——影象量化處理&影象取樣處理（K-Means聚類量化，區域性馬賽克處理）

如果需要處理的原圖及程式碼，請移步小編的GitHub地址　　傳送門：請點選我

K-means聚類演算法及python程式碼實現

K-means聚類演算法（事先資料並沒有類別之分！所有的資料都是一樣的） 1、概述

機器學習 - k-means聚類

k-means簡介 k-means是無監督學習下的一種聚類演算法，簡單說就是不需要資料標籤，僅靠特徵值就可以將資料分為指定的幾類。k-means演算法的核心就是通過計算每個資料點與k個質心（或重心）之間的距離，找出與各質心

K-means聚類

專案預備 1 from numpy.random import RandomState 2 from sklearn.datasets import make_blobs 3 import matplotlib.pyplot as plt

Python機器學習的練習七：K-Means聚類和主成分分析

這部分練習涵蓋兩個吸引人的話題：K-Means聚類和主成分分析（PCA），K-Means和PCA都是無監督學習技術的例子，無監督學習問題沒有為我們提供任何標籤或者目標去學習做出預測，所以無監督演算法試圖從資料本身中學習一

K-Means聚類演算法k值選取——輪廓係數

1 # 1 匯入模組和包 2 import matplotlib.pyplot as plt #匯入繪製資料圖的資料庫 3 from sklearn.datasets import make_blobs

k-means聚類演算法原理

k-means演算法原理 k-means是一種無監督的分類演算法，簡而言之就是餵給演算法的資料是沒有標籤的，但是我們需要自己設定k值（分類數）。如顆粒影象中有四種顏色，則設立k值為4.而相反，KNN分類演算法是一種有監督的

YOLOv3中K-Means聚類出新資料集的Anchor尺寸

相關推薦