YOLOv5: Clustering Anchor Box Sizes with k-means

阿新 • Published: 2021-01-08

The training annotations are in the following format:

[
    {
        "name": "235_2_t20201127123021723_CAM2.jpg",
        "image_height": 6000,
        "image_width": 8192,
        "category": 5,
        "bbox": [
            1876.06,
            998.04,
            1883.06,
            1004.04
        ]
    },
    {
        "name": "235_2_t20201127123021723_CAM2.jpg",
        "image_height": 6000,
        "image_width": 8192,
        "category": 5,
        "bbox": [
            1655.06,
            1094.04,
            1663.06,
            1102.04
        ]
    }
]

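Clustering the anchor boxes only needs the x, y coordinates of the top-left and bottom-right corners stored in bbox, i.e. [x_min, y_min, x_max, y_max]; each box's width and height are derived from them.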
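
As a small illustration, the two sample annotations above reduce to width/height pairs of roughly 7×6 and 8×8 pixels. This is only a sketch: the `sample_bboxes` array is copied by hand from the JSON excerpt, and `bbox_to_wh` is a hypothetical helper name, not part of the script below.

```python
import numpy as np

# Hand-copied from the two sample annotations above: [x_min, y_min, x_max, y_max].
sample_bboxes = np.array([
    [1876.06, 998.04, 1883.06, 1004.04],
    [1655.06, 1094.04, 1663.06, 1102.04],
])


def bbox_to_wh(bboxes):
    """Hypothetical helper: convert (n, 4) corner boxes to (n, 2) width/height pairs."""
    widths = bboxes[:, 2] - bboxes[:, 0]
    heights = bboxes[:, 3] - bboxes[:, 1]
    return np.stack([widths, heights], axis=1)


wh = bbox_to_wh(sample_bboxes)
print(np.round(wh, 2))
```

Only these (width, height) pairs are fed to the clustering; the position of a box inside the image plays no role.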

k-means clustering code:

import numpy as np
import json


def iou(box, clusters):
    """
    Compute the IoU between one box and each cluster.
    param:
        box: tuple or array, shifted to the origin (i. e. width and height)
        clusters: numpy array of shape (k, 2) where k is the number of clusters
    return:
        numpy array of shape (k,) where k is the number of clusters
    """
    x = np.minimum(clusters[:, 0], box[0])
    y = np.minimum(clusters[:, 1], box[1])
    if np.count_nonzero(x == 0) > 0 or np.count_nonzero(y == 0) > 0:
        raise ValueError("Box has no area")

    intersection = x * y
    box_area = box[0] * box[1]
    cluster_area = clusters[:, 0] * clusters[:, 1]

    iou_ = intersection / (box_area + cluster_area - intersection + 1e-10)

    return iou_


# Compute the mean IoU between an array of boxes and k clusters, scoring each box against its best-matching cluster.
def avg_iou(boxes, clusters):
    """
    param:
        boxes: numpy array of shape (r, 2), where r is the number of rows
        clusters: numpy array of shape (k, 2) where k is the number of clusters
    return:
        average IoU as a single float
    """
    return np.mean([np.max(iou(boxes[i], clusters)) for i in range(boxes.shape[0])])


# Shift all boxes to the origin, keeping only their width and height.
def translate_boxes(boxes):
    """
    param:
        boxes: numpy array of shape (r, 4)
    return:
        numpy array of shape (r, 2)
    """
    new_boxes = boxes.copy()
    for row in range(new_boxes.shape[0]):
        new_boxes[row][2] = np.abs(new_boxes[row][2] - new_boxes[row][0])
        new_boxes[row][3] = np.abs(new_boxes[row][3] - new_boxes[row][1])
    return np.delete(new_boxes, [0, 1], axis=1)


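# Run k-means clustering with the Intersection over Union (IoU) metric.
def kmeans(boxes, k, dist=np.median):
    """
    param:
        boxes: numpy array of shape (r, 2), where r is the number of rows
        k: number of clusters
        dist: function used to recompute each cluster center from its member boxes (np.median by default)
    return:
        numpy array of shape (k, 2)
    """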
    rows = boxes.shape[0]

    distances = np.empty((rows, k))
    last_clusters = np.zeros((rows,))

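    np.random.seed()  # no argument: reseeds from OS entropy, so each run starts from different centers

    # the Forgy method will fail if the whole array contains the same rows
    clusters = boxes[np.random.choice(rows, k, replace=False)]  # initialize the k cluster centers by picking k random rows from the data

    while True:
        for row in range(rows):
            # Distance metric: d(box, centroid) = 1 - IoU(box, centroid). A smaller distance should mean a better match, but a larger IoU means a better match, so 1 - IoU is used: the smaller the distance, the larger the IoU.
            distances[row] = 1 - iou(boxes[row], clusters)
        # Assign each annotated box to the cluster center with the smallest distance.
        nearest_clusters = np.argmin(distances, axis=1)
        # Stop once no box changes cluster, i.e. the assignments (and therefore the centers) are stable.
        if (last_clusters == nearest_clusters).all():
            break
        # Update the cluster centers (by default, the median of each cluster's boxes becomes the new center).
        for cluster in range(k):
            members = boxes[nearest_clusters == cluster]
            if len(members) > 0:  # keep the previous center if a cluster is empty; np.median of an empty array yields NaN
                clusters[cluster] = dist(members, axis=0)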

        last_clusters = nearest_clusters

    return clusters


# Read the annotation data from the JSON file
def parse_anno(annotation_path):
    with open(annotation_path, 'r') as f:
        anno = json.load(f)
    result = []
    for line in anno:
        bbox = line['bbox']
        x_min, y_min, x_max, y_max = bbox[0], bbox[1], bbox[2], bbox[3]
        # Compute the box size (width and height)
        width = x_max - x_min
        height = y_max - y_min
        assert width > 0
        assert height > 0
        result.append([width, height])
    result = np.asarray(result)
    return result


def get_kmeans(anno, cluster_num=9):

    anchors = kmeans(anno, cluster_num)
    ave_iou = avg_iou(anno, anchors)

    anchors = anchors.astype('int').tolist()

    anchors = sorted(anchors, key=lambda x: x[0] * x[1])

    return anchors, ave_iou


if __name__ == '__main__':
    annotation_path = "tile_round1_train_20201231/train_annos.json"
    anno_result = parse_anno(annotation_path)

    anchors, ave_iou = get_kmeans(anno_result, 9)

    anchor_string = ''
    for anchor in anchors:
        anchor_string += '{},{}, '.format(anchor[0], anchor[1])
    anchor_string = anchor_string[:-2]

    print(f'anchors are: {anchor_string}')
    print(f'the average iou is: {ave_iou}')
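
To make the distance d = 1 - IoU used in `kmeans` concrete, here is a hand-checkable sketch for boxes that share a top-left corner. The helper name `iou_wh` and the sample sizes are illustrative assumptions; the arithmetic mirrors the `iou` function above, without its zero-area check and epsilon term.

```python
import numpy as np


def iou_wh(box, clusters):
    """IoU between one (w, h) box and k (w, h) clusters, all anchored at the origin."""
    inter = np.minimum(clusters[:, 0], box[0]) * np.minimum(clusters[:, 1], box[1])
    union = box[0] * box[1] + clusters[:, 0] * clusters[:, 1] - inter
    return inter / union


box = np.array([4.0, 4.0])
clusters = np.array([[4.0, 4.0], [2.0, 2.0], [8.0, 4.0]])

ious = iou_wh(box, clusters)
distances = 1 - ious
print(ious)       # identical box -> 1.0, quarter-area box -> 0.25, double-width box -> 0.5
print(distances)  # the identical box has distance 0
print(np.argmin(distances))  # index of the closest cluster
```

Because both boxes are anchored at the origin, the intersection is simply the smaller width times the smaller height, which is why the clustering never needs the box positions.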

Results differ slightly from run to run, because the initial cluster centers are chosen at random.
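
If repeatable anchors are needed, one option is to pass an explicit seed to the random generator. The sketch below is a variant written under several assumptions, not the original script: `kmeans_seeded`, its `seed` parameter, the `max_iter` cap, the `pairwise_iou` helper, and the synthetic box sizes are all made up for illustration, and it computes IoU for all boxes at once instead of looping row by row.

```python
import numpy as np


def pairwise_iou(boxes, clusters):
    """IoU between every (w, h) box and every (w, h) cluster; result has shape (r, k)."""
    inter_w = np.minimum(boxes[:, None, 0], clusters[None, :, 0])
    inter_h = np.minimum(boxes[:, None, 1], clusters[None, :, 1])
    inter = inter_w * inter_h
    box_area = (boxes[:, 0] * boxes[:, 1])[:, None]
    cluster_area = (clusters[:, 0] * clusters[:, 1])[None, :]
    return inter / (box_area + cluster_area - inter)


def kmeans_seeded(boxes, k, seed=0, max_iter=300):
    """Hypothetical seeded variant of k-means with the 1 - IoU distance."""
    rng = np.random.default_rng(seed)
    # Pick k distinct rows as the initial centers; the seed makes this choice repeatable.
    clusters = boxes[rng.choice(len(boxes), size=k, replace=False)].astype(float)
    assignment = np.full(len(boxes), -1)
    for _ in range(max_iter):
        nearest = np.argmin(1 - pairwise_iou(boxes, clusters), axis=1)
        if np.array_equal(nearest, assignment):
            break  # assignments stopped changing
        assignment = nearest
        for c in range(k):
            members = boxes[assignment == c]
            if len(members) > 0:  # leave an empty cluster's center unchanged
                clusters[c] = np.median(members, axis=0)
    return clusters


# Synthetic (width, height) data, used only to demonstrate repeatability.
demo_rng = np.random.default_rng(123)
demo_boxes = demo_rng.uniform(5.0, 200.0, size=(500, 2))

run_a = kmeans_seeded(demo_boxes, k=9, seed=42)
run_b = kmeans_seeded(demo_boxes, k=9, seed=42)
print(np.array_equal(run_a, run_b))
```

With the same seed and the same data, both runs start from the same centers and therefore end at the same anchors; changing the seed is a cheap way to compare several initializations and keep the one with the highest average IoU.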

Reference: https://blog.csdn.net/zuliang001/article/details/90551798
