Scikit-learn例項之Pca+Svm人臉識別(AT&T資料集)

阿新 • • 發佈：2019-01-10

from __future__ import print_function

from time import time
import logging
import matplotlib.pyplot as plt
import cv2

from numpy import *
from sklearn.model_selection import train_test_split
from sklearn.model_selection import GridSearchCV
from sklearn.metrics import classification_report
from sklearn.metrics import confusion_matrix
from sklearn.decomposition import PCA
from sklearn.svm import SVC

PICTURE_PATH = "D:\\Data\\"

def get_Image():
    for i in range(1,41):
        for j in range(1,11):
            path = PICTURE_PATH + "\\s" + str(i) + "\\"+ str(j) + ".pgm"
            img = cv2.imread(path)
            img_gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
            h,w = img_gray.shape
            img_col = img_gray.reshape(h*w)
            all_data_set.append(img_col)
            all_data_label.append(i)
    return h,w

all_data_set = []
all_data_label = []
h,w = get_Image()

X = array(all_data_set)
y = array(all_data_label)
n_samples,n_features = X.shape
n_classes = len(unique(y))
target_names = []
for i in range(1,41):
    names = "person" + str(i)
    target_names.append(names)

print("Total dataset size:")
print("n_samples: %d" % n_samples)
print("n_features: %d" % n_features)
print("n_classes: %d" % n_classes)

# split into a training and testing set
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=42)

n_components = 10
print("Extracting the top %d eigenfaces from %d faces"
      % (n_components, X_train.shape[0]))

t0 = time()
pca = PCA(n_components=n_components, svd_solver='randomized',
          whiten=True).fit(X_train)
print("done in %0.3fs" % (time() - t0))

eigenfaces = pca.components_.reshape((n_components, h, w))

print("Projecting the input data on the eigenfaces orthonormal basis")
t0 = time()
X_train_pca = pca.transform(X_train)
X_test_pca = pca.transform(X_test)
print("done in %0.3fs" % (time() - t0))

print("Fitting the classifier to the training set")
t0 = time()
param_grid = {'C': [1e3, 5e3, 1e4, 5e4, 1e5],
              'gamma': [0.0001, 0.0005, 0.001, 0.005, 0.01, 0.1], }
clf = GridSearchCV(SVC(kernel='rbf', class_weight='balanced'), param_grid)
clf = clf.fit(X_train_pca, y_train)
print("done in %0.3fs" % (time() - t0))
print("Best estimator found by grid search:")
print(clf.best_estimator_)

print("Predicting people's names on the test set")
t0 = time()
y_pred = clf.predict(X_test_pca)
print("done in %0.3fs" % (time() - t0))

print(classification_report(y_test, y_pred, target_names=target_names))
print(confusion_matrix(y_test, y_pred, labels=range(n_classes)))

def plot_gallery(images, titles, h, w, n_row=3, n_col=4):
    """Helper function to plot a gallery of portraits"""
    plt.figure(figsize=(1.8 * n_col, 2.4 * n_row))
    plt.subplots_adjust(bottom=0, left=.01, right=.99, top=.90, hspace=.35)
    for i in range(n_row * n_col):
        plt.subplot(n_row, n_col, i + 1)
        plt.imshow(images[i].reshape((h, w)), cmap=plt.cm.gray)
        plt.title(titles[i], size=12)
        plt.xticks(())
        plt.yticks(())


# plot the result of the prediction on a portion of the test set

def title(y_pred, y_test, target_names, i):
    pred_name = target_names[y_pred[i]-1]
    true_name = target_names[y_test[i]-1]
    return 'predicted: %s\ntrue:      %s' % (pred_name, true_name)

prediction_titles = [title(y_pred, y_test, target_names, i)
                     for i in range(y_pred.shape[0])]

plot_gallery(X_test, prediction_titles, h, w)

# plot the gallery of the most significative eigenfaces

eigenface_titles = ["eigenface %d" % i for i in range(eigenfaces.shape[0])]
plot_gallery(eigenfaces, eigenface_titles, h, w)

plt.show()

Scikit-learn例項之Pca+Svm人臉識別(AT&T資料集)

from __future__ import print_function from time import time import logging import matplotlib.pyplot as plt import cv2 from numpy import * from sklearn.mo

pytorch人臉識別——自己製作資料集

這是一篇面向新手的博文：因為本人也是新手，記錄一下自己在做這個專案遇到的大大小小的坑。按照下面的例子寫就好了 import torch as t from torch.utils import data import os from PIL import Image import numpy as

基於PCA和SVM人臉識別之二.MATLAB實現

此文章中MATLAB實現均根據《數字影象處理與機器視覺----Visual c++ 與MATLAB實現》一書，我所獲得的基礎知識也大多源於此書，感謝！下面將我根據教程建立的工程以及敲擊的程式碼塊一一奉上，供日後參閱。建立以專

【SciKit-Learn學習筆記】7：PCA結合SVM做AT&T資料集人物影象分類

學習《scikit-learn機器學習》時的一些實踐。原理見PCA及繪製降維與恢復示意圖。 sklearn的PCA sklearn中包裝的PCA也是不帶有歸一化和縮放等預處理操作的，可以用MinMaxScaler()實現並裝在Pipeline裡封裝起來。 from

scikit-learn學習之SVM演算法

====================================================================== 本系列部落格主要參考 Scikit-Learn 官方網站上的每一個演算法進行，並進行部分翻譯，如有錯誤，請大家指正轉載請註明

PCA進行人臉識別

clc;clear all;close all; %測試資料：32人，每人10張照片,取前5張照片作為訓練集，後5張照片作為測試集。 ph=5;%測試訓練集樣本數 imdata=zeros(11292,32ph); for i=1:32 for j=1:ph addr=strcat(‘E:/

執行svm人臉識別程式碼提示：Intel MKL FATAL ERROR: Cannot load libmkl_avx2.so or libmkl_def.so.

Intel MKL FATAL ERROR: Cannot load libmkl_avx2.so or libmkl_def.so. Process finished with exit code 2 在anaconda 的環境中，匯入from skimage import measure

【SciKit-Learn學習筆記】5：核SVM分類和預測乳腺癌資料集

學習《scikit-learn機器學習》時的一些實踐。常用引數引數C SVM分類器svm.SVC()中的引數C即SVM所優化的目標函式 a

scikit-learn 支援向量機實現手寫體識別

隨時程式碼，閱讀筆記 %matplotlib inline import matplotlib.pyplot as plt import numpy as np from sklearn import datasets digits = datasets.load_d

CV之FC：人臉識別之判斷相似度極高的國內外明星根據人工智慧演算法(AP雲)預測判別是否為同一個人

CV之FC：人臉識別之判斷相似度極高的國內外明星根據人工智慧演算法(AP雲)預測判別是否為同一個人根據美國人口調查局的估計，截至到2013年1月4日，全世界有70.57億人。美國人口調查局的資料顯示全球人口在2012年3月12日突破70億；而聯合國人

Scikit-learn 筆記之 LogisticRegression

LogisticRegression class sklearn.linear_model.LogisticRegression(penalty=’l2’, dual=False, tol=0.00

SVM人臉識別分類案例（機器學習）

運用sklearn自帶的資料集做一個分類任務from sklearn.datasets import fetch_lfw_people faces = fetch_lfw_people(min_faces_per_person=60) fig, ax = plt.subplo

OpenCv 之（圖片人臉識別）和（攝像頭讀入）

先來張人臉識別效果圖： 1、概述人臉識別，是基於人的臉部特徵資訊進行身份識別的一種生物識別技術。用攝像機或攝像頭採集含有人臉的影象或視訊流，並自動在影象中檢測和跟蹤人臉，進而對檢測到的人臉進行臉部的一系列相關技術，通常也叫做人像識別、面部識別。

基於PCA的人臉識別步驟

人臉識別是一個有監督學習過程，首先利用訓練集構造一個人臉模型，然後將測試集與訓練集進行匹配，找到與之對應的訓練集頭像。最容易的方式是直接利用歐式距離計算測試集的每一幅影象與訓練集的每一幅影象的距離，然後選擇距離最近的影象作為識別的結果。這種直接計算距離的方式直觀，但是有一

SVM人臉識別

SVM在中等維度的分類問題中，有較好的表現，其在某種程度上構建了一個簡單的網路結構，類似於神經網路中的RBF神經網路。人臉資料集是經典的分類和聚類問題中經常使用的資料集，維度相對不高，灰度影象，這裡選用64*64的人臉影象，將其reshape從1*64^2的一維陣列，共4

scikit-learn學習之K-means聚類演算法與 Mini Batch K-Means演算法

======================================================================本系列部落格主要參考 Scikit-Learn 官方網站上的每一個演算法進行，並進行部分翻譯，如有錯誤，請大家指正轉載請註明出

機器學習實戰例項之手寫數字識別（KNN、python3）

from numpy import * from os import listdir import operator def img2Vector(filename): returnVecter = zeros((1,1024)) fr = open(fil

scikit-learn學習之迴歸分析

======================================================================本系列部落格主要參考 Scikit-Learn 官方網站上的

PCA的人臉識別（含matlab程式碼）

在讀完Baback Moghaddam大神的論文之後，我們來講下具體的程式碼實現。我們以人臉的識別為例子，講述下具體的實現。（1）首先我們需要有人臉的資料集，在這裡對應每個人只有一個照片在資料集中

scikit-learn學習之K-means聚類演算法與 Mini Batch K-Means演算法 [轉自別的作者，還有其他sklearn翻譯]

http://blog.csdn.net/gamer_gyt/article/details/51244850 ====================================================================== 本系列部落格主要

Scikit-learn例項之Pca+Svm人臉識別(AT&T資料集)

相關推薦