java影象處理：灰度化，二值化，降噪，切割，裁剪，識別，找相似等

阿新 • • 發佈：2019-01-17

前段時間做爬蟲，涉及到對圖片驗證碼的破解，這裡羅列一些常用的影象處理方法，都很簡單並沒用到什麼複雜的演算法，所以不涉及opencv，都是一些直接對rgb畫素點的操作，很簡單也很好理解，至於識別直接用的tesseract-ocr，也可以用svm。（ps:圖片的畫素值矩陣的原點在左上角，上邊是x軸，左邊是y軸）

1、灰度化和二值化，即把彩色圖片經過灰度化和二值化變成只有黑白(只有0,1的矩陣)的資料，便於後續對圖片的處理

        public static BufferedImage grayImage(BufferedImage bufferedImage) throws Exception {

		int width = bufferedImage.getWidth();
		int height = bufferedImage.getHeight();

		BufferedImage grayBufferedImage = new BufferedImage(width, height, bufferedImage.getType());
		for (int i = 0; i < bufferedImage.getWidth(); i++) {
			for (int j = 0; j < bufferedImage.getHeight(); j++) {
				final int color = bufferedImage.getRGB(i, j);
				final int r = (color >> 16) & 0xff;
				final int g = (color >> 8) & 0xff;
				final int b = color & 0xff;
				int gray = (int) (0.3 * r + 0.59 * g + 0.11 * b);
				int newPixel = colorToRGB(255, gray, gray, gray);
				grayBufferedImage.setRGB(i, j, newPixel);
			}
		}

		return grayBufferedImage;

	}

	/**
	 * 顏色分量轉換為RGB值
	 * 
	 * @param alpha
	 * @param red
	 * @param green
	 * @param blue
	 * @return
	 */
	private static int colorToRGB(int alpha, int red, int green, int blue) {

		int newPixel = 0;
		newPixel += alpha;
		newPixel = newPixel << 8;
		newPixel += red;
		newPixel = newPixel << 8;
		newPixel += green;
		newPixel = newPixel << 8;
		newPixel += blue;

		return newPixel;

	}

	public static BufferedImage binaryImage(BufferedImage image) throws Exception {
		int w = image.getWidth();  
	    int h = image.getHeight();  
	    float[] rgb = new float[3];  
	    double[][] zuobiao = new double[w][h];  
	    int black = new Color(0, 0, 0).getRGB();  
	    int white = new Color(255, 255, 255).getRGB();  
	    BufferedImage bi= new BufferedImage(w, h,  
	            BufferedImage.TYPE_BYTE_BINARY);;  
	    for (int x = 0; x < w; x++) {  
	        for (int y = 0; y < h; y++) {  
	            int pixel = image.getRGB(x, y);   
	            rgb[0] = (pixel & 0xff0000) >> 16;  
	            rgb[1] = (pixel & 0xff00) >> 8;  
	            rgb[2] = (pixel & 0xff);  
	            float avg = (rgb[0]+rgb[1]+rgb[2])/3;  
	            zuobiao[x][y] = avg;      
	              
	        }  
	    }

            //這裡是閾值，白底黑字還是黑底白字，大多數情況下建議白底黑字，後面都以白底黑字為例
	    double SW = 192;  
	    for (int x = 0; x < w; x++) {  
	        for (int y = 0; y < h; y++) {  
	            if (zuobiao[x][y] < SW) {  
	                bi.setRGB(x, y, black);  
	            }else{  
	                bi.setRGB(x, y, white);  
	            }  
	        }             
	    }  
	    
	    
	    
	    return bi;
	}

// 自己加周圍8個灰度值再除以9，算出其相對灰度值
	public static double getGray(double[][] zuobiao, int x, int y, int w, int h) {
		double rs = zuobiao[x][y] + (x == 0 ? 255 : zuobiao[x - 1][y]) + (x == 0 || y == 0 ? 255 : zuobiao[x - 1][y - 1])
				+ (x == 0 || y == h - 1 ? 255 : zuobiao[x - 1][y + 1]) + (y == 0 ? 255 : zuobiao[x][y - 1])
				+ (y == h - 1 ? 255 : zuobiao[x][y + 1]) + (x == w - 1 ? 255 : zuobiao[x + 1][y])
				+ (x == w - 1 || y == 0 ? 255 : zuobiao[x + 1][y - 1])
				+ (x == w - 1 || y == h - 1 ? 255 : zuobiao[x + 1][y + 1]);
		return rs / 9;
	}

2、降噪（領域檢測法），針對二值化後的圖片來說(白底黑字)，噪點就是圖片中一堆密集黑色畫素點中少許白色畫素點(字型裡的白點或背景裡的黑點)，這時，解決方法可以遍歷畫素點，一個畫素點周圍有8個畫素點，如果這個畫素點周圍有6個以上畫素點是黑色就可以把這個畫素點也認為是黑色：

/**
	 * 降噪，以1個畫素點為單位（實際使用中可以迴圈降噪，或者把單位可以擴大為多個畫素點）
	 * @param image
	 * @return
	 */
	public static BufferedImage denoise(BufferedImage image){
		int w = image.getWidth();  
	    int h = image.getHeight();
	    int white = new Color(255, 255, 255).getRGB();  

	    if(isWhite(image.getRGB(1, 0)) && isWhite(image.getRGB(0, 1)) && isWhite(image.getRGB(1, 1))){
	    	image.setRGB(0,0,white);
        }
	    if(isWhite(image.getRGB(w-2, 0)) && isWhite(image.getRGB(w-1, 1)) && isWhite(image.getRGB(w-2, 1))){
	    	image.setRGB(w-1,0,white);
        }
	    if(isWhite(image.getRGB(0, h-2)) && isWhite(image.getRGB(1, h-1)) && isWhite(image.getRGB(1, h-2))){
	    	image.setRGB(0,h-1,white);
        }
	    if(isWhite(image.getRGB(w-2, h-1)) && isWhite(image.getRGB(w-1, h-2)) && isWhite(image.getRGB(w-2, h-2))){
	    	image.setRGB(w-1,h-1,white);
        }
	    
	    for(int x = 1; x < w-1; x++){
	    	int y = 0;
	    	if(isBlack(image.getRGB(x, y))){
            	int size = 0;
                if(isWhite(image.getRGB(x-1, y))){
                	size++;
                }
                if(isWhite(image.getRGB(x+1, y))){
                	size++;
                }
                if(isWhite(image.getRGB(x, y+1))){
                	size++;
                }
                if(isWhite(image.getRGB(x-1, y+1))){
                	size++;
                }
                if(isWhite(image.getRGB(x+1, y+1))){
                	size++;
                } 
                if(size>=5){
                	image.setRGB(x,y,white);                     
                }
            }
	    }
	    for(int x = 1; x < w-1; x++){
	    	int y = h-1;
	    	if(isBlack(image.getRGB(x, y))){
            	int size = 0;
                if(isWhite(image.getRGB(x-1, y))){
                	size++;
                }
                if(isWhite(image.getRGB(x+1, y))){
                	size++;
                }
                if(isWhite(image.getRGB(x, y-1))){
                	size++;
                }
                if(isWhite(image.getRGB(x+1, y-1))){
                	size++;
                }
                if(isWhite(image.getRGB(x-1, y-1))){
                	size++;
                }
                if(size>=5){
                	image.setRGB(x,y,white);                     
                }
            }
	    }
	    
	    for(int y = 1; y < h-1; y++){
	    	int x = 0;
	    	if(isBlack(image.getRGB(x, y))){
            	int size = 0;
                if(isWhite(image.getRGB(x+1, y))){
                	size++;
                }
                if(isWhite(image.getRGB(x, y+1))){
                	size++;
                }
                if(isWhite(image.getRGB(x, y-1))){
                	size++;
                }
                if(isWhite(image.getRGB(x+1, y-1))){
                	size++;
                }
                if(isWhite(image.getRGB(x+1, y+1))){
                	size++;
                } 
                if(size>=5){
                	image.setRGB(x,y,white);                     
                }
            }
	    }
	    
	    for(int y = 1; y < h-1; y++){
	    	int x = w - 1;
	    	if(isBlack(image.getRGB(x, y))){
            	int size = 0;
            	if(isWhite(image.getRGB(x-1, y))){
                	size++;
                }
                if(isWhite(image.getRGB(x, y+1))){
                	size++;
                }
                if(isWhite(image.getRGB(x, y-1))){
                	size++;
                }
                //斜上下為空時，去掉此點
                if(isWhite(image.getRGB(x-1, y+1))){
                	size++;
                }
                if(isWhite(image.getRGB(x-1, y-1))){
                	size++;
                }
                if(size>=5){
                	image.setRGB(x,y,white);                     
                }
            }
	    }
	    
		//降噪，以1個畫素點為單位
    	for(int y = 1; y < h-1; y++){
            for(int x = 1; x < w-1; x++){                   
                if(isBlack(image.getRGB(x, y))){
                	int size = 0;
                    //上下左右均為空時，去掉此點
                    if(isWhite(image.getRGB(x-1, y))){
                    	size++;
                    }
                    if(isWhite(image.getRGB(x+1, y))){
                    	size++;
                    }
                    //上下均為空時，去掉此點
                    if(isWhite(image.getRGB(x, y+1))){
                    	size++;
                    }
                    if(isWhite(image.getRGB(x, y-1))){
                    	size++;
                    }
                    //斜上下為空時，去掉此點
                    if(isWhite(image.getRGB(x-1, y+1))){
                    	size++;
                    }
                    if(isWhite(image.getRGB(x+1, y-1))){
                    	size++;
                    }
                    if(isWhite(image.getRGB(x+1, y+1))){
                    	size++;
                    } 
                    if(isWhite(image.getRGB(x-1, y-1))){
                    	size++;
                    }
                    if(size>=8){
                    	image.setRGB(x,y,white);                     
                    }
                }
            }
        }
	    
	    return image;
	}
	
	 public static boolean isBlack(int colorInt)  
     {  
         Color color = new Color(colorInt);  
         if (color.getRed() + color.getGreen() + color.getBlue() <= 300)  
         {  
             return true;  
         }  
         return false;  
     }  

     public static boolean isWhite(int colorInt)  
     {  
         Color color = new Color(colorInt);  
         if (color.getRed() + color.getGreen() + color.getBlue() > 300)  
         {  
             return true;  
         }  
         return false;  
     }  
     
     public static int isBlack(int colorInt, int whiteThreshold) {
 		final Color color = new Color(colorInt);
 		if (color.getRed() + color.getGreen() + color.getBlue() <= whiteThreshold) {
 			return 1;
 		}
 		return 0;
 	}

3、切割字串，比如識別驗證碼時，可能需要將每個字元給單獨切割下來進行單獨識別，準確率會高得多，如果驗證碼字元之間粘連度不高，可以使用最簡單的投影法，即，x軸上的每一列有多少個黑色畫素點，這樣形成了一個數組，針對這個陣列可以想象成一個折線圖，如果圖片沒有粘連那最理想的效果是這樣：

，

如果有粘連，那就是一個有波峰波谷的折線圖，只有通過演算法找到波峰波谷來切割了；同理，對y軸也可以進行投影。opencv的話，還有個常用的輪廓法，需要的可以瞭解下（吐槽：java版的opencv和c++版本的api稍有不同，這個只有看官方文件了）

4、裁剪，把上一步的畫素點區域找到(原點+寬高)即可裁剪。

5、識別，如果字型扭曲的不是很嚴重的話，推薦直接使用tesseract-ocr簡單方便，java包好像叫tess4j，也可以自己用svm；字型扭曲嚴重的話，只有自己寫演算法了，這種情況還是建議使用opencv來處理，有相應的處理方法的。

6、圖片相似度，像有些驗證碼扭曲的厲害確實破解不了的（md，難道要上神經網路喂幾萬張圖片去識別麼，拜託，我只是爬個數據而已，還是不費力氣了~~），

但是，我之前碰到過有些網站雖然破解不了，但是驗證碼圖片數量是固定的，比如裁判文書網，爬了幾千張下來才發現基本上都是重複的，之前的白弄了。其實可以直接hash的，但是為了擴充套件和複用，還是弄個圖片相似度，其實原理差的不多，還是圖片預處理後再進行hash，形成每張圖片獨有的指紋。

PS：簡單的可以自己處理畫素，也便於學習和加深理解，更復雜的建議還是使用opencv吧~~

這裡可以簡單說說我最開始怎麼處理裁判文書網的驗證碼的，準確率只有60%左右（大部分8，90%多的圖片能分割出來，識別就不行了）；字型，背景色，干擾線，噪點，顏色基本都不一樣，通過某些演算法區分這幾個區域（分類演算法很多，kmeans最簡單效果也行），然後挨著邊緣的是干擾線先去掉（需要與二值化後的進行對比確認干擾線），中間區域分類顏色最多的是字型，然後去掉其他區域，再二值化分割判斷。缺點：有些圖片干擾線和字型顏色很接近分不出來，去掉干擾線把字型的一部分也除掉了。大家還有沒有什麼好的方法

程式碼下載

java影象處理：灰度化，二值化，降噪，切割，裁剪，識別，找相似等

java影象處理：灰度化，二值化，降噪，切割，裁剪，識別，找相似等

影象處理：灰度化，二值化，反色

bmp點陣圖檔案：讀取、寫入、24位真彩轉8位灰度、灰度圖的二值化

影象灰度變換、二值化、直方圖

【OpenCV_09】平滑/模糊圖片 Smooth / Blur Images及彩色圖轉灰度圖和二值化

【數字影象】C++8位和24位BMP點陣圖的平滑、銳化、二值化處理，以及24位真彩圖的灰度化

java影象處理---灰度化（二值化）處理

Matlab影象處理轉灰度圖，繪製直方圖，直方圖均衡化

opencv 圖片基本處理（灰度化，反色，二值化，膨脹腐蝕以及ROI的copy等）

【數字影象處理】灰度直方圖、直方圖均衡化、直方圖規定化

影象處理-python實現影象處理（消噪，直方圖均衡化，二值化，形態學）

【數字影象處理】灰度變換函式（對數變換、反對數變換、冪次變換）

影象驗證碼識別（四）——灰度化和二值化

影象處理opencv2-灰度直方圖以及灰度直方圖均衡化

影象處理之灰度模糊影象與彩色清晰影象的變換

Matlab影象處理之灰度變換函式

例項說明影象的灰度化和二值化的區別

JAVA灰度化、二值化圖片如此簡單方便

C#下用Emgucv對圖片進行灰度化、二值化、邊緣檢測、膨脹腐蝕運算、霍夫變換進行表格識別

掃描儀掃描文件處理-ABBYY生成小體積黑白二值化PDF

java影象處理：灰度化，二值化，降噪，切割，裁剪，識別，找相似等

相關推薦