
Cropping Images with OpenCV Mouse Events

Posted at 2021-11-24

Introduction

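I wrote an image-cropping program using OpenCV's mouse event handling.
A left click shows the crop position and its coordinates, a right click performs the crop,
and the ESC key exits the program. A timeout can also be specified.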
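
The "quit on ESC or on timeout" behavior boils down to polling a condition until a deadline. The following is a minimal sketch of that pattern that runs without a window. The name `wait_for` and its parameters are hypothetical (they do not appear in the program), and `predicate` stands in for the ESC-key check.

```python
import time


def wait_for(predicate, time_out, poll_interval=0.02):
    """Poll predicate() until it returns True or time_out seconds pass.

    Returns True if the predicate fired, False on timeout.
    """
    # time.monotonic() is not affected by system clock adjustments.
    deadline = time.monotonic() + time_out
    while time.monotonic() < deadline:
        if predicate():
            return True
        time.sleep(poll_interval)
    return False
```

In the GUI program the predicate would be something like `lambda: cv2.waitKey(20) & 0xFF == 27`. Since `cv2.waitKey` already waits, `poll_interval` could then be set to 0.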

Environment

Anaconda
Windows 10 Pro
Python 3.9
OpenCV 4.5

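References:

Template 1 for this program (coordinate capture): Getting the mouse click position with Python and OpenCV
Template 2 for this program (drawing): [Python] Drawing directly with the mouse using OpenCV - setMouseCallback()
Reference for combining OpenCV mouse events: Capturing mouse events with OpenCV: building a GUI concentration-line tool
A good overview of OpenCV mouse events: Image Information Processing Laboratory
[OpenCV] How to use the cv2.putText function (drawing text)
Reference for image cropping: Image processing with Python and NumPy (loading, arithmetic, saving)
Clear explanation of coordinate handling: [Python] Pixel access and cropping with OpenCV (for beginners)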

The Program

The program is explained through comments in the code.


import time

import cv2

def pos_est_rect(file_name, wh=100, time_out=120):
    """trimming position estimation

    Args:
        file_name (str): path of the image file to load.
        wh (int, optional): side length of the square crop in pixels. Defaults to 100.
        time_out (int, optional): time out [second]. Defaults to 120.
    """
    start = time.time()

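    img = cv2.imread(file_name)
    if img is None:
        # cv2.imread returns None (no exception) when the file cannot be read.
        raise FileNotFoundError(f'Could not read image: {file_name}')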
    
    def printCoor(event,x,y,flags,param):
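        # An OpenCV mouse callback takes these five arguments.

        # nonlocal gives access to variables of the enclosing function.
        # (It is only required for rebinding; reading them works without it.)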
        nonlocal img
        nonlocal img_mes
        nonlocal file_name

        if event == cv2.EVENT_LBUTTONDOWN:
            # Drawing on the original image would leave earlier drawings in place, so work on a copy.

            img_tmp = img_mes.copy()

            # To draw the square with four lines instead:
            # cv2.line(img_tmp,(x,y),(x+wh,y),(255,255,255),1)
            # cv2.line(img_tmp,(x,y),(x,y+wh),(255,255,255),1)
            # cv2.line(img_tmp,(x+wh,y),(x+wh,y+wh),(255,255,255),1)
            # cv2.line(img_tmp,(x,y+wh),(x+wh,y+wh),(255,255,255),1)
            
            cv2.rectangle(img_tmp,(x,y),(x+wh,y+wh),(255,255,255), thickness=1)

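            # The origin is the top-left corner. x runs left to right, y runs top to bottom.
            # In the array, rows (height) correspond to y and columns (width) to x.
            # org is the bottom-left corner of the text.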
            cv2.putText(img_tmp, text=f'(x,y):({x},{y})',org=(x, y-10), fontFace=cv2.FONT_HERSHEY_SIMPLEX,
                        fontScale=0.5, color=(255,255,255),thickness=1,lineType=cv2.LINE_4)

            print(f'start x:{x}, y:{y} --wh:{wh}-- end x:{x+wh}, y:{y+wh}')

            cv2.imshow('image',img_tmp)
            
        elif event == cv2.EVENT_RBUTTONDOWN:
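            # cv2.imshow('image',img)
            # Crop a wh x wh region whose top-left corner is the right-click position (x, y)
            # and save it next to the input file as <name>_trim.jpg.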
            idx = file_name.rindex('.')
            trim_name = f'{file_name[:idx]}_trim.jpg'
            trim_array = trim(array=img, x=x, y=y, width=wh, height=wh)
            cv2.imwrite(trim_name, trim_array)


    h,w,_ =  img.shape  
    img_mes = img.copy() 
    print(img.shape)
    print('Quit -> ESC Key ')

    cv2.namedWindow('image',cv2.WINDOW_NORMAL)
    cv2.setMouseCallback('image',printCoor)
    cv2.moveWindow('image',100,100)  # (100, 100) is the window position on the screen.
    cv2.putText(img_mes, text='Quit -> ESC Key',org=(5,10), fontFace=cv2.FONT_HERSHEY_SIMPLEX,
                        fontScale=0.3, color=(255,255,255),thickness=1,lineType=cv2.LINE_4)

    # cv2.imshow('image',img)
    cv2.imshow('image',img_mes)
    # Calling imshow with the same window name redraws that window; a different name opens a new window.

    while True:
        elapsed_time = time.time() - start

        # 27 is the key code of ESC.
        if cv2.waitKey(20) & 0xFF == 27:
            break

        if elapsed_time > time_out:
            # Exit with a timeout
            print('time out')
            break
        
    cv2.destroyAllWindows()


def trim(array, x, y, width, height):
    """
    Crop a region given its upper-left corner and its width / height.
    array.shape is ordered (rows, columns), i.e. (y, x) in terms of xy coordinates.
    The origin is at the upper left.

    Args:
        array (ndarray): image data
        x (int): start x
        y (int): start y
        width (int): width
        height (int): height

    Returns:
        [ndarray]: trim image
    
    Example:
    im_trim2 = trim(im, 128, 192, 256, 128)
    # (128, 256, 3)

    Ref:
    https://note.nkmk.me/python-numpy-image-processing/

    """
    # Rows (y) come first, then columns (x); copy so the result does not share memory with the input.
    array_trim = array[y:y + height, x:x + width].copy()

    print(f'Original h(Y), w(X) : {array.shape}')
    print(f'Trimmed h(Y), w(X) :  {array_trim.shape}')

    return array_trim
    
if __name__ == '__main__':
    # The Lena image is used as the test sample.
    pos_est_rect(file_name='lena.jpg', time_out=120)
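
The program builds the output name with `file_name.rindex('.')`, which raises `ValueError` when the path contains no dot and picks the wrong dot when only a directory name contains one. As a possible alternative, here is a small sketch based on the standard library's `os.path.splitext`. The helper name `trimmed_name` and its `suffix` / `ext` parameters are hypothetical and are not part of the program above.

```python
import os


def trimmed_name(file_name, suffix='_trim', ext='.jpg'):
    """Return the output path for a cropped copy of file_name."""
    # splitext only looks at the last path component, so dots in
    # directory names are ignored and a missing extension is not an error.
    root, _old_ext = os.path.splitext(file_name)
    return f'{root}{suffix}{ext}'


print(trimmed_name('lena.jpg'))
print(trimmed_name('lena'))
```

With this helper the right-click branch could call `cv2.imwrite(trimmed_name(file_name), trim_array)`.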

Results

Screenshot while the program is running

Left click: show the crop position
Right click: perform the crop
ESC: quit
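
In the program above, the right click crops at the pointer position of the right click itself, so the rectangle shown on left click and the saved region coincide only if the mouse is not moved between the two clicks. One possible variation is to remember the last left-click position and crop there. The sketch below shows only that state handling. The class `CropSelector` is hypothetical, and the two event constants are hard-coded to the values of `cv2.EVENT_LBUTTONDOWN` and `cv2.EVENT_RBUTTONDOWN` so that the sketch runs without OpenCV.

```python
# Values of cv2.EVENT_LBUTTONDOWN and cv2.EVENT_RBUTTONDOWN, hard-coded so the
# sketch runs without OpenCV; in a real callback compare against the cv2 names.
EVENT_LBUTTONDOWN = 1
EVENT_RBUTTONDOWN = 2


class CropSelector:
    """Hypothetical helper: remembers the last left click, crops on right click."""

    def __init__(self, wh=100):
        self.wh = wh
        self.anchor = None  # (x, y) of the most recent left click
        self.crops = []     # rectangles requested so far, as (x, y, w, h)

    def on_mouse(self, event, x, y, flags, param):
        # Same five-argument signature that cv2.setMouseCallback expects.
        if event == EVENT_LBUTTONDOWN:
            self.anchor = (x, y)
        elif event == EVENT_RBUTTONDOWN and self.anchor is not None:
            # Ignore where the right click happened; use the stored position.
            ax, ay = self.anchor
            self.crops.append((ax, ay, self.wh, self.wh))
```

A bound method such as `selector.on_mouse` could be passed to `cv2.setMouseCallback('image', selector.on_mouse)`, and each recorded rectangle could then be handed to `trim`.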

Cropped result
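
Note that NumPy slicing such as `array[y:y + height, x:x + width]` silently stops at the array edge, so a click near the right or bottom border produces a crop smaller than `wh` x `wh` rather than an error. If a negative coordinate were ever passed, it would be interpreted as counting from the end of the axis. A small sketch of clipping the rectangle explicitly before slicing follows; the function `clamp_rect` is a hypothetical helper, not part of the program.

```python
def clamp_rect(x, y, width, height, img_w, img_h):
    """Return (x0, y0, x1, y1) of the requested rectangle clipped to the image."""
    # Clip the top-left corner into [0, img_w] x [0, img_h].
    x0 = max(0, min(x, img_w))
    y0 = max(0, min(y, img_h))
    # Clip the bottom-right corner and keep it at or beyond the top-left one.
    x1 = max(x0, min(x + width, img_w))
    y1 = max(y0, min(y + height, img_h))
    return x0, y0, x1, y1
```

For an image loaded with `cv2.imread`, this could be used as `x0, y0, x1, y1 = clamp_rect(x, y, wh, wh, img.shape[1], img.shape[0])` followed by `img[y0:y1, x0:x1]`. Comparing `x1 - x0` and `y1 - y0` with `wh` then tells whether the crop was cut short.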

Input image (save this image and use it when you run the program.)

Summary

This program combines the following elements:

OpenCV mouse events
Rectangle and text drawing
Cropping
Image loading and saving

I hope it is of some help to you.
