使用Python調整圖像大小

可以說，每一個“使用計算機的人”都需要在某個時間點調整圖像的大小。macOS的預覽版可以做到，windowsPowerToys也可以。

本文使用Python/ target=_blank class=infotextkey>Python來調整圖像大小，幸運的是，圖像處理和命令行工具是Python的兩個特長。

本文旨在向你展示三件事：

圖像的基本概念。
用于操作圖像的Python庫。
你可以在自己的項目中使用本文的代碼。

我們要構建的命令行程序可以一次調整一個或多個圖像文件的大小。

創建圖像

在這個例子中，我們將創建我們自己的圖像，而不是找到一個真正的圖像來操縱。

為什么？事實上，創造圖像是一個很好的方式來說明一個圖像實際上是什么。這個調整大小的程序在Instagram上也同樣適用。

那么，什么是圖像？在Python數據術語中，圖像是int元組的列表。

image = list[list[tuple[*int, float]]]

NumPy的定義是一個二維形狀數組 (h, w, 4)，其中h表示高的像素數（上下），w表示寬的像素數（從左到右）。

換句話說，圖像是像素列表（行）的列表（整個圖像）。每個像素由3個整數和1個可選浮點數組成：紅色通道、綠色通道、藍色通道、alpha（浮點可選）。紅色、綠色、藍色通道（RGB）的值從0到255。

從現在開始，我們將討論沒有alpha通道的彩色圖像，以保持簡單。Alpha是像素的透明度。圖像也只能有一個值從0到255的通道。這就是灰度圖像，也就是黑白圖像。在這里我們使用彩色圖像！

import matplotlib as plt
pixel: tuple = (200, 100, 150)
plt.imshow([[list(pixel)]])

用純Python制作圖像

Python完全能夠創建圖像。要顯示它，我將使用matplotlib庫，你可以使用它安裝：

pip install matplotlib

創建像素：

from dataclasses import dataclass
@dataclass
class Pixel:
  red: int
  green: int
  blue: int
  # alpha: float = 1
pixel = Pixel(255,0,0)
pixel# returns: 
# Pixel(red=255, green=0, blue=0, alpha=1)

創建圖像：

from __future__ import annotations
from dataclasses import dataclass, astuple
from itertools import cycle
from typing import List
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
@dataclass
class Pixel:
  red: int
  green: int
  blue: int
  # alpha: float = 1
pixel = Pixel(255,0,0)
pixel
marigold: Pixel = Pixel(234,162,33)
red: Pixel = Pixel(255,0,0)
Image = List[List[Pixel]]def create_image(*colors: Pixel, blocksize: int = 10, squaresize: int = 9) -> Image:
  """ 用可配置的像素塊制作一個正方形圖像(寬度和高度相同).
  Args:
      colors (Pixel): 可迭代的顏色呈現順序的參數。
      blocksize (int, optional): [description]. 默認10.
      squaresize (int, optional): [description]. 默認9.
  Returns:
      Image: 一幅漂亮的正方形圖片!
  """
  img: list = []
  colors = cycle(colors)  for row in range(squaresize):    row: list = []
    for col in range(squaresize):      color = next(colors) # 設置顏色      for _ in range(blocksize):        values: list[int] = list(astuple(color))
        row.Append(values)    [img.append(row) for _ in range(squaresize)] # 創建行高  return imgif __name__ == '__main__':
  image = create_image(marigold, red)  plt.imshow(image)

這就是渲染的圖像。在背后，數據是這樣的：

[[[234, 162, 33],
  [234, 162, 33],
  [234, 162, 33],
  [234, 162, 33],
  [234, 162, 33],
  [234, 162, 33],
  [234, 162, 33],
  [234, 162, 33],
  [234, 162, 33],
  [234, 162, 33],
  [255, 0, 0],
  [255, 0, 0],
  [255, 0, 0],
  [255, 0, 0],
  [255, 0, 0],
  [255, 0, 0],
  [255, 0, 0],
  [255, 0, 0],
  [255, 0, 0],
  [255, 0, 0],
  [234, 162, 33],
  ...

現在我們有了一個圖像，讓我們調整它的大小！

在Python中調整大小

在Python中編寫調整圖像大小的算法實際上有很多的工作量。

在圖像處理算法中有很多內容，有些人為此貢獻了十分多的工作。例如重采樣——在縮小后的圖像中使用一個像素來代表周圍的高分辨率像素。圖像處理是一個巨大的話題。如果你想親眼看看，看看Pillow的Image.py，它在路徑path/to/site-packages/PIL中。

這中間還有一些優化，比如抗鋸齒和減少間隙…這里的內容非常多。我們是站在巨人的肩膀上，可以用一行代碼來解決我們的問題。

如果你有興趣了解更多有關處理圖像時幕后發生的事情，我鼓勵你更多地查看“機器視覺”主題！這絕對是一個蓬勃發展的領域。

做得足夠好，就會有很多公司愿意為你的計算機視覺專業知識付出最高的代價。自動駕駛，IOT，監視，你命名它；所有基本上依賴于處理圖片（通常在Python或C++）。

一個很好的起點是查看scikit image。

OpenCV

OpenCV可以用來作圖像處理。他使用C++編寫并移植到了Python

import cv2
def resize(fp: str, scale: Union[float, int]) -> np.ndarray:
    """ 調整圖像大小，保持其比例
    Args:
        fp (str): 圖像文件的路徑參數
        scale (Union[float, int]): 百分比作為參數。如：53
    Returns:
        image (np.ndarray): 按比例縮小的圖片
    """    
    _scale = lambda dim, s: int(dim * s / 100)
    im: np.ndarray = cv2.imread(fp)    width, height, channels = im.shape    new_width: int = _scale(width, scale)
    new_height: int = _scale(height, scale)
    new_dim: tuple = (new_width, new_height)    return cv2.resize(src=im, dsize=new_dim, interpolation=cv2.INTER_LINEAR)

interpolation參數的選項是cv2包中提供的flags之一：

INTER_NEAREST – 近鄰插值
INTER_LINEAR – 雙線性插值(默認使用)
INTER_AREA – 利用像素區域關系重新采樣。它可能是圖像抽取的首選方法。但是當圖像被縮放時，它類似于INTER_NEAREST方法。
INTER_CUBIC – 一個大于4×4像素鄰域的雙三次插值
INTER_LANCZOS4 – 一個大于8×8像素鄰域的Lanczos插值

返回后：

resized = resize("checkers.jpg", 50)
print(resized.shape)plt.imshow(resized) # 也可以使用 cv2.imshow("name", image)

它做了我們所期望的。圖像從900像素高，900像素寬，到450×450（仍然有三個顏色通道）。因為Jupyter Lab的matplotlib著色，上面的屏幕截圖看起來不太好。

Pillow

pillow庫在Image類上有一個調整大小的方法。它的參數是：

size: (width, height)
resample: 默認為BICUBIC. 重采樣算法需要的參數。
box: 默認為None。為一個4元組，定義了在參數(0,0，寬度，高度)內工作的圖像矩形。
reducing_gap: 默認為None。重新采樣優化算法，使輸出看起來更好。

以下是函數：

from PIL import Image
def resize(fp: str, scale: Union[float, int]) -> np.ndarray:
    """ 調整圖像大小，保持其比例
    Args:
        fp (str): 圖像文件的路徑參數
        scale (Union[float, int]): 百分比作為參數。如：53
    Returns:
        image (np.ndarray): 按比例縮小的圖片
    """
    _scale = lambda dim, s: int(dim * s / 100)
    im = Image.open(fp)    width, height = im.size    new_width: int = _scale(width, scale)
    new_height: int = _scale(height, scale)
    new_dim: tuple = (new_width, new_height)    return im.resize(new_dim)

使用Pillow 的函數與OpenCV非常相似。唯一的區別是PIL.Image.Image類具有用于訪問圖像（寬度、高度）的屬性大小。

結果是：

resized = resize("checkers.jpg", 30.5)
print(resized.size)resized.show("resized image", resized)

請注意show方法如何打開操作系統的默認程序以查看圖像的文件類型。

創建命令行程序

現在我們有了一個調整圖像大小的函數，現在是時候讓它有一個運行調整大小的用戶界面了。

調整一個圖像的大小是可以的。但我們希望能夠批量處理圖像。

我們將要構建的接口將是最簡單的接口：命令行實用程序。

Pallets項目是Flask背后的天才社區，是一個Jinja模板引擎：Click(https://click.palletsprojects.com/en/7.x/。)

pip install click

Click是一個用于制作命令行程序的庫。這比使用普通的argparse或在if __name__ == '__main__':中啟動一些if-then邏輯要好得多。所以，我們將使用Click來裝飾我們的圖像調整器。

下面是從命令行調整圖像大小的完整腳本！

""" resize.py
"""
from __future__ import annotations
import osimport globfrom pathlib import Path
import sysimport clickfrom PIL import Image
"""
文檔:
    https://pillow.readthedocs.io/en/5.1.x/handbook/image-file-formats.html
"""
SUPPORTED_FILE_TYPES: list[str] = [".jpg", ".png"]
def name_file(fp: Path, suffix) -> str:
    return f"{fp.stem}{suffix}{fp.suffix}"
def resize(fp: str, scale: Union[float, int]) -> Image:
    """ 調整圖像大小，保持其比例
    Args:
        fp (str): 圖像文件的路徑參數
        scale (Union[float, int]): 百分比作為參數。如：53
    Returns:
        image (np.ndarray): 按比例縮小的圖片
    """
    _scale = lambda dim, s: int(dim * s / 100)
    im: PIL.Image.Image = Image.open(fp)    width, height = im.size    new_width: int = _scale(width, scale)    new_height: int = _scale(height, scale)    new_dim: tuple = (new_width, new_height)    return im.resize(new_dim)
@click.command()@click.option("-p", "--pattern")
@click.option("-s", "--scale", default=50, help="Percent as whole number to scale. eg. 40")
@click.option("-q", "--quiet", default=False, is_flag=True, help="Suppresses stdout.")
def main(pattern: str, scale: int, quiet: bool):
    for image in (images := Path().glob(pattern)):
        if image.suffix not in SUPPORTED_FILE_TYPES:
            continue
        im = resize(image, scale)        nw, nh = im.size        suffix: str = f"_{scale}_{nw}x{nh}"
        resize_name: str = name_file(image, suffix)        _dir: Path = image.absolute().parent        im.save(_dir / resize_name)        if not quiet:
            print(                f"resized image saved to {resize_name}.")
    if images == []:
        print(f"No images found at search pattern '{pattern}'.")
        returnif __name__ == '__main__':
    main()

命令行程序從入口點函數main運行。參數通過傳遞給click.option選項：

pattern采用字符串形式來定位與腳本運行的目錄相關的一個或多個圖像。--pattern="../catpics/*.png將向上一級查找catpics文件夾，并返回該文件夾中具有.png圖像擴展名的所有文件。
scale接受一個數字、浮點或整數，并將其傳遞給resize函數。這個腳本很簡單，沒有數據驗證。如果你添加到代碼中，檢查比例是一個介于5和99之間的數字（合理的縮小比例參數）。你可以通過-s "chicken nuggets"進行設置。
如果不希望在程序運行時將文本輸出到標準流，則quiet是一個選項參數。

從命令行運行程序：

python resize.py -s 35 -p "./*jpg"

結果：

$ py resize.py -p "checkers.jpg" -s 90
resized image saved to checkers_90_810x810.jpg.

正在檢查文件夾：

$ ls -lh checkers*
-rw-r--r-- 1 nicho 197609 362K Aug 15 13:13 checkers.jpg
-rw-r--r-- 1 nicho 197609 231K Aug 15 23:56 checkers_90_810x810.jpg

不錯！所以程序縮小了圖像，給了它一個描述性的標簽，我們可以看到文件大小從362KB到231KB！

為了查看程序同時處理多個文件，我們將再次運行它：

$ py resize.py --pattern="checkers*" --scale=20
resized image saved to checkers_20_180x180.jpg.
resized image saved to checkers_90_810x810_20_162x162.jpg.

文件系統輸出：

$ ll -h checkers*
-rw-r--r-- 1 nicho 197609 362K Aug 15 13:13 checkers.jpg
-rw-r--r-- 1 nicho 197609 1.8K Aug 16 00:23 checkers_20_180x180.jpg
-rw-r--r-- 1 nicho 197609 231K Aug 15 23:56 checkers_90_810x810.jpg
-rw-r--r-- 1 nicho 197609 1.8K Aug 16 00:23 checkers_90_810x810_20_162x162.jpg

只要匹配到了模式，遞歸可以處理任意數量的圖像。

Click

Click 是一個神奇的工具。它可以包裝一個函數并在一個模塊中以“正常的方式”從一個if __name__ == '__main__'語句運行。（實際上，它甚至不需要這樣做；你只需定義和裝飾要運行的函數即可），但它真正的亮點在于將腳本作為包安裝。

這是通過Python附帶的setuptools庫完成的。

這是我的setup.py.

from setuptools import setup
setup(    name='resize',
    version='0.0.1',
    py_modules=['resize'],
    install_requires=[        'click',
        'pillow',
    ],    entry_points='''
        [console_scripts]
        resize=resize:main
    '''
)

使用以下命令生成可執行文件/包裝包：