Module preprocessor.image
Images preprocessing is for all sorts of "picture" data, including medical imaging.
Images come in a wide variety of file formats (common formats such as PNG, JPEG, GIF, as well as DICOM medical imaging). Even within the same file format there can be varying characteristics of image files like dimensions and color depth that need to be normalized before using as training data or in order to work with an existing algorithm.
The ImagePreprocessor allows input data to be standardized for a training. This can include changes to color space (e.g. forcing all images to grayscale), resizing images, and ensuring the order of color bytes is uniform.
More complex image transformation is also possible via PyTorch image
manipulation commands.
See the ImagePreprocessorBuilder.torch_image_transforms()
and ImagePreprocessorBuilder.torch_tensor_transforms()
for more on this.
Additionally, labeling of image data for training can be tricky. To simplify this, the preprocessor can associate the image filename with a label column in a tabular dataset that was included with the image file when the dataset was packaged. For example, the labeling file might look like:
records.csv
"label", "paths"
"cat", "img_003.png"
"doc", "img_004.png"
Typical usage:
# Force images to a 128x128 RGB format
preprocessor = (
tb.ImagePreprocessor.builder()
.resize(128, 128)
.channels_first()
.target_column("label") # associate image with corresponding "label"
.target_dtype("int64")
.dtype("float32")
)
NOTE: The ImagePreprocessors can generate representations of data in multiple formats, such as numpy.ndarrays or a Torch.Dataset
Classes
class ImagePreprocessor (target_column: str, convert: str, resize: Tuple[int, int], channels_first: bool, dtype: str | None, dicom: bool, target_dtype: str | None, torch_image_transforms: List[str] | None, torch_tensor_transforms: List[str] | None, expand_target_dims: bool)
-
Subclasses
- preprocessor.image.ImageNumpyPreprocessor
- preprocessor.image.ImageTorchPreprocessor
Static methods
def builder() -> ImagePreprocessorBuilder
Instance variables
var target_column : str
-
Name of column in the image package containing a training target label
Returns
str
- A column name from the Package's record file.
Methods
def preprocess_image(self, img:
) -> numpy.ndarray -
Converts the image into a numpy.ndarray representation
Args
img
:Image
- The internal loaded representation of the image
Returns
np.ndarray
- The image as an ndarray
def preprocess_target(self, target) -> numpy.ndarray
class ImagePreprocessorBuilder
-
Utility for defining an image preprocessing pipeline
Ancestors
Methods
def channels_first(self, val: bool = True) -> ImagePreprocessorBuilder
-
Transposes images so channels are the first dimension of the output.
True
= (channel, width, height)False
= (width, height, channel)Args
val
- Explicitly sets the channels_first property, if blank default is
True
Returns
ImagePreprocessorBuilder
- This class instance, useful for chaining.
def convert(self, val: str) -> ImagePreprocessorBuilder
-
Sets the format used by Pillow – RGB, RGBA, etc.
When translating a color image to greyscale (mode "L"), the library uses the ITU-R 601-2 luma transform:
L = R * 299/1000 + G * 587/1000 + B * 114/1000
Args
val
- The target image format. Must be one of: "RGB", "L" (for luma, aka grayscale) or "CMYK".
Returns
ImagePreprocessorBuilder
- This class instance, useful for chaining.
def dicom(self, dicom: bool = False) -> ImagePreprocessorBuilder
-
Sets data format as DICOM rather than a typical image format.
Args
dicom
- Read the data as a medical image using the DICOM library.
If
False
, data will be read as an autodetected image format using the PIL image library (default behavior).
Returns
ImagePreprocessorBuilder
- This class instance, useful for chaining.
def dtype(self, dtype: str | None) -> ImagePreprocessorBuilder
-
Casts output numpy.ndarray to the given dtype.
Args
dtype
- The dtype that a numpy output will be cast into. If not set, the Protocol will choose. Ignored for non-numpy outputs.
Returns
ImagePreprocessorBuilder
- This class instance, useful for chaining.
def expand_target_dims(self, expand=True)
-
Sets target array expansion flag.
Args
expand
:bool
, optional- Target array expansion flag. Defaults to True.
Returns
ImagePreprocessorBuilder
- This class instance, useful for chaining.
def resize(self, width: int, height: int) -> ImagePreprocessorBuilder
-
Resizes images to the given dimensions.
Args
width
- The desired width of the image. (Default 32)
height
- The desired height of the image. (Default 32)
Returns
ImagePreprocessorBuilder
- This class instance, useful for chaining.
def target_column(self, column_name: str) -> ImagePreprocessorBuilder
-
Sets which column from the asset's record data to use as a target.
Args
column_name
- The name of the column to take as target information.
Returns
ImagePreprocessorBuilder
- This class instance, useful for chaining.
def target_dtype(self, target_dtype: str | None) -> ImagePreprocessorBuilder
-
Sets target data type for the output numpy.ndarray
Args
dtype
- The dtype that a target value will be cast into. If not set, the operation will select type. Ignored for non-numpy outputs.
Returns
ImagePreprocessorBuilder
- This class instance, useful for chaining.
def torch_image_transforms(self, transforms: List[str]) -> ImagePreprocessorBuilder
-
Create torch preprocessing pipeline utilizing torchvision library
Supported image transforms:
- CenterCrop
- ColorJitter
- FiveCrop
- Grayscale
- Pad
- RandomAffine
- RandomCrop
- RandomGrayscale
- RandomHorizontalFlip
- RandomPerspective
- RandomResizedCrop
- RandomRotation
- RandomSizedCrop
- RandomVerticalFlip
- Resize
- Scale
- TenCrop
Usage pattern:
flip = tb.TorchEncoder.encode( transforms.RandomHorizontalFlip(p=0.5) ) tb.ImagePreprocessor.builder().torch_image_transforms([flip])
See PyTorch documentation for more details.
Args
transforms
- list of transforms from torchvision.transforms library encoded using TorchEncoder
Returns
ImagePreprocessorBuilder
- This class instance, useful for chaining.
def torch_tensor_transforms(self, transforms: List[str]) -> ImagePreprocessorBuilder
-
Creates torch preprocessing pipeline utilizing torchvision library
Supported tensor transforms:
- LinearTransformation
- Normalize
- RandomErasing
Usage Pattern:
normalize = tb.TorchEncoder.encode( transforms.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]) ) tb.ImagePreprocessor.builder().torch_tensor_transforms([normalize])
See PyTorch documentation for more details.
Args
transforms
:List[str]
- A list of torchvision.transforms encoded using TorchEncoder
Returns
ImagePreprocessorBuilder
- This class instance, useful for chaining.
Inherited members