sleap.nn.data.normalization

sleap.nn.data.normalization#

Transformers for normalizing data formats.

class sleap.nn.data.normalization.Normalizer(image_key: str = 'image', ensure_float: bool = True, ensure_rgb: bool = False, ensure_grayscale: bool = False, imagenet_mode: Optional[str] = None)[source]#

Data transformer to normalize images.

This is useful as a transformation to data streams that require specific data ranges such as for pretrained models with specific preprocessing constraints.

image_key#

String name of the key containing the images to normalize.

Type:: str

ensure_float#

If True, converts the image to a tf.float32 if not already.

Type:: bool

ensure_rgb#

If True, converts the image to RGB if not already.

Type:: bool

ensure_grayscale#

If True, converts the image to grayscale if not already.

Type:: bool

imagenet_mode#

Specifies an ImageNet-based normalization mode commonly used in tf.keras.applications-based pretrained models. No effect if not set. Valid values are: “tf”: Values will be scaled to [-1, 1], expanded to RGB if grayscale. “caffe”: Values will be scaled to [0, 255], expanded to RGB if grayscale,

RGB channels flipped to BGR, and subtracted by a fixed mean.

“torch”: Values will be scaled to [0, 1], expanded to RGB if grayscale,: subtracted by a fixed mean, and scaled by fixed standard deviation.

Type:: Optional[str]

classmethod from_config(config: PreprocessingConfig, image_key: str = 'image') → Normalizer[source]#

Build an instance of this class from its configuration options.

Parameters:

config – An PreprocessingConfig instance with the desired parameters.
image_key – String name of the key containing the images to normalize.

Returns:

An instance of this class.

property input_keys: List[str]#: Return the keys that incoming elements are expected to have.

property output_keys: List[str]#: Return the keys that outgoing elements will have.

transform_dataset(ds_input: DatasetV2) → DatasetV2[source]#

Create a dataset that contains centroids computed from the inputs.

Parameters:: ds_input – A dataset with image key specified in the image_key attribute.
Returns:: A tf.data.Dataset with elements containing the same images with normalization applied.

sleap.nn.data.normalization.convert_rgb_to_bgr(image: Tensor) → Tensor[source]#

Convert an RGB image to BGR format by reversing the channel order.

Parameters:: image – Tensor of any dtype with shape (…, 3) in RGB format. If grayscale, the image will be converted to RGB first.
Returns:: The input image with the channels axis reversed.

sleap.nn.data.normalization.ensure_float(image: Tensor) → Tensor[source]#

Convert the image to a tf.float32.

Parameters:

image – Tensor of any dtype.

Returns:

A tensor of the same shape as image but with dtype tf.float32. If the image was already of tf.float32 dtype, it will not be changed.

If the input was of an integer type, it will be scaled to the range [0, 1] according to the dtype’s maximum value.

sleap.nn.data.normalization

Contents

sleap.nn.data.normalization#