基于tensorflow的图像处理(二) tf.image的使用方法

阅读量：693 次

发布时间：2019-03-17

本文共 3254 字，大约阅读时间需要 10 分钟。

TensorFlow 图像处理函数参考手册

1. 图像编码与解码

TensorFlow 提供了对常见图像格式（如 JPEG、PNG）的解码和编码功能，便于将压缩后的图像数据还原为可处理的三维矩阵。以下是基于 JPEG 格式的典型操作示例：

import matplotlib.pyplot as pltimport tensorflow as tf# 读取原始/jpeg格式的图像数据image_raw_data = tf.gfile.FastGFile("/path/to/picture.jpg", 'r').read()# 使用TensorFlow解码 jpeg 格式的图像with tf.Session() as sess:    img_data = tf.image.decode_jpeg(image_raw_data)    # 输出解码后的图像矩阵    print(img_data.eval())    # 显示解码后的图像    plt.imshow(img_data.eval())    plt.show()

编码插入后的示例：

with tf.Session() as sess:    # 对解码后的图像进行编码    encoded_image = tf.image.encode_jpeg(img_data)    # 存储编码后的图像数据    with tf.gfile.GFile("/path/to/output.jpg", "wb") as f:        f.write(encoded_image.eval())

注意事项：

解码和编码操作需要在训练好的 TensorFlow 会话中执行，以确保计算图的正确性。

2. 图像大小调整

TensorFlow 提供了多种图像大小调整的方法，以适应不同神经网络的输入需求。以下是常用的调整方式：

使用 resize_images 函数

# 假设 img_data 是已解码的图像张量resized = tf.image.resize_images(img_data, [300, 300], method=0)# 使用 matplotlib 可视化调整后的图像plt.imshow(resized.eval())plt.show()

resize_images 参数说明：

method：取值为 0-3，对应不同的插值方法：
- 0：双线性插值
- 1：最近邻插值
- 2：双三次插值
- 3：面积插值

适用场景

** entrenairng：** 在训练数据集上调整图像大小以适应模型输入。

** inference：** 在推理阶段进一步缩放图像以获得统一的输入尺寸。

剪裁与填充

TensorFlow 提供了 crop_and_resize 和 crop_to_bounding_box 函数，用于根据需求对图像进行剪裁或填充操作：

# 裁剪并调整大小resized_with_crop = tf.image.crop_and_resize(img_data, boxes, box_ind, [180, 26])

注意事项：

揎割前需确保图像大小和标注框的坐标一致。

填充操作适用于目标图像大于原始图像的情况。

3. 图像翻转与调整

TensorFlow 提供了多种图像翻转和调整功能，用于提高模型对图像变换的鲁棒性。

翻转操作

# 上下翻转flipped = tf.image.flip_up_down(img_data)# 左右翻转flipped = tf.image.flip_left_right(flipped)# 随机翻转（训练阶段推荐）flipped = tf.image.random_flip_up_down(flipped)flipped = tf.image.random_flip_left_right(flipped)

色彩调整

# 调整亮度adjusted = tf.image.adjust_brightness(img_data, -0.5)# 调整对比度adjusted = tf.image.adjust_contrast(img_data, 0.5)# 调整色相adjusted = tf.image.adjust_hue(img_data, 0.1)# 调整饱和度adjusted = tf.image.adjust_saturation(img_data, 0.5)# 随机调整颜色（推荐训练阶段使用）adjusted = tf.image.random_brightness(adjusted, 32. / 255.)adjusted = tf.image.random_saturation(adjusted, 0.5, 1.5)adjusted = tf.image.random_hue(adjusted, 0.2)adjusted = tf.image.random_contrast(adjusted, 0.5, 1.5)

标准化

# 将图像数据进行标准化（归一化）normalized = tf.image.perception_halve(img_data)

4. 关键图像处理 API

TensorFlow 提供了以下关键图像处理 API，广泛应用于图像识别任务。

1. resize_bilinear

使用双线性插值调整图像大小。

resized = tf.image.resize_bilinear(img_data, [299, 299])

参数：

align_corners：布尔值，控制角点对齐（默认 False）。

输出类型：float32。

2. non_max_suppression

贪婪选择边界框子集。

# 选择最优的边界框elected_indices = tf.image.non_max_suppression(boxes, scores, max_output_size=200)selected_boxes = tf.gather(boxes, elected_indices)

3. crop_and_resize

从图像中剪裁并调整大小。

# 从图片中剪裁并调整大小crops = tf.image.crop_and_resize(img_data, boxes, box_ind, [128, 128])

4. flip_left_right

左右翻转图像。

flipped = tf.image.flip_left_right(img_data)

5. draw_bounding_boxes

在图像中绘制边界框。

# 绘制边界框result = tf.image.draw_bounding_boxes(images, boxes)

6. resize_nearest_neighbor

使用最近邻插值调整图像大小。

resized = tf.image.resize_nearest_neighbor(img_data, [224, 224])

7. resize_images

使用指定方法调整图像大小的通用函数。

# 使用双线性插值调整大小resized_images = tf.image.resize_images(img_data, [224, 224], method=tf_resize.ResizeMethod.BILINEAR)

5. Oficial Resources

TensorFlow官方文档：TensorFlow 图像处理 API

Example Scripts：[TensorFlow Examples](https://github.com/tensorflow/tensorflow/tree/master/tensorflow/examples image_processing)

转载地址：http://tjhhz.baihongyu.com/

你可能感兴趣的文章

MySQL 用户权限管理：授权、撤销、密码更新和用户删除（图文解析）

查看>>

mysql 用户管理和权限设置

查看>>

MySQL 的 varchar 水真的太深了！

查看>>

mysql 的GROUP_CONCAT函数的使用（group_by 如何显示分组之前的数据）

查看>>

MySQL 的instr函数

查看>>

MySQL 的mysql_secure_installation安全脚本执行过程介绍

查看>>

MySQL 的Rename Table语句

MySQL 的存储引擎有哪些？为什么常用InnoDB？

查看>>

MySQL 精选 60 道面试题（含答案）