make image similarity check less sensitive#258
Closed
jbitton wants to merge 1 commit into facebookresearch:main from
Conversation
Contributor

This pull request was exported from Phabricator. Differential Revision: D70137163
jbitton added a commit to jbitton/AugLy that referenced this pull request on Feb 25, 2025
Summary: Pull Request resolved: facebookresearch#258

As part of my personal side quest to make AugLy's tests pass again, I am making a change to our tests. Currently, to assess image similarity, we use the `np.allclose` function. While that is better (less sensitive) than an MD5 hash, it is not much better, because imperceptible changes can still produce large differences between the NumPy image arrays. So, to make AugLy's tests less affected by slight version updates in PIL or elsewhere, we are switching to imagehash. Specifically, we are using the perceptual hash (pHash); you can read about it here: https://www.hackerfactor.com/blog/index.php?/archives/432-Looks-Like-It.html

pHash isn't a perfect fit long term, though: it is not sensitive to color, scaling, or aspect-ratio changes. To handle the latter two, I am keeping the size equality check. For color, I want to do more research on an efficient approach. Nonetheless, this is still better than what we have right now.

Differential Revision: D70137163
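The trade-off described above can be sketched with a toy example. This is not AugLy's test code and not the imagehash implementation; `dct_2d`, `toy_phash`, and `hamming` are hypothetical names for a minimal pure-NumPy illustration of why a DCT-based perceptual hash tolerates an imperceptible pixel change that `np.allclose` rejects.

```python
import numpy as np

def dct_2d(a: np.ndarray) -> np.ndarray:
    """Naive 2D type-II DCT via cosine basis matrices (illustration only)."""
    n = a.shape[0]
    j = np.arange(n)
    basis = np.cos(np.pi * (2 * j[None, :] + 1) * j[:, None] / (2 * n))
    return basis @ a @ basis.T

def toy_phash(gray: np.ndarray, hash_size: int = 8) -> np.ndarray:
    """Toy pHash: DCT, keep the low-frequency block, threshold at the median."""
    low = dct_2d(gray.astype(np.float64))[:hash_size, :hash_size]
    return (low > np.median(low)).ravel()   # 64 boolean bits

def hamming(h1: np.ndarray, h2: np.ndarray) -> int:
    return int(np.count_nonzero(h1 != h2))

# A uniform +1 brightness shift is visually imperceptible, yet far outside
# np.allclose's default tolerances (rtol=1e-5, atol=1e-8) for 0..255 pixels.
rng = np.random.default_rng(0)
img = rng.integers(0, 255, (32, 32)).astype(np.float64)
shifted = img + 1.0

print(np.allclose(img, shifted))                    # prints False: strict check fails
print(hamming(toy_phash(img), toy_phash(shifted)))  # small: hash is nearly unchanged
```

With the real library, the same comparison would be on the order of `imagehash.phash(pil_img1) - imagehash.phash(pil_img2)`, since subtracting two `ImageHash` objects yields their Hamming distance; a small threshold on that distance is what makes the test robust to PIL version drift.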
Force-pushed 66eb525 to 7778f95
jbitton added a commit to jbitton/AugLy that referenced this pull request on Feb 26, 2025
Force-pushed 7778f95 to e746ff9
(facebookresearch#258) Summary: As part of my personal side quest to make AugLy's tests pass again, I am making a change to our tests.

## overlay wrap text fix

It seems we were modifying the original image in place for overlay wrap text, which is not the AugLy way (we always copy, modify, and return a new image). This change fixes that, and it also fixes 99% of the image tests.

## imagehash change

Currently, to assess image similarity, we use the `np.allclose` function. While that is better (less sensitive) than an MD5 hash, it is not much better, because imperceptible changes can still produce large differences between the NumPy image arrays. So, to make AugLy's tests less affected by slight version updates in PIL or elsewhere, we are switching to imagehash. Specifically, we are using the perceptual hash (pHash); you can read about it here: https://www.hackerfactor.com/blog/index.php?/archives/432-Looks-Like-It.html

pHash isn't a perfect fit long term, though: it is not sensitive to color, scaling, or aspect-ratio changes. To handle the latter two, I am keeping the size equality check. For color, I want to do more research on an efficient approach. Nonetheless, this is still better than what we have right now.

Reviewed By: joelicohk, mayaliliya

Differential Revision: D70137163
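The "copy + modify + return a new image" convention the commit message describes can be sketched as follows. This is a hypothetical illustration, not AugLy's overlay-wrap-text code: `brighten` is an invented name, and a NumPy array stands in for the PIL image.

```python
import numpy as np

def brighten(image: np.ndarray, delta: int) -> np.ndarray:
    """Never mutate the caller's image: work on a copy and return it.
    astype() already allocates a new array, so the input stays untouched."""
    out = image.astype(np.int16) + delta
    return np.clip(out, 0, 255).astype(np.uint8)

img = np.full((4, 4), 100, dtype=np.uint8)
result = brighten(img, 50)
print(img[0, 0], result[0, 0])   # prints: 100 150 -- the input is unchanged
```

An in-place version (e.g. `image += delta`) would silently corrupt the caller's image, which is exactly the class of bug the overlay-wrap-text fix removed.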
Force-pushed e746ff9 to f2b5f6c
Contributor

This pull request has been merged in 0bfda4c.