Implement AutoRecord for new snapshots. #32

babbage · 2018-04-07T08:16:48Z

Provides an autoRecord parameter. If set, new tests that do not have a reference image will still execute, will return a failure, and will provide a fail image which is a snapshot of the view. This provides the opportunity for the user to review the image and decide whether to set it as a reference image. If recordMode == true, autoRecord is ignored and behavior is as previously.

CLAassistant · 2018-04-07T08:16:55Z

All committers have signed the CLA.

Complementary feature to the implementation of AutoRecord proposed to iOSSnapshotTestCase in this pull request: uber/ios-snapshot-test-case#32

babbage · 2018-04-07T08:51:33Z

One use of such a change is in partnership with a tool like FBSnapshotsViewer, which enables the review of failed snapshot tests, comparison with reference images, and replacement of old reference images with test snapshots. The pull request at Antondomashnev/FBSnapshotsViewer#72 implements the ability to recognise the images created with the autoRecord functionality in the current pull request, and to accept those images as the reference image.

Complementary feature to the implementation of AutoRecord proposed to iOSSnapshotTestCase in this pull request: uber/ios-snapshot-test-case#32

alanzeino · 2018-06-22T00:56:22Z

Could we not implement this via the _recordSnapshotOfViewOrLayer:... path and not the _performPixelComparisonWithViewOrLayer:... path?

Right now, by going through the _performPixelComparisonWithViewOrLayer:... path, you're relying on saveFailedReferenceImage: to save the image for you. But the reference image isn't a 'failed' one in this case, and this makes the current way you've written this difficult to grok and review.

In addition you also had to use this hack:

    BOOL imagesSame;

    if (referenceImage == nil && self.autoRecord) {
      imagesSame = NO;
    } else {
...

To force the library to save the image for you. But in this case, 'imagesSame` has nothing to do with the mode we're in ("auto record").

I think it would be a little easier and clearer if this started from FBSnapshotTestCase.m in snapshotVerifyViewOrLayer:identifier:suffixes:tolerance:defaultReferenceDirectory: like so:

  if (self.recordMode) {
    NSString *referenceImagesDirectory = [NSString stringWithFormat:@"%@%@", referenceImageDirectory, suffixes.firstObject];
    BOOL referenceImageSaved = [self _compareSnapshotOfViewOrLayer:viewOrLayer referenceImagesDirectory:referenceImagesDirectory identifier:(identifier) tolerance:tolerance error:&error];
    if (!referenceImageSaved) {
      [errors addObject:error];
    }
  } else if (self.autoRecord) {
      NSString *referenceImagesDirectory = [NSString stringWithFormat:@"%@%@", referenceImageDirectory, suffixes.firstObject];
      BOOL referenceImageSaved = [self _compareSnapshotOfViewOrLayer:viewOrLayer referenceImagesDirectory:referenceImagesDirectory identifier:(identifier) tolerance:tolerance error:&error];
      if (!referenceImageSaved) {
          [errors addObject:error];
      }
  } else {
...

Then in compareSnapshotOfViewOrLayer:selector:identifier:tolerance:error: of FBSnapshotTestController.m:

  if (self.recordMode) {
    return [self _recordSnapshotOfViewOrLayer:viewOrLayer selector:selector identifier:identifier error:errorPtr];
  } else if (self.autoRecord) {
    return [self _recordSnapshotOfViewOrLayer:viewOrLayer selector:selector identifier:identifier error:errorPtr];
  } else {
    return [self _performPixelComparisonWithViewOrLayer:viewOrLayer selector:selector identifier:identifier tolerance:tolerance error:errorPtr];
  }

...and finally back in snapshotVerifyViewOrLayer:identifier:suffixes:tolerance:defaultReferenceDirectory: at the end:

  if (self.recordMode) {
    return @"Test ran in record mode. Reference image is now saved. Disable record mode to perform an actual snapshot comparison!";
  } else if (self.autoRecord) {
      return @"Test ran in auto record mode. Reference image is now saved. Disable auto record mode to perform an actual snapshot comparison!";
  } else if (!testSuccess) {
    return [NSString stringWithFormat:@"Snapshot comparison failed: %@", errors.firstObject];
  }

  return nil;

...this is easier to understand, and doesn't hack the recording of snapshots into the verify code path.

alanzeino · 2018-06-22T00:56:54Z

FBSnapshotTestCase/FBSnapshotTestCase.m

  if (self.recordMode) {
    return @"Test ran in record mode. Reference image is now saved. Disable record mode to perform an actual snapshot comparison!";
  }
+  else if (!testSuccess && !self.autoRecord) {


please match the indentation of the library

babbage · 2018-06-23T07:14:48Z

I think you might be right that the implementation I've produced has some hacky elements, and there is a cleaner approach. The alternative approach you outline does actually probably produce something that is closer to what a user might expect from something called "AutoRecord", in that if I've read your alternative correctly, the way it would work is that it would make comparisons where a reference image exists, and automatically and silently store a new reference image for any test where there wasn't one.

Closer to what a default expectation might be for "AutoRecord", but the behaviour I've described above is not what I was trying to achieve. Instead, what I was interested in was to have a test fail but produce a reference image candidate. Then, using FBSnapshotsViewer as I linked at the top, there would be a review step where you look at the candidate image, and have to manually select whether to Accept or Reject the candidate image.

Perhaps both of these ideas are worth having; your approach being perhaps AutoRecord and another one that is AutoRecordButFail or something.

In either case, this could still potentially be done through recordSnapshotOfViewOrLayer, just that if it is done there, there would need to be knowledge in that method of how to save a failed comparison image. But let's agree on the intention first, then can figure out the least hacky way to do it.

Later: OK, I think I'm partly wrong above. Your proposed approach wouldn't silently store the new reference image. It would return an error, and show the test as failed. But, I think I am correct re the part that it would at least as currently outlined save that image to the reference images directory immediately, rather than saving it as a failed comparison image that could then be accepted or rejected.

alanzeino · 2018-06-23T08:11:04Z

Yep I totally want the tests to fail in auto record mode; I just wanted this implemented in a more obvious way like the other mode (‘record mode’) for code clarity reasons.

I personally don’t understand how this could be called ‘auto record’ if the images aren’t placed in the reference images directory. Without that, it’s not “auto record” to put them in failed. I guess it won’t work for that Mac tool you’re planning on updating, but “auto record” to me really sounds like “hey if I don’t have this reference image just generate it and move on” not “if I don’t have this reference image save it as a failed image in the failed snapshots directory”.

babbage · 2018-06-25T00:02:48Z

Essentially, the current proposal is trying to implement two things that are related but could be seen separately:
a) Have a process that allows existing tests to run unimpeded, while providing a mechanism to simultaneously progress the process of establishing the baseline for new snapshots.

b) Have a process that provides for a deliberate decision about whether something should be accepted as a new reference image, rather than saving that image to the reference images directory automatically.

I guess I have always wondered about the original implementation of this tool, where if you have a new snapshot or set of snapshots being added, your only option essentially is to blindly add them to the reference images directory, and then use some sort of external process (e.g., source control) to find the new snapshots, review them, and confirm that they are good. Or, to just assume they are good unless you find otherwise during other testing, so you are only looking for future regressions, rather than the possibility that there is already an issue at the point of initially running the process.

Since iOS-snapshot-test-case doesn't provide a tool for snapshot review, perhaps that is just business as usual. It is the case that I have only ever used it in conjunction with the Mac tool for the review process, so I don't have practical experience with working through the other method.

Fair enough that perhaps AutoRecord is the wrong term. Or that AutoRecord should be used as a term that implements a) and saves them to the reference images directory, but having a second option that instead creates a failed image instead? Because if you see an absence of a baseline as being like having a current baseline that is blank, then having a failed image and a diff that shows it is 100% different makes sense.

Would that be acceptable?

alanzeino · 2018-07-09T21:06:43Z

To answer the general question there; yes the intention of the library was to be used in conjunction with source control tools. At Facebook, they use Phabricator which allows engineers to see what has changed in a PR just as the commit would look if landed (though of course they don't use this library anymore or this workflow, for snapshots) — so when the library was created, that was the intended workflow.

Personally I'm of mind that if there is going to be a GUI tool for the library, it should be provided by the library. That way, any choices made as to how the workflow changes for that GUI tool that end up in source level changes are done once, for 'the official way'. This helps us to not have to accommodate every tool that might pop up wanting to do similar things.

As for building that GUI tool, I'm a little wary of doing that considering that Apple might sherlock this library really soon:

https://github.com/cysp/VisualTestKit.framework

Provides an autoRecord parameter. If set, new tests that do not have a reference image will still execute, will return a failure, and will store a reference image for approval. If recordMode == true, autoRecord is ignored and behavior is as previously.

babbage added a commit to babbage/FBSnapshotsViewer that referenced this pull request Apr 7, 2018

Recognise AutoRecord-created images.

707ede6

Complementary feature to the implementation of AutoRecord proposed to iOSSnapshotTestCase in this pull request: uber/ios-snapshot-test-case#32

babbage mentioned this pull request Apr 7, 2018

Recognise and enable acceptance of AutoRecorded images. Antondomashnev/FBSnapshotsViewer#72

Open

babbage force-pushed the AutoRecord branch from 1a9ced8 to c2663ae Compare April 8, 2018 09:27

babbage added a commit to babbage/FBSnapshotsViewer that referenced this pull request Apr 9, 2018

Recognise AutoRecord-created images.

6037f76

Complementary feature to the implementation of AutoRecord proposed to iOSSnapshotTestCase in this pull request: uber/ios-snapshot-test-case#32

babbage force-pushed the AutoRecord branch 4 times, most recently from ddd2fd1 to 6eb98b0 Compare May 27, 2018 04:05

alanzeino requested changes Jun 22, 2018

View reviewed changes

babbage force-pushed the AutoRecord branch from 74488be to 1070b13 Compare February 23, 2019 22:40

babbage force-pushed the AutoRecord branch from 1070b13 to dd27873 Compare February 24, 2019 05:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement AutoRecord for new snapshots. #32

Implement AutoRecord for new snapshots. #32

babbage commented Apr 7, 2018

CLAassistant commented Apr 7, 2018 •

edited

Loading

babbage commented Apr 7, 2018

alanzeino commented Jun 22, 2018

alanzeino Jun 22, 2018

babbage commented Jun 23, 2018 •

edited

Loading

alanzeino commented Jun 23, 2018

babbage commented Jun 25, 2018 •

edited

Loading

alanzeino commented Jul 9, 2018

Implement AutoRecord for new snapshots. #32

Are you sure you want to change the base?

Implement AutoRecord for new snapshots. #32

Conversation

babbage commented Apr 7, 2018

CLAassistant commented Apr 7, 2018 • edited Loading

babbage commented Apr 7, 2018

alanzeino commented Jun 22, 2018

alanzeino Jun 22, 2018

Choose a reason for hiding this comment

babbage commented Jun 23, 2018 • edited Loading

alanzeino commented Jun 23, 2018

babbage commented Jun 25, 2018 • edited Loading

alanzeino commented Jul 9, 2018

CLAassistant commented Apr 7, 2018 •

edited

Loading

babbage commented Jun 23, 2018 •

edited

Loading

babbage commented Jun 25, 2018 •

edited

Loading