ARW: Support new LJPEG compression on ILCE-7M4 and ILCE-7R5 #482

da-phil · 2023-06-04T20:51:02Z

This PR continues where #386 stopped, so all the work in the first commit has been done by @artemist, I just rebased her changes against recent develop and fixed the merge conflicts before starting to add more work on top.

I'm still not sure about the cropping, however could only verify that it is reasonable, even though a few valid border pixels get cropped away, compared to Sony A7R5 files, which are converted to DNG. This really needs a little bit more attention before we settle on something too early.

LebedevRI · 2023-06-04T21:37:43Z

Thank you for rebasing this!

This PR continues where #386 stopped, so all the work has been done by @artemist, I just rebased her changes against recent develop and fixed the merge conflicts.

https://docs.github.com/en/pull-requests/committing-changes-to-your-project/creating-and-editing-commits/creating-a-commit-with-multiple-authors might be relevant here.

<...>

I tried to look in this general direction, and the main stopping point for me is the said boolean interleaveRows.
What i have failed to understand from ITU-T81, is how we are supposed to derive the actual MCU size,
from the LJpeg itself?

* Support large-resolution ARW from ILCE-7RM5 * Generalize sonyArrange in LJPEG arrangement to support 1x2 Co-authored-by: Artemis Tosini <[email protected]>

codecov · 2023-06-04T22:59:26Z

Codecov Report

Patch coverage: 25.88% and project coverage change: -0.25 ⚠️

Comparison is base (820a907) 59.06% compared to head (5f3fe8f) 58.82%.

Additional details and impacted files

@@             Coverage Diff             @@
##           develop     #482      +/-   ##
===========================================
- Coverage    59.06%   58.82%   -0.25%     
===========================================
  Files          232      232              
  Lines        13813    13881      +68     
  Branches      1932     1942      +10     
===========================================
+ Hits          8159     8165       +6     
- Misses        5521     5582      +61     
- Partials       133      134       +1

Flag	Coverage Δ
benchmarks	`8.26% <0.00%> (-0.05%)`	⬇️
integration	`47.18% <27.84%> (-0.19%)`	⬇️
linux	`56.64% <27.50%> (-0.23%)`	⬇️
macOS	`18.77% <0.00%> (-0.10%)`	⬇️
rpu_u	`47.18% <27.84%> (-0.19%)`	⬇️
unittests	`18.20% <0.00%> (-0.10%)`	⬇️
windows	`∅ <ø> (∅)`

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
src/librawspeed/decoders/ArwDecoder.h	`60.00% <ø> (ø)`
src/librawspeed/decoders/ArwDecoder.cpp	`39.76% <8.77%> (-6.79%)`	⬇️
...rc/librawspeed/decompressors/LJpegDecompressor.cpp	`51.38% <52.17%> (-3.23%)`	⬇️
src/librawspeed/decompressors/LJpegDecoder.cpp	`64.28% <100.00%> (+1.32%)`	⬆️

☔ View full report in Codecov by Sentry.
📢 Do you have feedback about the report comment? Let us know in this issue.

kmilos · 2023-06-05T06:06:05Z

What i have failed to understand from ITU-T81, is how we are supposed to derive the actual MCU size,
from the LJpeg itself?

You won't find this anywhere, as this is AFAICT not the subject of ITU-T81. The only way is to relate the TIFF tile size and SOF3 header yourself, as I have mentioned already: #334 (comment)

How you arrange the channels is up to you, you are in charge of the TIFF container abstraction, not ITU-T81 - SOF3 here (restricting to DNG & ARW) just provides simple, non-interleaved multi-component scans (see section 4.8.1 - it doesn't know or care where components A, B, and C came from). Hence the different schemes concocted by Adobe (Ncomp=2), Blackmagic (Ncomp=1) and Sony (Ncomp=4).

Furthermore, section 4.8.2 says this for the non-interleaved multi-component scan:

For example, in Figure 12 the MCU for the noninterleaved case is a single data unit.

meaning - a single pixel.

So, let's stop confusing the ITU-T81 interleaving w/ CFA interleaving please, which is on a higher (and independent) level of abstraction.

da-phil · 2023-06-05T07:51:42Z

Besides the ITU-T81 discussion I just noticed that the current decoding approach fails with M and S sized compressed raws from my Sony A7r5, see images. Decoding those (M & S) files gives me the following exception:
rawspeed::ArwDecoder::DecodeLJpeg(const rawspeed::TiffIFD*) const, line 292: Unexpected bits per pixel: 15

However, it works with the files which were uploaded to raw.pixls.us: https://raw.pixls.us/data/Sony/ILCE-7RM5

It is possible that the LJPEG coding depends on some other settings (other than size), which apparently I set on my camera?

kmilos · 2023-06-05T08:28:42Z

M & S ARW images are a different can of worms - they are not CFA raws, but demosaiced YCbCr...

kmilos · 2023-06-05T08:58:21Z

src/librawspeed/decoders/ArwDecoder.cpp

+  const TiffEntry* origin_entry = raw->getEntry(TiffTag::DEFAULTCROPORIGIN);
+  const TiffEntry* size_entry = raw->getEntry(TiffTag::DEFAULTCROPSIZE);


As I already mentioned, let's not use these but the dimensions in the new tag 0x7038. We then need a new mode that'll tell us to ignore any further cameras.xml crop.

You could even go a step further and use those directly as width and height on L281-L282 without needing to do anything extra here. The LJpeg decoder should take of the padding just fine.

Seems to work, I just pushed the changes!

You could even go a step further and use those directly as width and height on L281-L282 without needing to do anything extra here. The LJpeg decoder should take of the padding just fine.

This however does not work, as the width & height need to be multiples of the tile size to make the tiling code work, or how did you mean that?

They don't. The tiling code rounds up to the required multiple.

Well, at least it does in the DNG decoder, don't know if it'll just work out here.

We then need a new mode that'll tell us to ignore any further cameras.xml crop.

Can we make the mandatory cropping step according to the values in cameras.xml optional by just skipping it on a decoder basis?
But I guess it would be cleaner and more flexible to introduce a new flag in cameras.xml a'la
<Crop x="0" y="0" width="0" height="0" read_from_exif="true"/>
If cropping based on exif tags fails cropping shall fall back to values provided in cameras.xml .

Again, let's not call this "Exif crop". This is a pure kludge on Sony's part.

Thanks, will have a look later, when I've got time to finish this work package for final review ;)

Just checked all the rawspeed entrypoints and decodeRaw() always comes before decodeMetaData() (both calling their inherited decoder specific function), so this should work. Will give it a try now...

by the way, by resetting the history on one of my test images using the Exif.SubImage1.0x7038 crop I saw that this gives an area which still contains garbage from the tiling operation on the right and a black bar on the bottom

If I set the offset to 0,0 I only get valid pixels and no borders, so this way you can get the max. possible pixel area it seems. What's weird though is the size from Exif.SubImage1.0x7038 = 9568, 6376, but darktable crop module shows me 9567, 6376.
The DNG converted file shows 9504, 6336 in the crop module 😅

Then I did another test with way bigger crop to see where the garbage & borders begins and this proves that the values in Exif.SubImage1.0x7038 seem to be quite accurate...

Rather than resetting for validation, I'd remove the old images from the database (and their xmp sidecars), and import again.

The 0x7038 tag values, as mentioned, should result in the same size as the uncompressed/lossy case w/ cameras.xml crop. For 7RM5 this is, for uncompressed/lossy (9600-32)x(6376-0) which matches ljpeg compressed 0x7038 values 9568x6376.

Agreed, then the current state (of cropping) in this PR makes sense, right?

da-phil · 2023-06-05T09:55:49Z

M & S ARW images are a different can of worms - they are not CFA raws, but demosaiced YCbCr...

OMG, why would you do that? this is quite f*cked up 😅

Answering myself to

However, it works with the files which were uploaded to raw.pixls.us: https://raw.pixls.us/data/Sony/ILCE-7RM5

it seems I picked the S35 files, which are further cropped in, however I don't understand why the "normal" M&S files use a completely different encoding (demosaiced YCbCr)?

da-phil · 2023-06-05T09:58:52Z

I tried to look in this general direction, and the main stopping point for me is the said boolean interleaveRows.
What i have failed to understand from ITU-T81, is how we are supposed to derive the actual MCU size,
from the LJpeg itself?

@LebedevRI
When I rename the compressed ARW files to CR2 or DNG, rawspeed seems to still be able to decode the files and somehow deal with the "interleaving". I haven't understood all of the code to explain that to myself yet 😅
But this already shows the potential for refactoring & generalization of all this LJPEG stuff...

kmilos · 2023-06-05T10:06:59Z

it seems I picked the S35 files, which are further cropped in,

Yes, the S35/APS-C crop is the same size in pixels as the M output in FF, but the M are demosaiced and full field of view (i.e. scaled down), while S35/APS-C are still CFAs, only physically cropped FOV.

why would you do that?

Even smaller files, if you don't care about the resolution, but maybe still care about dynamic range and want to push color and tone processing in post.

Again, different can of worms (CbCr sub-sampling and ITU-T81 interleaving might play a factor here), so I'd ignore M&S for now and tackle them separately later. The S35/APS-C cropped ARWs should just work once this first hurdle is done.

kmilos · 2023-06-05T13:33:18Z

@LebedevRI Actually, after looking at the SOF3 and SOS headers again, there is component interleaving on the MCU level:

Sony SOF3 (14-bit, 256x256 frame, 4 components, 1x1 sampling each): FF C3 00 14 0E 01 00 01 00 04 01 11 00 02 11 00 03 11 00 04 11 00
Sony SOS (4 components interleaved, single entropy table, horiz. predictor): FF DA 00 0E 04 01 00 02 00 03 00 04 00 01 00 00

and, for Adobe DNG:

Adobe SOF3 (16-bit, 128x256 frame, 2 components, 1x1 sampling each): FF C3 00 0E 10 01 00 00 80 02 00 11 00 01 11 00
Adobe SOS (2 components interleaved, entropy table each, horiz. predictor): FF DA 00 0A 02 00 00 01 10 01 00 00

But the connection between the SOF3 4/2-component frame and the TIFF single channel CFA tile is still independent of ITU-T81 and on a different level of abstraction you (i.e. Adobe, Blackmagic, Sony) have to establish and maintain.

And similar for the Blackmagic case, where the number of components is 1, so there is even no MCU interleaving...

Blackmagic SOF3 (12-bit, 2064x1088, 1 component, 1x1 sampling): FF C3 00 0B 0C 04 40 08 10 01 00 11 00
Blackmagic SOS (1 component, predictor 6): FF DA 00 08 01 00 00 06 00 00

All of these are valid ITU-T81 - it doesn't know or care where the components it was fed came from in these 3 cases, but the bespoke TIFF decoder needs to know how to rearrange these SOF3 frames back into 512x512, 256x256, and 1032x2176 TIFF tiles respectively.

Recap: due to crop size roundup to multiples of tile size (512) Sony lossless files exhibit black areas if not further cropped down to actual pixel area, which is covered by the sensor. Exif.SubImage1.0x7038 seems to reliably provide the true crop size, whereas Exif.SubImage1.DefaultCropSize only crops to OOC jpeg size losing some pixel area

kmilos · 2023-06-07T10:47:19Z

src/librawspeed/decoders/ArwDecoder.cpp

@@ -145,6 +147,13 @@ RawImage ArwDecoder::decodeRawInternal() {
    return mRaw;
  }

+  if (7 == compression) {
+    DecodeLJpeg(raw);
+    // cropping of lossless compressed L files already done in Ljpeg decoder


Not only L, but also APS-C/S35...

Could also leave a note that M/S are not CFA, and support is TBD. We also probably want to detect this somehow, if not throwing an exception earlier already?

Question is, where in the whole pipeline does the encoding fail, it is not even executing the ARW decoder in case of M/S images.

src/librawspeed/decoders/ArwDecoder.cpp

LebedevRI · 2023-06-07T18:11:03Z

Let me just take this over...

LebedevRI · 2023-06-07T18:13:31Z

(Taking over in #483)

da-phil · 2023-06-07T18:33:45Z

Let me just take this over...

would be great if you have some time to continue this work properly, but please let us know if you need help with some tasks, maybe with the easier ones ;)

LebedevRI · 2023-06-08T22:13:21Z

c38746d

LebedevRI · 2023-06-08T22:23:25Z

Let me just take this over...

would be great if you have some time to continue this work properly

Nah.

but please let us know if you need help with some tasks, maybe with the easier ones ;)

Yeah i wish i knew. Some small self-contained things could include:

Get rid of all hints.has() calls, there should be a way to tell from the raw itself, most of the time
Some cameras store black/white levels in EXIF, while some decoders already handle that, many do not (e.g. Canon)
Black area sampling is just completely broken
Black level handling needs to be refactored to support arbitrary number (and layout) of bins
???

On a high level, i'm still stuck on the general idea to the likes of #325.

LebedevRI · 2023-06-09T00:04:53Z

@da-phil thank you!

da-phil requested a review from LebedevRI as a code owner June 4, 2023 20:51

da-phil mentioned this pull request Jun 4, 2023

ARW: Support new LJPEG compression on ILCE-7M4 #386

Closed

da-phil force-pushed the sony-ljpeg2 branch 4 times, most recently from 700d4d1 to ee36657 Compare June 4, 2023 22:17

ARW: Support new LJPEG compression on ILCE-7M4 and ILCE-7R5

b383e9e

* Support large-resolution ARW from ILCE-7RM5 * Generalize sonyArrange in LJPEG arrangement to support 1x2 Co-authored-by: Artemis Tosini <[email protected]>

da-phil force-pushed the sony-ljpeg2 branch from ee36657 to b383e9e Compare June 4, 2023 22:19

kmilos reviewed Jun 5, 2023

View reviewed changes

da-phil added 3 commits June 5, 2023 19:50

Boyscout: make sony exif tag naming consistent

1972397

Do not apply crop from cameras.xml and set crop origin to 0,0

5f3fe8f

kmilos suggested changes Jun 7, 2023

View reviewed changes

LebedevRI mentioned this pull request Jun 7, 2023

ARW: Support new LJPEG compression on ILCE-7M4 and ILCE-7R5 #483

Closed

LebedevRI closed this Jun 7, 2023

LebedevRI mentioned this pull request Jun 8, 2023

ARW: Support new LJPEG compression on ILCE-7M4 and ILCE-7R5, approach 2 #484

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ARW: Support new LJPEG compression on ILCE-7M4 and ILCE-7R5 #482

ARW: Support new LJPEG compression on ILCE-7M4 and ILCE-7R5 #482

da-phil commented Jun 4, 2023 •

edited

Loading

LebedevRI commented Jun 4, 2023

codecov bot commented Jun 4, 2023 •

edited

Loading

kmilos commented Jun 5, 2023 •

edited

Loading

da-phil commented Jun 5, 2023 •

edited

Loading

kmilos commented Jun 5, 2023 •

edited

Loading

kmilos Jun 5, 2023 •

edited

Loading

da-phil Jun 5, 2023

kmilos Jun 5, 2023 •

edited

Loading

da-phil Jun 5, 2023

kmilos Jun 5, 2023

da-phil Jun 6, 2023

da-phil Jun 6, 2023

da-phil Jun 6, 2023

kmilos Jun 7, 2023

da-phil Jun 7, 2023

da-phil commented Jun 5, 2023

da-phil commented Jun 5, 2023 •

edited

Loading

kmilos commented Jun 5, 2023 •

edited

Loading

kmilos commented Jun 5, 2023 •

edited

Loading

kmilos Jun 7, 2023 •

edited

Loading

da-phil Jun 7, 2023

LebedevRI commented Jun 7, 2023

LebedevRI commented Jun 7, 2023

da-phil commented Jun 7, 2023

LebedevRI commented Jun 8, 2023

LebedevRI commented Jun 8, 2023

LebedevRI commented Jun 9, 2023

		const TiffEntry* origin_entry = raw->getEntry(TiffTag::DEFAULTCROPORIGIN);
		const TiffEntry* size_entry = raw->getEntry(TiffTag::DEFAULTCROPSIZE);

ARW: Support new LJPEG compression on ILCE-7M4 and ILCE-7R5 #482

ARW: Support new LJPEG compression on ILCE-7M4 and ILCE-7R5 #482

Conversation

da-phil commented Jun 4, 2023 • edited Loading

LebedevRI commented Jun 4, 2023

codecov bot commented Jun 4, 2023 • edited Loading

Codecov Report

kmilos commented Jun 5, 2023 • edited Loading

da-phil commented Jun 5, 2023 • edited Loading

kmilos commented Jun 5, 2023 • edited Loading

kmilos Jun 5, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kmilos Jun 5, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

da-phil commented Jun 5, 2023

da-phil commented Jun 5, 2023 • edited Loading

kmilos commented Jun 5, 2023 • edited Loading

kmilos commented Jun 5, 2023 • edited Loading

kmilos Jun 7, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

LebedevRI commented Jun 7, 2023

LebedevRI commented Jun 7, 2023

da-phil commented Jun 7, 2023

LebedevRI commented Jun 8, 2023

LebedevRI commented Jun 8, 2023

LebedevRI commented Jun 9, 2023

da-phil commented Jun 4, 2023 •

edited

Loading

codecov bot commented Jun 4, 2023 •

edited

Loading

kmilos commented Jun 5, 2023 •

edited

Loading

da-phil commented Jun 5, 2023 •

edited

Loading

kmilos commented Jun 5, 2023 •

edited

Loading

kmilos Jun 5, 2023 •

edited

Loading

kmilos Jun 5, 2023 •

edited

Loading

da-phil commented Jun 5, 2023 •

edited

Loading

kmilos commented Jun 5, 2023 •

edited

Loading

kmilos commented Jun 5, 2023 •

edited

Loading

kmilos Jun 7, 2023 •

edited

Loading