
Why not cut the picture directly into quarters instead of using the following operation? #8

Open
Pandaxia8 opened this issue Dec 2, 2021 · 3 comments

Comments

@Pandaxia8

Pandaxia8 commented Dec 2, 2021

def downsample_concatenate(X, kernel):
    b, h, w, c = X.shape
    Y = X.contiguous().view(b, h, w // kernel, c * kernel)
    Y = Y.permute(0, 2, 1, 3).contiguous()
    Y = Y.view(b, w // kernel, h // kernel, kernel * kernel * c).contiguous()
    Y = Y.permute(0, 2, 1, 3).contiguous()
    return Y

I don't understand why you don't just cut the picture directly into quarters. Could you tell me what would happen if I replaced the code above with the following? Thank you! :)

def downsample_concatenate(X, kernel):
    b, h, w, c = X.shape
    Y = X.contiguous().view(b, h // kernel, w // kernel, c * kernel * kernel)
    return Y
@jbcdnr
Collaborator

jbcdnr commented Dec 2, 2021

Hello,

Maybe you can check the explanation in #5?

Here is the difference: your single view just takes consecutive runs of the row-major memory, so each output vector contains consecutive pixels from the same rows rather than a kernel × kernel spatial patch:

import torch

def downsample_concatenate(X, kernel):
    b, h, w, c = X.shape
    Y = X.contiguous().view(b, h, w // kernel, c * kernel)
    Y = Y.permute(0, 2, 1, 3).contiguous()
    Y = Y.view(b, w // kernel, h // kernel, kernel * kernel * c).contiguous()
    Y = Y.permute(0, 2, 1, 3).contiguous()
    return Y

def downsample_concatenate2(X, kernel):
    b, h, w, c = X.shape
    Y = X.contiguous().view(b, h // kernel, w // kernel, c * kernel * kernel)
    return Y

x = torch.arange(16).view(1, 4, 4, 1)
downsample_concatenate(x, 2)
# tensor([[[[ 0,  1,  4,  5],
#           [ 2,  3,  6,  7]],
# 
#          [[ 8,  9, 12, 13],
#           [10, 11, 14, 15]]]])

downsample_concatenate2(x, 2)
# tensor([[[[ 0,  1,  2,  3],
#           [ 4,  5,  6,  7]],
# 
#          [[ 8,  9, 10, 11],
#           [12, 13, 14, 15]]]])

@jbcdnr
Collaborator

jbcdnr commented Dec 2, 2021

Another beautiful solution with einops:

einops.rearrange(x, "batch (h p_h) (w p_w) c -> batch h w (p_h p_w c)", p_h=kernel, p_w=kernel)
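In case einops is not available, the same pattern can be spelled out with plain `view`/`permute` and checked against the original function (`rearrange_patches` is a hypothetical helper, not part of the repo):

```python
import torch

def downsample_concatenate(X, kernel):
    b, h, w, c = X.shape
    Y = X.contiguous().view(b, h, w // kernel, c * kernel)
    Y = Y.permute(0, 2, 1, 3).contiguous()
    Y = Y.view(b, w // kernel, h // kernel, kernel * kernel * c).contiguous()
    Y = Y.permute(0, 2, 1, 3).contiguous()
    return Y

# The einops pattern "(h p_h) (w p_w) c -> h w (p_h p_w c)" spelled out:
# split h and w into (blocks, within-block offsets), bring the two offset
# axes next to the channel axis, then flatten them together.
def rearrange_patches(x, kernel):
    b, h, w, c = x.shape
    y = x.view(b, h // kernel, kernel, w // kernel, kernel, c)
    y = y.permute(0, 1, 3, 2, 4, 5).contiguous()
    return y.view(b, h // kernel, w // kernel, kernel * kernel * c)

x = torch.arange(16).view(1, 4, 4, 1)
assert torch.equal(rearrange_patches(x, 2), downsample_concatenate(x, 2))
```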

@Pandaxia8
Author

I see. Thank you very much for your answer! :)
