Pixel decoder internal structure #130

MicheleCazzola · 2024-11-30T23:47:47Z

It is not clear to me what is the internal structure of the pixel decoder.

It is clear that it uses the Multi-Scale Deformable Attention and an encoder-only transformer, but what are the query points and features? Is some upsampling performed? How many transformer layers are used and what are their characteristics in terms of input and outputs?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pixel decoder internal structure #130

Pixel decoder internal structure #130

MicheleCazzola commented Nov 30, 2024

Pixel decoder internal structure #130

Pixel decoder internal structure #130

Comments

MicheleCazzola commented Nov 30, 2024