ddpm 코드 분석

2024-04-06 최대 1 분 소요

motivation: ddpm 코드 분석

원본 코드: https://github.com/lucidrains/denoising-diffusion-pytorch/blob/main/denoising_diffusion_pytorch/denoising_diffusion_pytorch.py

1. Block:

class Block(nn.Module):
    def __init__(self, dim, dim_out, groups = 8):
        super().__init__()
        self.proj = nn.Conv2d(dim, dim_out, 3, padding = 1)
        self.norm = nn.GroupNorm(groups, dim_out)
        self.act = nn.SiLU()

    def forward(self, x, scale_shift = None):
        x = self.proj(x)
        x = self.norm(x)

        if exists(scale_shift):
            scale, shift = scale_shift
            x = x * (scale + 1) + shift

        x = self.act(x)
        return x

3*3conv+ Groupnorm+silu

2. ResnetBlock

class ResnetBlock(nn.Module):
    def __init__(self, dim, dim_out, *, time_emb_dim = None, groups = 8):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.SiLU(),
            nn.Linear(time_emb_dim, dim_out * 2)
        ) if exists(time_emb_dim) else None

        self.block1 = Block(dim, dim_out, groups = groups)
        self.block2 = Block(dim_out, dim_out, groups = groups)
        self.res_conv = nn.Conv2d(dim, dim_out, 1) if dim != dim_out else nn.Identity()

    def forward(self, x, time_emb = None):

        scale_shift = None
        if exists(self.mlp) and exists(time_emb):
            time_emb = self.mlp(time_emb)
            time_emb = rearrange(time_emb, 'b c -> b c 1 1')
            scale_shift = time_emb.chunk(2, dim = 1)

        h = self.block1(x, scale_shift = scale_shift)

        h = self.block2(h)

        return h + self.res_conv(x)

Twitter Facebook LinkedIn

LG_CBM paper summary

2025-06-04 9 분 소요

motivation: VLG_CBM paper summary 1. Introduction CBM(Concept Bottleneck Model)은 중간에 Concept Bottleneck Layer(CBL)을 삽입해서, 사람이 이해할 수 있는 개념 단위로 예측 근거를 제공한다. ...

sumarize paper called ‘A Bayesian Approach To Analysing Training Data Attribution In Deep Learning’

2025-05-21 3 분 소요

motivation: sumarize paper ‘A Bayesian Approach To Analysing Training Data Attribution In Deep Learning’ Blog Post: Rethinking TDA – A Bayesian Approach to ...

ewc 코드 분석

2024-08-05 5 분 소요

motivation: ewc 코드를 분석해보자. elastic weight consolidation. fisher information이란 무엇인가. 이거를 알아야 할 필요가 있어 보여서, 분석을 해보겠다. 두개만 이해하면 된다. 1. EWC class, 2. ewc...

information theory

2024-07-16 1 분 소요

motivation: 정보량에 대해서 알아보자. entropy의 정의 entropy의 정의는, 정보를 표현할 수 있는 평균 최소 자원량을 의미한다. 정의는 sigma_i(p_i)(log(1/p_i))로 쓰는데, 최소 자원량이라는 말에 log(1/pi)가 들어가있고, 평균...

ddpm 코드 분석

1. Block:

3*3conv+ Groupnorm+silu

2. ResnetBlock

공유하기

댓글남기기

참고

LG_CBM paper summary

sumarize paper called ‘A Bayesian Approach To Analysing Training Data Attribution In Deep Learning’

ewc 코드 분석

information theory