How to use wandb

2024-07-03 1 min read

motivation:

https://docs.wandb.ai/quickstart

On Linux, run pip install wandb.
After that, the wandb login step is done inside the Python code.

  

import wandb
wandb.login(key="42a1f3a.....")
# start a new wandb run to track this script
wandb.init(
    # set the wandb project where this run will be logged
    project="my-awesome-project",
    # CONFIG is this script's own settings object, defined elsewhere
    config={
        "learning_rate": CONFIG.LR,  # 3e-4
        "number_of_epochs": CONFIG.N_EPOCHS,  # 5
        "batch_size": CONFIG.BATCH_SIZE,  # 96
        "sample_rate": CONFIG.SR,  # 32000
        "N_MFCC": CONFIG.N_MFCC,  # 13
        "SEED": CONFIG.SEED,  # 42
        "architecture": "MLP",
        "hidden_dim": CONFIG.hidden_dim,
        # "dataset": "audio_test50000_train55438",
        # "N_CLASSES": 2,
    },
)

Define the run like this,

then later, inside the for i in epochs: loop,
call wandb.log({"train_loss": _train_loss, "_val_loss": _val_loss}) to log the metrics. They are then recorded to the run automatically.
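The per-epoch logging pattern described above can be sketched roughly as follows. This is only an illustration: `run_epochs` and the `log_fn` parameter are hypothetical names, and the loss values are placeholders standing in for a real training and validation step. The logger is passed in as a callable so the sketch runs without a wandb account.

```python
from typing import Callable, Dict, List


def run_epochs(n_epochs: int, log_fn: Callable[[Dict[str, float]], None]) -> List[Dict[str, float]]:
    """Call log_fn once per epoch with that epoch's metrics (hypothetical helper)."""
    history = []
    for epoch in range(1, n_epochs + 1):
        # Placeholder losses: stand-ins for the real train/validation steps.
        _train_loss = 1.0 / epoch
        _val_loss = 1.2 / epoch
        metrics = {"epoch": epoch, "train_loss": _train_loss, "_val_loss": _val_loss}
        log_fn(metrics)  # one call per epoch, one dict of metrics
        history.append(metrics)
    return history


# Stand-in logger: collect the dicts in a list instead of sending them to wandb.
logged: List[Dict[str, float]] = []
history = run_epochs(3, logged.append)
```

In an actual run, after `wandb.init(...)` has been called, `wandb.log` itself can be passed as `log_fn`, since it accepts a dictionary of metric names to values.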

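import os
import random

import numpy as np
import torch

def seed_everything(seed):
    random.seed(seed)
    os.environ['PYTHONHASHSEED'] = str(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    torch.cuda.manual_seed(seed)
    torch.backends.cudnn.deterministic = True
    # benchmark must be False for reproducibility: with True, cuDNN may pick
    # different (non-deterministic) algorithms from run to run
    torch.backends.cudnn.benchmark = False

seed_everything(CONFIG.SEED) # fix the seed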
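As a quick sanity check of what fixing the seed buys, here is a minimal sketch using only the standard-library `random` module (so it covers just one of the generators seeded above). The helper name `draw` is hypothetical.

```python
import random


def draw(n: int, seed: int) -> list:
    """Seed the stdlib generator, then draw n pseudo-random floats."""
    random.seed(seed)
    return [random.random() for _ in range(n)]


first = draw(5, 42)
second = draw(5, 42)     # same seed: the sequence should repeat exactly
different = draw(5, 43)  # different seed: a different sequence
```

The same idea applies to the NumPy and PyTorch generators: re-seeding with the same value before a run is what makes two runs comparable.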

2. watch

wandb.watch(model, log='all')

def train(model, optimizer, train_loader, val_loader, device):

    model.to(device)
    criterion = nn.BCELoss().to(device)
    
    wandb.watch(model, log='all')
    
    best_val_score = 0
    best_model = None
    for epoch in range(1, CONFIG.N_EPOCHS+1):
        # .......

Once the model is defined, calling wandb.watch on it like this makes wandb track its gradients (with log='all', the parameters are tracked as well).

3. sweep

https://docs.wandb.ai/guides/sweeps/define-sweep-configuration

To have the hyperparameters searched automatically as well, run

wandb sweep --project <project-name> <path-to-config-file>
wandb agent <sweep-ID>

on Linux.

In my case,

running wandb sweep --project my-awesome-project sweep_config.yaml prints the sweep ID in the terminal. In my case it was g7kdxvhl.

Then run

wandb agent 'wys000112-Kyung Hee University/my-awesome-project/415m1qg2'
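The argument to wandb agent has the form entity/project/sweep-ID, and it needs shell quoting when the entity name contains spaces, as it does here. A small sketch of assembling that command string (the helper `agent_command` is a hypothetical name, not part of wandb):

```python
import shlex


def agent_command(entity: str, project: str, sweep_id: str) -> str:
    """Build the `wandb agent` command line for a given sweep (hypothetical helper)."""
    sweep_path = f"{entity}/{project}/{sweep_id}"
    # shlex.quote adds single quotes when the path contains spaces or other unsafe characters
    return f"wandb agent {shlex.quote(sweep_path)}"


cmd = agent_command("wys000112-Kyung Hee University", "my-awesome-project", "415m1qg2")
print(cmd)
```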

infer_model = train(model, optimizer, train_loader, val_loader, device)

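import config # this only shows up after running wandb sweep --project my-awesome-project sweep_config.yaml
sweep_id = wandb.sweep(config.sweep_config)
# wandb.agent calls the function with no arguments, so the train passed here must be a zero-argument callable
wandb.agent(sweep_id, train, count=2)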
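The post does not show what `config.sweep_config` contains. A minimal sketch of what such a dictionary might look like is below, assuming it lives in a `config.py` module as the import suggests. The search method, value ranges, and choice of parameters are all assumptions for illustration. The metric name is taken from the `_val_loss` key logged earlier, and `check_sweep_config` is a hypothetical sanity check, not a wandb function.

```python
# Hypothetical sweep configuration; every value below is an assumption for illustration.
sweep_config = {
    "method": "random",  # other supported methods: "grid", "bayes"
    "metric": {"name": "_val_loss", "goal": "minimize"},  # must match a key passed to wandb.log
    "parameters": {
        "learning_rate": {"distribution": "uniform", "min": 1e-5, "max": 1e-3},
        "batch_size": {"values": [32, 64, 96]},
        "hidden_dim": {"values": [64, 128, 256]},
    },
}


def check_sweep_config(cfg: dict) -> bool:
    """Return True if every parameter has either a list of values or a min/max range."""
    for spec in cfg["parameters"].values():
        has_values = "values" in spec
        has_range = "min" in spec and "max" in spec
        if not (has_values or has_range):
            return False
    return True


is_valid = check_sweep_config(sweep_config)
```

The same structure can be written as YAML for the `wandb sweep` command line; the dictionary form is what `wandb.sweep(...)` takes when the sweep is created from Python.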
