Continuously Indexed Domain Adaptation (CIDA)

An unofficial refactor of the code used by the ICML 2020 paper Continuously Indexed Domain Adaptation. [Official Code].

Install

$ pip install git+https://github.com/sshh12/cida.git --upgrade

Usage

The code should work with any dataset that serves data in the correct format (see RotatedMNIST for a reference implementation). I created this repo for my specific use case, so some trivial/generic features may be missing; feel free to create an issue/PR.

Important Note: Domain adaptation models differ from other models in how the train/test datasets are used. Rather than keeping them distinct, the CIDA model is trained on both the train and test sets (i.e. the .fit() DataLoaders should feed both); however, the model ignores the labels for any example where is_train is set to False. This allows the model to learn the continuous relationship between the train and test domains while also optimizing the classification/regression task on the is_train examples. At no point (except, of course, during evaluation) does the model see the labels for the test data.
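
As a rough illustration of that format, here is a hypothetical minimal dataset sketch modeled on how RotatedMNIST is consumed below. The exact field layout (input, label, domain, is_train) is an assumption based on the description above, not a documented contract; check RotatedMNIST for the authoritative format.

from torch.utils.data import Dataset

class MyDomainDataset(Dataset):
    """Hypothetical sketch: each item carries its input, label, continuous
    domain index, and an is_train flag. The field layout here is an
    assumption; see RotatedMNIST for the real format."""

    def __init__(self, xs, ys, domains, is_train):
        self.xs = xs              # inputs, e.g. float tensor of shape (N, input_dims)
        self.ys = ys              # labels; ignored by the model when is_train is False
        self.domains = domains    # continuous domain index, shape (N, domain_dims)
        self.is_train = is_train  # bool per example: True = labeled (train) domain

    def __len__(self):
        return len(self.xs)

    def __getitem__(self, idx):
        # Assumed ordering: (input, label, domain, is_train)
        return self.xs[idx], self.ys[idx], self.domains[idx], self.is_train[idx]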

Classification (Rotated MNIST)

import os

from torch.utils.data import DataLoader
from cida.datasets import RotatedMNIST
from cida.conv_model import ConvPCIDAClassifier

if __name__ == "__main__":
    # RotatedMNIST.download()  # uncomment on first run to download the MNIST data
    # The rotation angle is the continuous domain index; examples with angles in
    # train_range are labeled (is_train=True), the rest are adaptation targets.
    dataset = RotatedMNIST(
        os.path.join("data", "MNIST", "processed", "training.pt"), rotate_range=(0, 360), train_range=(0, 45)
    )
    # Per the note above, both loaders serve the full dataset (train + test
    # domains); labels are ignored wherever is_train is False.
    dataloader = DataLoader(
        dataset=dataset,
        shuffle=True,
        batch_size=100,
        num_workers=1,
    )
    val_dataloader = DataLoader(
        dataset=dataset,
        shuffle=True,
        batch_size=100,
        num_workers=1,
    )
    model = ConvPCIDAClassifier(
        classes=10, input_dims=28 * 28, domain_dims=1, domains_to_labels=RotatedMNIST.domains_to_labels, verbose=True
    )
    model = model.to("cpu")
    model.fit(dataloader, val_dataloader, epochs=100)
    print(model.predict(next(iter(val_dataloader))))

Regression

from torch.utils.data import DataLoader
from cida.regr_model import PCIDARegressor

def mse(y_pred, y):
    return ((y_pred - y) ** 2).mean()

if __name__ == "__main__":
    # MyRegressionDataset is a stand-in for your own dataset; it must serve
    # examples in the format described above (see RotatedMNIST for reference).
    dataset = MyRegressionDataset()
    dataloader = DataLoader(
        dataset=dataset,
        shuffle=True,
        batch_size=128,
        num_workers=1,
    )
    val_dataloader = DataLoader(
        dataset=dataset,
        shuffle=True,
        batch_size=128,
        num_workers=1,
    )
    model = PCIDARegressor(
        input_dims=1000,
        domain_dims=1,
        domains_to_labels=dataset.domains_to_labels,
        lr=3e-4,
        beta1=0.9,
        gamma=0.99,
        weight_decay=5e-4,
        lambda_gan=lambda epoch: 3.0,  # adversarial loss weight, as a function of epoch
        latent_size=64,
        test_domain_known=True,
        metrics={"mse": mse},
        verbose=True,
    )
    model = model.to("cuda")
    model.fit(dataloader, val_dataloader, epochs=100, save_metric="test_mse", save_fn="cida-best.pth")
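
To reuse the saved weights later, something like the following sketch should work, assuming save_fn writes a plain PyTorch state_dict (this is an assumption, not a documented part of the API):

import torch

# Sketch only: re-create the model with the same constructor arguments used
# for training (abbreviated here), then load the saved weights.
model = PCIDARegressor(
    input_dims=1000,
    domain_dims=1,
    domains_to_labels=dataset.domains_to_labels,
    latent_size=64,
    test_domain_known=True,
    metrics={"mse": mse},
)
model.load_state_dict(torch.load("cida-best.pth"))
model.eval()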

Reference

For citing the original paper:

@inproceedings{DBLP:conf/icml/WangHK20,
  author    = {Hao Wang and
               Hao He and
               Dina Katabi},
  title     = {Continuously Indexed Domain Adaptation},
  booktitle = {ICML},
  year      = {2020}
}
