Domain Adaptation using Normalizing Flows

This repository includes code for the course project of UW CSE599J class Data Centeric Machine Learning. The topic is unsupervised domain adaptation for binary classification, under the assumption of covariate shift, on Amazon reviews dataset [Ni et. al (2019)] using transport maps learned via the machine learning architecture of normalizing flows. We use masked autoregressive flows [Papamakarios et al. (2021)] specifically to learn transport maps between source and target domain features.

Let $(X_S, Y_S)$ denote the source domain where $X_S$ are text embeddings and $Y_S$ are binary labels. Denote by $(X_T, Y_T)$ similar quantities for the target domain. Under our setup, labeled i.i.d. samples are available from the source domain but only unlabeled samples are available from the target domain. A classifier learned on the labeled source data is transferred to the target domain by mapping the source and target features to a latent space, distributed as a standard normal distribution, using normalizing flows. That is, we learn normalizing flows $f_S$ and $f_T$ that map $X_S$ and $X_T$ to normal distribution respectively.

Text embeddings are created by:

Finetuning a BERT model on a set of random set of 20,000 data points for an empirical risk minimization task.
Using the penultimate layer of BERT to obtain the last hidden layer embeddings.

The code for finetuning BERT, creating splits for source and target domain, and obtaining embedding can be found in

./create_dataset.ipynb

The code for training masked autoregressive flows $f_S$ and $f_T$ and transferring classifier from source to target domain can be found in

./amazon.ipynb

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
results		results
utils		utils
Project_Report_Medha.pdf		Project_Report_Medha.pdf
README.md		README.md
__init__.py		__init__.py
amazon.ipynb		amazon.ipynb
create_dataset.ipynb		create_dataset.ipynb
normalizing_flow.py		normalizing_flow.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Domain Adaptation using Normalizing Flows

About

Uh oh!

Releases

Packages

Languages

medhaaga/UDA-NF

Folders and files

Latest commit

History

Repository files navigation

Domain Adaptation using Normalizing Flows

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages