Learning domain invariant representations by joint Wasserstein distance minimization