This is the basic test set bound:  split the labeled data into train and 
test sets and use them for training and testing, respectively.  The unlabeled
data is ignored.
