Back to the Drawing Board...
Why is this such a difficult problem?
It appears that we simply don't have enough Asian, Black, and Hispanic faces to form robust models for the respective classes.
The variation in the training set is not sufficient to model the variation in the testing set.
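One quick way to check this suspicion is to count how many training faces each class actually has. The snippet below is only a minimal sketch: the helper names, the `min_count` threshold, and the toy labels are hypothetical placeholders, not the real data set or its counts.

```python
from collections import Counter


def class_counts(labels):
    """Return (class, count) pairs, smallest class first."""
    return sorted(Counter(labels).items(), key=lambda kv: kv[1])


def underrepresented(labels, min_count):
    """Return the classes that have fewer than min_count training samples."""
    return [cls for cls, n in class_counts(labels) if n < min_count]


# Placeholder labels for illustration only (assumed, not the actual data).
toy_labels = ["a"] * 8 + ["b"] * 2 + ["c"] * 1

for cls, n in class_counts(toy_labels):
    print(f"{cls}: {n} training faces")

print("Too few samples:", underrepresented(toy_labels, min_count=3))
```

Run on the real training labels, a list like this would show which classes fall below whatever sample count we decide a robust model needs.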
We need more faces!!
Hmm ...