rcv1sub2 function

Dataset from the Reuters corpus (subset 2)