Towards limiting the big RDD
hadoop