What is batch quota in the context of offline inference (machine learning)?
It's just mean to parallel or distributed algorithm over N batches (i.e. MapReduce) instead of all dataset sizes. Offline as you just predict some large number of data points locally rather than real-time prediction, so that is the case you have a larger amount of data you are trying to predict (ex. weather data)