How to extract all the collected tweets in a single file
You can configure the HDFS sink to roll files (close the current file and start a new one) by time, event count, or size. So, if you want to keep writing events to the same file until a 120 MB limit is reached, set:
# Time-based rolling, in seconds (0 disables it)
hdfs.rollInterval = 0
# Size-based rolling, in bytes (125829120 = 120 MB)
hdfs.rollSize = 125829120
# Count-based rolling, in number of events, i.e. tweets in your case (0 disables it)
hdfs.rollCount = 0
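For context, here is a rough sketch of how these settings might sit in a full sink definition. The agent name `TwitterAgent`, sink name `HDFS`, channel name `MemChannel`, and the HDFS path are placeholders I made up, so substitute the names from your own config. Comments are on their own lines because a properties file treats `#` as a comment marker only at the start of a line.

```properties
# Hypothetical agent/sink/channel names; replace with your own.
TwitterAgent.sinks.HDFS.type = hdfs
TwitterAgent.sinks.HDFS.channel = MemChannel

# Placeholder path; point this at your own HDFS directory.
TwitterAgent.sinks.HDFS.hdfs.path = hdfs://localhost:9000/user/flume/tweets

# Write plain text instead of the default SequenceFile format.
TwitterAgent.sinks.HDFS.hdfs.fileType = DataStream
TwitterAgent.sinks.HDFS.hdfs.writeFormat = Text

# Disable time-based rolling.
TwitterAgent.sinks.HDFS.hdfs.rollInterval = 0
# Roll only once the file reaches 120 MB (120 * 1024 * 1024 bytes).
TwitterAgent.sinks.HDFS.hdfs.rollSize = 125829120
# Disable event-count-based rolling.
TwitterAgent.sinks.HDFS.hdfs.rollCount = 0
```

With both `rollInterval` and `rollCount` set to 0, size should be the only one of the three triggers left active, so tweets accumulate in one file until it reaches the size limit.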