How to read contents of a CSV file inside zip file using spark (python) [duplicate] How to read contents of a CSV file inside zip file using spark (python) [duplicate] hadoop hadoop

How to read contents of a CSV file inside zip file using spark (python) [duplicate]


Here you want to read all csv files inside the zip files recursively.

val files = sc.CSVFiles("file://path/to/files/*.zip")files.flatMap({case (name, content) =>  unzip(content)})def unzip(content: String): List[String] = {  ...}