Convert a text file to sequence format in Spark Java Convert a text file to sequence format in Spark Java hadoop hadoop

Convert a text file to sequence format in Spark Java


Change this:

JavaPairRDD<String, String> infile = ctx.wholeTextFiles("input_txt");infile.saveAsNewAPIHadoopFile("outfile.seq", String.class, String.class, SequenceFileOutputFormat.class);

to

JavaPairRDD<String, String> infile = ctx.wholeTextFiles("input_txt");JavaPairRDD<Text, Text> resultRDD = infile.mapToPair(f -> new Tuple2<>(new Text(f._1()), new Text(f._2())));resultRDD.saveAsNewAPIHadoopFile("outfile.seq", Text.class, Text.class, SequenceFileOutputFormat.class);