pyspark.sql.streaming.DataStreamWriter.partitionBy

DataStreamWriter.partitionBy(*cols)

Partitions the output by the given columns on the file system.

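If specified, the output is laid out on the file system in a structure similar to Hive’s partitioning scheme, with one subdirectory per distinct value of each partition column (for example, year=2024/month=1/).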
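
The following is a minimal sketch of how partitionBy might be used with a file sink, not a definitive recipe. The helper names start_partitioned_sink and run_demo, the bucket column, and all paths are hypothetical and chosen only for illustration. The rate source and Parquet sink are assumptions made to keep the example small.

```python
def start_partitioned_sink(stream_df, output_path, checkpoint_path, partition_cols):
    """Start a streaming query that writes Parquet files partitioned by the given columns.

    stream_df is assumed to be a streaming DataFrame; this helper is hypothetical.
    """
    return (
        stream_df.writeStream
        .partitionBy(*partition_cols)                   # one subdirectory per column value
        .format("parquet")                              # file sinks honor partitionBy
        .option("checkpointLocation", checkpoint_path)  # required by file sinks
        .outputMode("append")
        .start(output_path)
    )


def run_demo(output_path="/tmp/partition_demo/out",
             checkpoint_path="/tmp/partition_demo/checkpoint"):
    """Hypothetical demo; requires PySpark to be installed. Paths are placeholders."""
    from pyspark.sql import SparkSession
    from pyspark.sql import functions as F

    spark = SparkSession.builder.master("local[2]").appName("partitionBy-demo").getOrCreate()

    # The built-in "rate" source emits rows with columns (timestamp, value).
    events = (
        spark.readStream.format("rate").option("rowsPerSecond", 5).load()
        .withColumn("bucket", F.col("value") % 3)  # illustrative partition column
    )

    query = start_partitioned_sink(events, output_path, checkpoint_path, ["bucket"])
    query.awaitTermination(10)  # let the query run for about 10 seconds
    query.stop()
    spark.stop()
```

After calling run_demo() in an environment with PySpark available, the output directory should contain subdirectories such as bucket=0/, bucket=1/, and bucket=2/, each holding the Parquet files for rows with that value.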