pyspark.sql.streaming.DataStreamWriter.partitionBy

DataStreamWriter.partitionBy(*cols)[source]

Partitions the output by the given columns on the file system.

If specified, the output is laid out on the file system similar to Hive’s partitioning scheme.

Note

Evolving.

Parameters

cols – name of columns

New in version 2.0.