pyspark.sql.DataFrameWriter.format

DataFrameWriter.format(source)

Specifies the underlying output data source.

Parameters

source – string, name of the data source, e.g. 'json', 'parquet'.

>>> import os, tempfile
>>> df.write.format('json').save(os.path.join(tempfile.mkdtemp(), 'data'))

New in version 1.4.