pyspark.sql.DataFrameNaFunctions

class pyspark.sql.DataFrameNaFunctions(df)[source]

Functionality for working with missing data in DataFrame.

New in version 1.4.

__init__(df)[source]

Initialize self. See help(type(self)) for accurate signature.

Methods

__init__(df)

Initialize self.

drop([how, thresh, subset])

Returns a new DataFrame omitting rows with null values.

fill(value[, subset])

Replace null values, alias for na.fill().

replace(to_replace[, value, subset])

Returns a new DataFrame replacing a value with another value.