pyspark.sql.functions.
schema_of_json
Parses a JSON string and infers its schema in DDL format.
json – a JSON string or a string literal containing a JSON string.
options – options to control parsing. accepts the same options as the JSON datasource
Changed in version 3.0: It accepts options parameter to control schema inferring.
>>> df = spark.range(1) >>> df.select(schema_of_json(lit('{"a": 0}')).alias("json")).collect() [Row(json='struct<a:bigint>')] >>> schema = schema_of_json('{a: 1}', {'allowUnquotedFieldNames':'true'}) >>> df.select(schema.alias("json")).collect() [Row(json='struct<a:bigint>')]
New in version 2.4.