pythonspark实例,如何在Python中创建示例Spark dataFrame?

I want to create a sample DataFrame but the following code is not working:

df = spark.createDataFrame(["10","11","13"], ("age"))

## ValueError

## ...

## ValueError: Could not parse datatype: age

Expected result is:

age

10

11

13

解决方案the following code is not working

With single element you need a schema as type

spark.createDataFrame(["10","11","13"], "string").toDF("age")

or DataType:

from pyspark.sql.types import StringType

spark.createDataFrame(["10","11","13"], StringType()).toDF("age")

With name elements should be tuples and schema as sequence:

spark.createDataFrame([("10", ), ("11", ), ("13", )], ["age"])

你可能感兴趣的:(pythonspark实例)