'PipelinedRDD' object has no attribute 'toDF' in PySpark
https://www.py4u.net/discuss/2577601'PipelinedRDD' object has no attribute 'toDF' in PySpark . I'm trying to load an SVM file and convert it to a DataFrame so I can use the ML module (Pipeline ML) from Spark. I've just installed a fresh Spark 1.5.0 on an Ubuntu 14.04 (no spark-env.sh configured).. My my_script.py is:. from pyspark.mllib.util import MLUtils from pyspark import SparkContext sc = SparkContext("local", …