Installation — PySpark 3.2.0 documentation
spark.apache.org › docs › latestIf you want to install extra dependencies for a specific component, you can install it as below: pip install pyspark [ sql] For PySpark with/without a specific Hadoop version, you can install it by using PYSPARK_HADOOP_VERSION environment variables as below: PYSPARK_HADOOP_VERSION=2 .7 pip install pyspark.