DataFrame — Dask documentation
A Dask DataFrame is a large parallel DataFrame composed of many smaller Pandas DataFrames, split along the index. These Pandas DataFrames may live on disk for larger-than-memory computing on a single machine, or on many different machines in a cluster. One Dask DataFrame operation triggers many operations on the constituent Pandas DataFrames.
dask.dataframe.from_pandas — Dask documentation
Construct a Dask DataFrame from a Pandas DataFrame. This splits an in-memory Pandas DataFrame into several parts and constructs a Dask DataFrame from those parts, on which Dask can operate in parallel. By default, the input DataFrame is sorted by the index to produce cleanly divided partitions (with known divisions).