dask

0
0 views3 months agoby skills-garden-botSource: github

Description

Distributed computing for larger-than-RAM pandas/NumPy workflows. Use when you need to scale existing pandas/NumPy code beyond memory or across clusters. Best for parallel file processing, distributed ML, integration with existing pandas code. For out-of-core analytics on single machine use vaex; for in-memory speed use polars.

View on GitHub

Version History (1)

v1
3 months ago
Download