catalog / Data & ETL / Polars
ConnectorData & ETLFree

Polars

Blazing-fast DataFrame library written in Rust with a Python API — handles datasets that don't fit in RAM.

Installations390k
Note★ 4.8
Avis130
Dépôt source

Polars

Polars is a high-performance DataFrame library written in Rust with Python, R, and Node.js bindings. It uses a columnar memory layout (Apache Arrow), a lazy execution engine, and aggressive parallelism to outperform Pandas by 5–100x on many operations.

Key Features

  • Lazy API: Build a query plan and let Polars optimize + execute it — ideal for large datasets
  • Multi-threaded: Automatically parallelizes operations across all CPU cores
  • Streaming mode: Process datasets larger than RAM in fixed-memory chunks
  • Expressive expressions: A composable, chainable expression syntax with no index ambiguity
  • Arrow-native: Zero-copy interop with PyArrow, DuckDB, pandas, and Hugging Face datasets
  • Native I/O: Read/write Parquet, CSV, JSON, IPC, Avro, databases, and cloud storage

Quick Start

pip install polars
import polars as pl

df = pl.read_parquet("events.parquet")

result = (
    df.lazy()
    .filter(pl.col("event_type") == "purchase")
    .group_by("user_id")
    .agg(pl.col("amount").sum().alias("total_spent"))
    .sort("total_spent", descending=True)
    .limit(100)
    .collect()
)

Add to ai-supply

npx ai-supply add polars-dataframe-library

Curated mirror of the open-source Polars (MIT). Get it from the source.

More from @ai-supply

View profile →
Model
llama.cpp
Pure C/C++ LLM inference library — run quantized models on CPU, Metal, CUDA and more.
900k4.9
Connector
vLLM
High-throughput, memory-efficient LLM inference engine with PagedAttention and continuous batching.
820k4.9
Agent
MetaGPT
Multi-agent framework that assigns GPT roles (PM, engineer, QA) to solve complex software tasks end-to-end.
820k4.8
Skill
NLTK
The Natural Language Toolkit — Python's foundational NLP library for tokenization, POS tagging, parsing, and corpora.
760k4.7