CLI tool for blazing-fast CSV data-wrangling
Top 16.1% on sourcepulse
qsv is a command-line data wrangling toolkit designed for blazing-fast processing of tabular data. It offers a comprehensive suite of commands for querying, transforming, analyzing, and validating CSV and other file formats, targeting data analysts and engineers who need efficient data manipulation capabilities.
How It Works
qsv is built in Rust, prioritizing speed and memory efficiency. It leverages multithreading extensively, especially when an index is available, and employs streaming algorithms for most operations to handle arbitrarily large files. Key features include an optional indexing mechanism for constant-time random access, support for various data formats beyond CSV (like Parquet, JSON, Excel), and integration with Luau and Python for complex data pipelines.
Quick Start & Requirements
cargo install qsv --locked --features all_features
is also supported.py
command.Highlighted Details
Maintenance & Community
The project is sponsored by datHere. Community interaction is facilitated through GitHub discussions.
Licensing & Compatibility
Dual-licensed under MIT or the UNLICENSE, allowing for commercial use and integration with closed-source projects.
Limitations & Caveats
Some commands (marked with 🤯) load the entire CSV into memory, though external variants exist. The luau
feature may not be available in musl prebuilt binaries and requires compilation from source on a musl-based distro. CPU optimizations in prebuilt binaries may cause issues on older CPUs.
1 day ago
1 day