Skip to content

Commit

Permalink
wordsmith sqlp command description
Browse files Browse the repository at this point in the history
remove link to Polars SQL intro which is dated and talks more about py-polars flavor of Polars SQL
  • Loading branch information
jqnatividad committed Jan 3, 2024
1 parent 00b957e commit 7ac0602
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -76,7 +76,7 @@
| [sort](/src/cmd/sort.rs#L2)<br>πŸš€πŸ€― | Sorts CSV data in alphabetical (with case-insensitive option), numerical, reverse, unique or random (with optional seed) order (See also `extsort` & `sortcheck` commands). |
| [sortcheck](/src/cmd/sortcheck.rs#L2)<br>πŸ“‡ | Check if a CSV is sorted. With the --json options, also retrieve record count, sort breaks & duplicate count. |
| [split](/src/cmd/split.rs#L2)<br>πŸ“‡πŸŽοΈ | Split one CSV file into many CSV files of N chunks. Uses multithreading to go faster if an index is present. |
| [sqlp](/src/cmd/sqlp.rs#L2)<br>βœ¨πŸš€πŸ»β€β„οΈ | Run blazing-fast Polars SQL queries against several CSVs - converting queries to [fast LazyFrame expressions](https://pola-rs.github.io/polars-book/user-guide/sql/intro/), processing larger than memory CSV files. |
| [sqlp](/src/cmd/sqlp.rs#L2)<br>βœ¨πŸš€πŸ»β€β„οΈ | Run [Polars](https://pola.rs) SQL queries against several CSVs - converting queries to blazing-fast [LazyFrame](https://stackoverflow.com/questions/76612163/what-are-the-advantages-of-a-polars-lazyframe-over-a-dataframe) expressions, processing larger than memory CSV files. |
| [stats](/src/cmd/stats.rs#L2)<br>πŸ“‡πŸ€―πŸŽοΈ | Compute [summary statistics](https://en.wikipedia.org/wiki/Summary_statistics) (sum, min/max/range, min/max length, mean, stddev, variance, nullcount, sparsity, quartiles, IQR, lower/upper fences, skewness, median, mode/s, antimode/s & cardinality) & make GUARANTEED data type inferences (Null, String, Float, Integer, Date, DateTime, Boolean) for each column in a CSV.<br>Uses multithreading to go faster if an index is present (with an index, can compile "streaming" stats on NYC's 311 data (15gb, 28m rows) in less than 20 seconds). |
| [table](/src/cmd/table.rs#L2)<br>🀯 | Show aligned output of a CSV using [elastic tabstops](https://github.com/BurntSushi/tabwriter). To interactively view CSV files, qsv pairs well with [csvlens](https://github.com/YS-L/csvlens#csvlens). |
| [to](/src/cmd/to.rs#L2)<br>βœ¨πŸš€ | Convert CSV files to [PostgreSQL](https://www.postgresql.org), [SQLite](https://www.sqlite.org/index.html), XLSX, [Parquet](https://parquet.apache.org) and [Data Package](https://datahub.io/docs/data-packages/tabular). |
Expand Down

0 comments on commit 7ac0602

Please sign in to comment.