Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

docs(blog): ibis, duckdb and lonboard for overture maps #10143

Closed
wants to merge 107 commits into from
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
107 commits
Select commit Hold shift + click to select a range
de4c7eb
docs(blog): ibis-duckdb and lonboard for overture maps
ncclementi Sep 13, 2024
c9a749b
chore: add visualizations as pngs
ncclementi Sep 16, 2024
a943dac
chore: add freeze false
ncclementi Sep 16, 2024
e941a3b
chore: apply code review comments
ncclementi Sep 16, 2024
f9b905b
fix(datafusion): raise when attempting to create temp table (#10072)
ncclementi Sep 10, 2024
f64a7c6
depr(selectors): deprecate `c` and `r` selectors in favor of `cols` a…
jcrist Sep 10, 2024
f329636
depr(api): deprecate `bool_val.negate()`/`-bool_val` in favor of `~bo…
jcrist Sep 10, 2024
bd06b3f
test(backends): use slightly more useful assertions for call count
cpcloud Sep 10, 2024
d8b2510
test(bigquery): make test non-strict xfail (#10081)
cpcloud Sep 10, 2024
9050b5c
feat(mssql): add lpad and rpad ops (#10060)
IndexSeek Sep 10, 2024
bcee24f
refactor(mssql): simplify lpad and rpad ops (#10085)
IndexSeek Sep 10, 2024
4ec6cac
doc: add docstring example for `topk`
jcrist Sep 10, 2024
abf0863
feat(api): add `name` argument to `topk`
jcrist Sep 10, 2024
521958b
feat(api): add `name` argument to `value_counts`
jcrist Sep 10, 2024
0ae44aa
test(polars): replace register with create_table
IndexSeek Sep 11, 2024
2b22433
fix(docs): update invalid read_parquet link
IndexSeek Sep 11, 2024
20e0bb7
perf(backends): speed up most memtable existence checks (#10067)
cpcloud Sep 11, 2024
11fb399
ci(backends): run backend doctests in CI (#9970)
cpcloud Sep 11, 2024
c3a7481
revert: fix(datafusion): raise when attempting to create temp table (…
gforsyth Sep 11, 2024
39ebbb5
ci: test examples (#10098)
cpcloud Sep 11, 2024
a507754
chore(release): 9.5.0
semantic-release-bot Sep 11, 2024
c1fa0b5
refactor(sql): simplify paren handling for binary ops
jcrist Sep 11, 2024
d0e72e9
chore(deps): update actions/create-github-app-token action to v1.11.0…
renovate[bot] Sep 12, 2024
f15b1e6
docs(dropdowns): make dropdowns scrollable and easier to see in navig…
ramlakhanmadheshiya Sep 12, 2024
6b7ceb8
test(polars): unxfail polars timestamp truncation tests by casting th…
cpcloud Sep 12, 2024
bbf4294
chore(deps): bump rich lower bound for docs/dev work (#10105)
gforsyth Sep 12, 2024
5a6f29f
docs: avoid needing to render API docs for any preview/render invocat…
cpcloud Sep 12, 2024
a88f384
refactor(duckdb): replace register usage with read
IndexSeek Sep 13, 2024
21877db
test(datafusion): replace register with create_table or read
IndexSeek Sep 13, 2024
d203eb4
refactor(dask): remove the dask backend
cpcloud Aug 4, 2024
21a858c
ci: disable verification of removed deprecations
cpcloud Sep 12, 2024
6c07598
chore: remove unused snapshots (#10107)
cpcloud Sep 13, 2024
a22527f
test(sigcheck): check function signature parity across backends (#10008)
gforsyth Sep 13, 2024
e3a02b8
test: remove dask backend marker remnants (#10114)
cpcloud Sep 13, 2024
d2a9060
fix(mysql): add dtype mapping for `mediumint`
gforsyth Sep 13, 2024
198b886
docs(how-to): fix the `ffill`/`bfill` how-to guide
deepyaman Sep 13, 2024
87ee637
docs(security): update security report address to point to private Zu…
gforsyth Sep 13, 2024
207f334
docs(code_of_conduct): update committee members and reporting email (…
gforsyth Sep 13, 2024
9f9f953
refactor(pandas): remove the pandas backend
cpcloud Aug 5, 2024
d58c619
ci: remove pandas backend jobs
cpcloud Sep 13, 2024
6d566d1
test: skip generic operation test that uses pandas if pandas is not i…
cpcloud Sep 13, 2024
b0603a2
test: remove dead fixture
cpcloud Sep 13, 2024
83606f5
test(dot-columns): add benchmark for columns property access
cpcloud Aug 26, 2024
e0e79ae
perf(api): return `tuple` from `Table.columns` instead of `list`
cpcloud Aug 26, 2024
becc6ea
test(backends): fix backend tests that assume a list
cpcloud Aug 26, 2024
3df1e32
chore(deps): update apache/impala docker tag to v4.4.1 (#10126)
renovate[bot] Sep 14, 2024
79c246e
refactor(bigquery): remove unnecessary and misspelled bigquery string…
tswast Sep 14, 2024
d8debde
fix(joins): allow chaining positional and cross joins (#10122)
gforsyth Sep 14, 2024
d49767b
chore(deps): bump poetry2nix and nixpkgs (#10127)
cpcloud Sep 14, 2024
ba36c77
ci: use jupyter cache in docs ci builds to speed up docs builds (#10128)
cpcloud Sep 14, 2024
019aed4
ci: `before` is a top level field (#10129)
cpcloud Sep 14, 2024
4d8a3b3
fix(repr): remove expression printing from exception message (#10130)
cpcloud Sep 15, 2024
01bfb6f
chore(deps): remove the `pandas` extra (#10132)
cpcloud Sep 15, 2024
3d7d530
fix(deps): update dependency sqlglot to >=23.4,<25.22 (#10109)
renovate[bot] Sep 15, 2024
1dc97a4
refactor(table-api): unify exception type for all backends to `TableN…
ncclementi Sep 16, 2024
b54f528
test(bigquery): check the correct exception for missing tables (#10137)
cpcloud Sep 16, 2024
dc16506
feat(api): add `distinct` option to `collect`
jcrist Sep 13, 2024
2b95c18
docs(api): avoid quartodoc warning about missing parameter
cpcloud Sep 16, 2024
317b055
chore: remove unused `Dispatched` utility
jcrist Sep 16, 2024
25ef934
docs(datafusion): add datafusion nyc presentation (#10141)
gforsyth Sep 16, 2024
7055525
chore(deps): update bitnami/minio docker tag to v2024.9.13 (#10146)
renovate[bot] Sep 17, 2024
3ad6e6d
chore(deps): update apache/druid docker tag to v30.0.1 (#10145)
renovate[bot] Sep 17, 2024
f910cef
chore(deps): lock file maintenance (#10134)
renovate[bot] Sep 17, 2024
3a83aa4
refactor(padding): follow python string padding conventions (#10096)
gforsyth Sep 17, 2024
75dac5c
fix(deps): update dependency datafusion to v41 (#10147)
renovate[bot] Sep 17, 2024
bde00c3
docs(datafusion): assorted edits to datafusion meetup talk (#10144)
gforsyth Sep 17, 2024
14af3f4
docs(datafusion): update talk title (#10150)
gforsyth Sep 17, 2024
6aa7c44
chore(deps): update ghcr.io/risingwavelabs/risingwave docker tag to v…
renovate[bot] Sep 18, 2024
e0d3dde
chore(deps): update trinodb/trino docker tag to v458 (#10155)
renovate[bot] Sep 18, 2024
64e97c9
chore: show deprecation warning at caller level (#10154)
NickCrews Sep 18, 2024
147f3f1
ci(pyspark): name the output path when downloading jar (#10156)
cpcloud Sep 18, 2024
59ae216
ci: run datafusion tests in series to avoid high memory usage (#10158)
cpcloud Sep 18, 2024
c3a7b43
ci(bigquery): avoid race condition in create table by using a dataset…
cpcloud Sep 18, 2024
c33c263
chore(mysql): port to MySQLdb instead of pymysql (#10077)
cpcloud Sep 18, 2024
9402725
docs(datafusion): add imdb live demo reference to end of presentation…
gforsyth Sep 18, 2024
c5ccb99
docs(table-expr): include inherited methods (all `to_*` methods) (#10…
gforsyth Sep 18, 2024
1ae4bd1
docs(synonyms): add synonym list to redirect searches with no results…
gforsyth Sep 18, 2024
2aa51ae
test(bigquery): avoid trying to clobber existing tables by generating…
cpcloud Sep 18, 2024
f34106f
fix(datatype-parsing): ensure that geospatial types are round trippab…
cpcloud Sep 19, 2024
3e9396c
docs(bigquery): add update-adc flag to gcloud auth login (#10172)
cpcloud Sep 19, 2024
12023e6
fix(mssql): ensure `ibis.random()` generates a new value per call (#1…
jcrist Sep 19, 2024
e5c11af
docs(clickhouse): entry into the accursed (#10174)
cpcloud Sep 19, 2024
fcd7564
chore: add governance paragraph and link to governance doc to README …
ncclementi Sep 20, 2024
7ca2fdc
fix(deps): update dependency sqlglot to >=23.4,<25.23 (#10176)
renovate[bot] Sep 20, 2024
7c6a176
test(bigquery): ensure that quoting test generates unique table name …
cpcloud Sep 20, 2024
2cb5771
chore(docs): clean up current pyodide build (#10180)
cpcloud Sep 20, 2024
fc626cc
ci: add automatic pull request labels (#10181)
cpcloud Sep 20, 2024
4957854
refactor(api): remove schema (#10149)
ncclementi Sep 20, 2024
19c3845
docs(build): fetch all commits to enable proper dynamic versioning in…
cpcloud Sep 20, 2024
a563504
feat(bigquery): non-nullable schema support for embedded fields in st…
ssabdb Sep 21, 2024
cf39ea0
chore: remove `poetry.lock` from nix label patterns (#10192)
cpcloud Sep 23, 2024
21d748c
chore(deps): lock file maintenance (#10191)
renovate[bot] Sep 23, 2024
7cdd2ee
chore(deps): update dependency itables to >=1.6.3,<2.3 (#10190)
renovate[bot] Sep 23, 2024
b0060e4
feat(polars): allow user to specify "engine" kwarg (#10151)
deepyaman Sep 23, 2024
0284736
fix(polars): use elementwise flatten to flatten nested arrays (#10168)
cpcloud Sep 23, 2024
39362cd
test(bigquery): xfail nested flatten test (#10194)
cpcloud Sep 23, 2024
13f9291
fix(api): use `to_pyarrow()` instead of `execute()` when pretty print…
cpcloud Sep 23, 2024
cd9ee1b
refactor(joins): require explicit abstract table as RHS of joins (#9661)
gforsyth Sep 23, 2024
b7135e7
feat(pyspark): add official support and ci testing with spark connect…
cpcloud Sep 23, 2024
3c223de
fix(snowflake): apply casting logic for json output to scalars (#10202)
cpcloud Sep 24, 2024
8191a05
chore(value): remove deprecated `greatest` and `least` methods
gforsyth Sep 23, 2024
7689b6d
chore(api): remove deprecated `where` methodism
gforsyth Sep 23, 2024
5d0e807
chore(api): remove top-level geo functions
gforsyth Sep 23, 2024
67ee2b4
chore(api): remove top-level `negate` function
gforsyth Sep 23, 2024
d16d0c7
refactor(pyarrow-format): avoid constructing unnecessary array to pro…
cpcloud Sep 24, 2024
8f66695
chore(deps): bump nix flake dependencies (#10203)
cpcloud Sep 24, 2024
92f1a89
chore: udpate after duckdb 1.1.1 release
ncclementi Sep 24, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
1 change: 1 addition & 0 deletions .env
Original file line number Diff line number Diff line change
Expand Up @@ -5,3 +5,4 @@ PGPASSWORD="postgres"
MYSQL_PWD="ibis"
MSSQL_SA_PASSWORD="1bis_Testing!"
DRUID_URL="druid://localhost:8082/druid/v2/sql"
SPARK_CONFIG=./docker/spark-connect/conf.properties
102 changes: 102 additions & 0 deletions .github/labeler.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,102 @@
# backends
bigquery:
- changed-files:
- any-glob-to-any-file: "ibis/backends/bigquery/**"

clickhouse:
- changed-files:
- any-glob-to-any-file: "ibis/backends/clickhouse/**"

datafusion:
- changed-files:
- any-glob-to-any-file: "ibis/backends/datafusion/**"

druid:
- changed-files:
- any-glob-to-any-file: "ibis/backends/druid/**"

duckdb:
- changed-files:
- any-glob-to-any-file: "ibis/backends/duckdb/**"

exasol:
- changed-files:
- any-glob-to-any-file: "ibis/backends/exasol/**"

flink:
- changed-files:
- any-glob-to-any-file: "ibis/backends/flink/**"

impala:
- changed-files:
- any-glob-to-any-file: "ibis/backends/impala/**"

mssql:
- changed-files:
- any-glob-to-any-file: "ibis/backends/mssql/**"

mysql:
- changed-files:
- any-glob-to-any-file: "ibis/backends/mysql/**"

oracle:
- changed-files:
- any-glob-to-any-file: "ibis/backends/oracle/**"

polars:
- changed-files:
- any-glob-to-any-file: "ibis/backends/polars/**"

postgres:
- changed-files:
- any-glob-to-any-file: "ibis/backends/postgres/**"

pyspark:
- changed-files:
- any-glob-to-any-file: "ibis/backends/pyspark/**"

risingwave:
- changed-files:
- any-glob-to-any-file: "ibis/backends/risingwave/**"

snowflake:
- changed-files:
- any-glob-to-any-file: "ibis/backends/snowflake/**"

sqlite:
- changed-files:
- any-glob-to-any-file: "ibis/backends/sqlite/**"

trino:
- changed-files:
- any-glob-to-any-file: "ibis/backends/trino/**"

# miscellaneous labels
tests:
- changed-files:
- any-glob-to-any-file: "**/tests/**"

nix:
- changed-files:
- any-glob-to-any-file: "**/*.nix"

datatypes:
- changed-files:
- any-glob-to-any-file: "ibis/expr/datatypes/**"

ci:
- changed-files:
- any-glob-to-any-file: ".github/**"

dependencies:
- changed-files:
- any-glob-to-any-file: "**/*.nix"
- any-glob-to-any-file: "poetry.lock"
- any-glob-to-any-file: "flake.lock"
- any-glob-to-any-file: "requirements-dev.txt"
- any-glob-to-any-file: "conda/*.yml"

docs:
- changed-files:
- any-glob-to-any-file: "**/*.qmd"
- any-glob-to-any-file: "**/*.md"
10 changes: 5 additions & 5 deletions .github/renovate.json
Original file line number Diff line number Diff line change
Expand Up @@ -29,10 +29,6 @@
"matchPackagePrefixes": ["clickhouse"],
"addLabels": ["clickhouse"]
},
{
"matchPackagePatterns": ["dask"],
"addLabels": ["dask"]
},
{
"matchPackagePatterns": ["datafusion"],
"addLabels": ["datafusion"]
Expand Down Expand Up @@ -71,7 +67,7 @@
"addLabels": ["druid"]
},
{
"matchPackagePatterns": ["pymysql", "mariadb"],
"matchPackagePatterns": ["mysqlclient", "mariadb"],
"addLabels": ["mysql"]
},
{
Expand All @@ -82,6 +78,10 @@
"matchPackagePatterns": ["pyspark"],
"addLabels": ["pyspark"]
},
{
"matchPackagePatterns": ["risingwave"],
"addLabels": ["risingwave"]
},
{
"matchPackagePatterns": ["snowflake-connector-python"],
"addLabels": ["snowflake"]
Expand Down
19 changes: 19 additions & 0 deletions .github/workflows/algolia/configure-algolia-api.py
Original file line number Diff line number Diff line change
Expand Up @@ -9,6 +9,15 @@
index_name = os.environ["ALGOLIA_INDEX"]


ONE_WAY_SYNONYMS = {
# A list of search terms that have (historically) not returned results
# that we can map to existing search terms that we know are good
"md5": ["hashbytes"],
"fetchdf": ["to_pandas", "to_polars", "to_pyarrow"],
"unique": ["distinct"],
}


def main():
client = SearchClient.create(app_id, api_key)
index = client.init_index(index_name)
Expand All @@ -30,6 +39,16 @@ def main():

index.set_settings(override_default_settings)

for input_, synonyms in ONE_WAY_SYNONYMS.items():
index.save_synonym(
{
"objectID": input_,
"type": "oneWaySynonym",
"input": input_,
"synonyms": synonyms,
}
)


if __name__ == "__main__":
main()
2 changes: 1 addition & 1 deletion .github/workflows/create-rotate-key-issue.yml
Original file line number Diff line number Diff line change
Expand Up @@ -11,7 +11,7 @@ jobs:
runs-on: ubuntu-latest
steps:
- name: Generate a GitHub token
uses: actions/create-github-app-token@v1.10.4
uses: actions/create-github-app-token@v1.11.0
id: generate_token
with:
app-id: ${{ secrets.SQUAWK_BOT_APP_ID }}
Expand Down
10 changes: 8 additions & 2 deletions .github/workflows/docs-preview.yml
Original file line number Diff line number Diff line change
Expand Up @@ -13,7 +13,7 @@ jobs:
cancel-in-progress: true
if: github.event.label.name == 'docs-preview'
steps:
- uses: actions/create-github-app-token@v1.10.4
- uses: actions/create-github-app-token@v1.11.0
id: generate_token
with:
app-id: ${{ secrets.DOCS_BOT_APP_ID }}
Expand Down Expand Up @@ -70,8 +70,14 @@ jobs:
done
} | tee /tmp/comment

- name: restore cache of the previously rendered notebooks
uses: actions/cache/restore@v4
with:
key: docs-${{ github.event.pull_request.base.sha }}
path: docs/**/.jupyter_cache

- name: build docs
run: nix develop --ignore-environment --keep HOME -c just docs-build-all
run: nix develop '.#ibis311' --ignore-environment --keep HOME -c just docs-build-all

- name: install netlify cli
run: npm install -g netlify-cli
Expand Down
4 changes: 2 additions & 2 deletions .github/workflows/ibis-backends-cloud.yml
Original file line number Diff line number Diff line change
Expand Up @@ -80,7 +80,7 @@ jobs:
fetch-depth: 0
ref: ${{ github.event.pull_request.head.sha }}

- uses: actions/create-github-app-token@v1.10.4
- uses: actions/create-github-app-token@v1.11.0
id: generate_token
with:
app-id: ${{ secrets.DOCS_BOT_APP_ID }}
Expand All @@ -107,7 +107,7 @@ jobs:
run: poetry add snowflake-snowpark-python --python="==${{ steps.install_python.outputs.python-version }}"

- name: install ibis
run: poetry install --without dev --without docs --extras "${{ join(matrix.backend.extras, ' ') }}"
run: poetry install --without dev --without docs --extras "${{ join(matrix.backend.extras, ' ') }} examples"

- uses: extractions/setup-just@v2
env:
Expand Down
Loading