WIP: update to Vega-Lite 5 #2517

ChristopherDavisUCI · 2021-11-09T16:08:33Z

(Draft version only!)

Surprisingly the biggest obstacle so far hasn't been params but a change to layer. I think in the newest Vega-Lite schema, charts in a layer are not allowed to specify height or width, which seems to break many Altair examples. Here is a minimal example that doesn't work:

no_data = pd.DataFrame()

c = alt.Chart(no_data).mark_circle().properties(
    width=100
)

c+c

I don't see a good way to deal with that. Do you have a suggestion?

I've read the list of "breaking changes" for the Vega-Lite 5.0.0 release and don't see anything that seems related to this, so it does make me wonder if maybe I misunderstand the cause of the problem.

Other things:

I usually try some tests by running things in Jupyter Notebook, but since making the change to Vega-Lite 5 that hasn't worked for me. Instead I get the following message in red the first time I try to display a chart, and on subsequent attempts I just get a blank response:

Error loading script: Script error for "vega-util", needed by: vega-lite
http://requirejs.org/docs/errors.html#scripterror

It does work in JupyterLab.
Some of the code changes have been experiments trying to learn how the old selection fits with the new parameter. It might be best to redo this code later now that I see more of the big picture.

mattijn · 2021-11-10T15:50:23Z

width within a layered mark seems the culprit, observe:

# python 3
from urllib.request import urlopen
import json
import jsonschema  # 3.2.0

def validate(vl_spec, vl_schema="v5.1.0"):
    schema = json.load(urlopen(f'https://vega.github.io/schema/vega-lite/{vl_schema}.json'))
    spec = json.loads(vl_spec)
    jsonschema.validate(spec, schema)
    
vl_spec = """
{
  "layer": [{"mark": "circle", "width": 100}, {"mark": "circle"}],
  "data": {"values": []}
}
"""
validate(vl_spec)

ValidationError: Additional properties are not allowed ('mark' was unexpected)
On instance:
    {'mark': 'circle', 'width': 100}

The above still validates OK in v4.17.0 (validate(vl_spec, vl_schema='v4.17.0')).

When placing the width outside the layered mark it validates without error in v5.1.0, like this:

vl_spec = """
{
  "width": 100,
  "layer": [{"mark": "circle"}, {"mark": "circle"}],
  "data": {"values": []}
}
"""
validate(vl_spec)

jakevdp · 2021-11-10T15:56:43Z

If we wanted, we could add logic to alt.layer that will move any width/height specification in child charts to the top level. I believe there is already some existing logic along these lines for other properties.

It's a minor thing, but it would be nice to remain backward-compatible on this.

ChristopherDavisUCI · 2021-11-10T16:39:37Z

Thank you! Do you have an example of a property where we do something similar to this? (It doesn't have to be in LayerChart.) If I have a template to use, I should be able to do the same thing for width and height.

So far at least with selection it's seemed straightforward to keep everything backwards compatible.

jakevdp · 2021-11-10T16:46:26Z

What I had in mind was the _combine_subchart_data step in alt.LayerChart: https://github.com/altair-viz/altair/blob/e11ab10bf2239af7db4b56ed61877756c431395f/altair/vegalite/v4/api.py#L2333
Basically this looks for common datasets in the layers, and moves them up to the top level. This is so you can do things like this:

base = alt.Chart(dataframe).encode(...)
chart = base.mark_point() + base.mark_line()

and only embed a single copy of the serialized dataframe in the output. We could do something similar for width and height: extract them from the underlying layers & apply them to the parent chart, raising an error if there are any mismatches.
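
A minimal sketch of that idea on plain spec dictionaries rather than Altair objects. The name hoist_layer_size is hypothetical and this is not how _combine_subchart_data is written; it only illustrates "extract, compare, move up, raise on mismatch":

```python
def hoist_layer_size(spec, keys=("width", "height")):
    """Return a copy of a layered spec with size keys moved to the top level.

    Hypothetical helper operating on a plain Vega-Lite dict, not Altair's API.
    """
    out = dict(spec)
    layers = [dict(layer) for layer in spec.get("layer", [])]
    for key in keys:
        # Collect every value declared for this key, in the layers or the parent.
        candidates = [layer.pop(key) for layer in layers if key in layer]
        if key in spec:
            candidates.append(spec[key])
        distinct = []
        for value in candidates:
            if value not in distinct:
                distinct.append(value)
        if len(distinct) > 1:
            raise ValueError(f"inconsistent values for {key!r}: {distinct}")
        if distinct:
            out[key] = distinct[0]
    if layers:
        out["layer"] = layers
    return out


spec = {
    "layer": [{"mark": "circle", "width": 100}, {"mark": "circle"}],
    "data": {"values": []},
}
fixed = hoist_layer_size(spec)
```

Here fixed carries "width": 100 at the top level and the layers no longer mention it, while the input spec is left unmodified.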

joelostblom · 2021-11-10T17:01:31Z

> I've read the list of "breaking changes" for the Vega-Lite 5.0.0 release and don't see anything that seems related to this, so it does make me wonder if maybe I misunderstand the cause of the problem.

I looked around a bit and I think this was initially introduced in VL4.1.1, but then reverted and included with a deprecation warning because Jake reported that it broke Altair. That deprecated code was removed in 5.0 in this PR, which is probably why it is breaking now. It seems like in addition to width and height, the view property is also affected so maybe we need to include that in this PR as well.

ChristopherDavisUCI · 2021-11-10T18:16:57Z

Thank you @joelostblom for tracking that down. Would you be able to give me an example where we use this view property in Altair? That would help with my testing. I tried searching through the documentation examples and didn't immediately find one.

jakevdp · 2021-11-10T18:53:08Z

One relevant piece of info here is that the Vega-lite renderer maintains backward compatibility for some things (such as passing width and height to within a layer) even though the Vega-Lite schema now forbids them. Because we validate based on the schema, Altair is actually a bit stricter than the Vega-Lite renderer.

We have a couple options for dealing with that:

1. Accept that Altair is stricter than Vega-Lite, and pass on those errors to users (the "do nothing" approach).
2. Maintain a list of allowable departures from the schema.
3. Do some preprocessing at initialization for specific issues.
4. Stop validating based on the schema.

I would lean toward (3) here, as (1) would lead to poorer usability, my sense is that (2) would add an undue maintenance burden, and (4) would be a large departure from Altair's current implementation.

joelostblom · 2021-11-10T18:58:47Z

@ChristopherDavisUCI I am not sure how much it is actually used, but alt.Chart does accept a view parameter, so I am guessing we should handle that the same as height and width (since it was deprecated in the same VL PR) and move it up to the top level of layered charts:

alt.Chart(alt.Data(values=[{}]), height=200, view=alt.ViewConfig()).mark_bar().to_dict()

{'config': {'view': {'continuousWidth': 400, 'continuousHeight': 300}},
 'data': {'values': [{}]},
 'mark': 'bar',
 'height': 200,
 'view': {},
 '$schema': 'https://vega.github.io/schema/vega-lite/v4.8.1.json'}

and using view instead of width in the validation example by @mattijn above gives the same ValidationError:

vl_spec = """
{
  "layer": [{"mark": "circle", "view": {}}, {"mark": "circle"}],
  "data": {"values": []}
}
"""

ChristopherDavisUCI · 2021-11-11T00:25:08Z

@joelostblom Thank you!

@jakevdp Thanks for the pointer to _combine_subchart_data. I think I was able to adapt that to get the layer examples working.

Edit: My current guess for the below example is we should just accept it needing to switch "selection" to "param". It seems reasonable to me that using an invalid property name in the dictionary is not going to have a nice fix.

Aside from rendering examples (literally zero of which are working), there's just one example that doesn't work that I know of, Scatter Plot with Minimap, because it explicitly specifies "selection": zoom.name. If we change it to "param": zoom.name then it works. Do you see a robust way to deal with that? I looked through the schema and I think "selection" as a property name only exists in "Config".

"""
Scatter Plot with Minimap
-------------------------
This example shows how to create a miniature version of a plot
such that creating a selection in the miniature version
adjusts the axis limits in another, more detailed view.
"""
# category: scatter plots

import altair as alt
from vega_datasets import data

source = data.seattle_weather()

zoom = alt.selection_interval(encodings=["x", "y"])

minimap = (
    alt.Chart(source)
    .mark_point()
    .add_selection(zoom)
    .encode(
        x="date:T",
        y="temp_max:Q",
        color=alt.condition(zoom, "weather", alt.value("lightgray")),
    )
    .properties(
        width=200,
        height=200,
        title="Minimap -- click and drag to zoom in the detail view",
    )
)

detail = (
    alt.Chart(source)
    .mark_point()
    .encode(
        x=alt.X(
            "date:T", scale=alt.Scale(domain={"selection": zoom.name, "encoding": "x"})
        ),
        y=alt.Y(
            "temp_max:Q",
            scale=alt.Scale(domain={"selection": zoom.name, "encoding": "y"}),
        ),
        color="weather",
    )
    .properties(width=600, height=400, title="Seattle weather -- detail view")
)

detail | minimap
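
For reference, one way the rename could be automated is a recursive rewrite of the spec dictionary. This is only a sketch under assumptions: selection_to_param is a hypothetical name, it only handles string-valued "selection" keys, it skips the top-level "config" block (where "selection" appears to be a legitimate key), and where such a step would hook into Altair's validation is left open.

```python
def selection_to_param(obj, _path=()):
    """Recursively rename {"selection": name} to {"param": name}.

    Hypothetical helper on plain dicts; only string-valued "selection"
    keys are renamed, and the top-level "config" block is left untouched.
    """
    if isinstance(obj, list):
        return [selection_to_param(item, _path) for item in obj]
    if not isinstance(obj, dict):
        return obj
    out = {}
    for key, value in obj.items():
        if not _path and key == "config":
            out[key] = value  # keep the config block as-is
        elif key == "selection" and isinstance(value, str):
            out["param"] = value
        else:
            out[key] = selection_to_param(value, _path + (key,))
    return out


domain = {"selection": "selector001", "encoding": "x"}
spec = {
    "config": {"selection": {"interval": {}}},
    "encoding": {"x": {"scale": {"domain": domain}}},
}
converted = selection_to_param(spec)
```

The selection name "selector001" is made up for the example; in the chart above it would be zoom.name.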

ChristopherDavisUCI · 2021-11-11T17:49:23Z

Here's a first example of using VariableParameter. Is that basically how we want it to work? Should we try to add functionality so opacity=alt.ExprRef("sample/1000") can be replaced by maybe something like opacity=alt.expr("sample/1000") or opacity=alt.expr(f"{size_var}/1000")?

import altair as alt
from vega_datasets import data

cars = data.cars.url

year_slider = alt.binding_range(min=0, max=1000, step=5)
size_var = alt.variable(bind=year_slider, value=200, name="sample")

c = alt.Chart(cars).mark_circle(size=size_var, opacity=alt.ExprRef("sample/1000")).encode(
    x='Miles_per_Gallon:Q',
    y='Horsepower:Q',
).add_variable(size_var)

c

The chart could also be defined this way:

c = alt.Chart(cars).mark_circle().encode(
    x='Miles_per_Gallon:Q',
    y='Horsepower:Q',
    size = alt.value(size_var),
    opacity = alt.value(alt.ExprRef("sample/1000"))
).add_variable(size_var)

jakevdp · 2021-11-11T18:48:15Z

That's nice! I think it would be cool if we could use the variables directly in the expression, i.e. something like opacity = size_var / 1000 which would basically output that same ExprRef.

We do a similar thing with alt.datum.value / 1000

mattijn · 2021-11-11T22:59:55Z

I played a bit in ChristopherDavisUCI@0b34ec6 and now this works:

cars = data.cars.url

year_slider = alt.binding_range(min=0, max=1, step=0.05)
size_var = alt.variable(bind=year_slider, value=0.2, name="sample")

c = alt.Chart(cars).mark_circle(
    size=size_var*1000,
    opacity=size_var + 0.2
).encode(
    x='Miles_per_Gallon:Q',
    y='Horsepower:Q',
).add_variable(size_var)

c

but this won't, opacity=size_var + 0.2 + 0.1:
TypeError: unsupported operand type(s) for +: 'ExprRef' and 'float'

Because the __add__ function includes the ExprRef:

    def __add__(self, other):
        return ExprRef(expr.core.BinaryExpressionReference("+", self.name, other))

It gets stuck in the second round of addition, since size_var + 0.2 has become an ExprRef and cannot add the 0.1. Another solution is needed, but maybe it's a step forward for someone else.

mattijn · 2021-11-12T13:32:09Z

I can completely imagine this should be done very differently, but now this also works:

import altair as alt
year_slider = alt.binding_range(min=0, max=1000, step=5)
size_var = alt.variable(bind=year_slider, value=200, name="sample")
expr = size_var ** 2 + 0.1
expr

ExprRef({
  expr: pow(sample,2) + 0.1
})

Where size_var ** 2 + 0.1 can be used eg. for the opacity channel in above example.

Changed code in this commit ChristopherDavisUCI@530a11a

ChristopherDavisUCI · 2021-11-12T16:02:14Z

Thanks @mattijn! I glanced and didn't immediately follow, but I will look more slowly later.

Is it obvious to you why the charts don't display for me in Jupyter notebook? Do you think it's in the same category as the changes you made related to vegalite_mimebundle from last round?

jakevdp · 2021-11-12T16:10:51Z

If you're using the jupyterlab/mimebundle renderer, that will not work, because the vega-lite extension does not yet support VL 5. Switch to the default renderer.

Even with the default renderer, you can still run into issues if charts with previous vega-lite versions are displayed in the notebook, because once one version of vega-lite is loaded, new ones will not load even if the version does not match. Try clearing all outputs, saving the notebook, then reloading the page.

If you're using jupyterlab, the same is true of any notebook tab you have opened in the history of the session, because different notebooks' JS & CSS environments are not sandboxed from each other (don't even get me started...). So close all notebooks, clear outputs from ones you plan to open, click reload the browser, and then open the notebooks you want to use.

mattijn · 2021-11-12T19:51:15Z

The basic idea is to add the numeric and comparison protocols to the Variable and ExprRef classes. This includes methods for mathematical operations such as addition and for comparisons such as equality checking. Addition (+) is represented as __add__(self, other), which carries out self + other.
So an expression x + y is interpreted as x.__add__(y).
In this case, within this __add__ method we take the defined name of the Variable (e.g. sample), append the operator + and the definition of other, and return this as an ExprRef. Given the following Variable:

import altair as alt
year_slider = alt.binding_range(min=0, max=1000, step=5)
size_var = alt.variable(bind=year_slider, value=200, name="sample")
size_var

Variable('sample', VariableParameter({
  bind: BindRange({
    input: 'range',
    max: 1000,
    min: 0,
    step: 5
  }),
  name: 'sample',
  value: 200
}))

And:

size_var.name

'sample'

Then size_var + 1 is interpreted as size_var.__add__(1), where the __add__ method combines size_var.name with + 1, which becomes sample + 1. This result is returned as ExprRef("sample + 1").
Therefore:

er = size_var + 1
er

ExprRef({
  expr: sample + 1
})

And then the same works if these mathematical operations are also defined as methods for ExprRef, where the defined expr is used instead of name (ExprRef.expr).
So:

er + 2

becomes er.__add__(2), which takes er.expr (in this case sample + 1) and appends + 2, so it becomes sample + 1 + 2 and is again returned as an ExprRef:

ExprRef({
  expr: sample + 1 + 2
})

The actual implementation is a different story though. I tried to integrate with the existing code in core.py, but had to duplicate code with only very little differences to get it to work. The same for the added code in api.py, this could probably be delegated or moved to somewhere more sensible. All in all, it's more like prototype code.
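
To make the protocol concrete without the duplication, here is a self-contained sketch that does not use Altair at all. Variable and Expr below are stand-in classes (assumptions for illustration, not the classes in the commits above), and unlike the output shown earlier, compound operands are parenthesized so that operator precedence survives chaining.

```python
class _Arithmetic:
    """Mixin: arithmetic operators that always return an Expr, so they chain."""

    def _operand(self):
        raise NotImplementedError

    def __add__(self, other):
        return Expr(f"{self._operand()} + {_as_operand(other)}")

    def __radd__(self, other):
        return Expr(f"{_as_operand(other)} + {self._operand()}")

    def __mul__(self, other):
        return Expr(f"{self._operand()} * {_as_operand(other)}")

    def __rmul__(self, other):
        return Expr(f"{_as_operand(other)} * {self._operand()}")

    def __pow__(self, other):
        # A function call is self-delimiting, so it needs no extra parentheses.
        return Expr(f"pow({self._operand()},{_as_operand(other)})", atomic=True)


class Variable(_Arithmetic):
    """Stand-in for a named parameter; renders as its name."""

    def __init__(self, name):
        self.name = name

    def _operand(self):
        return self.name


class Expr(_Arithmetic):
    """Stand-in for an expression reference; holds the expression string."""

    def __init__(self, expr, atomic=False):
        self.expr = expr
        self.atomic = atomic

    def _operand(self):
        # Parenthesize compound expressions to preserve evaluation order.
        return self.expr if self.atomic else f"({self.expr})"


def _as_operand(value):
    if isinstance(value, _Arithmetic):
        return value._operand()
    return repr(value)  # numbers only in this sketch


size_var = Variable("sample")
chained = size_var + 0.2 + 0.1      # second addition works: Expr + float
powered = size_var ** 2 + 0.1
scaled = 1000 * (size_var + 1)      # reflected operator, number on the left
```

Because both classes share one mixin, each operator is written once, and every result is an Expr that supports the same operators again.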

ChristopherDavisUCI · 2021-11-14T18:49:21Z

In the following example (which doesn't work at the moment), size_var is of type Variable. What's the right way to define the Variable class so that the following code will work? (Notice that size=size_var is being specified inside of encode, not inside of mark_circle. The example does work if you move size=size_var into the mark_circle method.)

import altair as alt
from vega_datasets import data

cars = data.cars.url

year_slider = alt.binding_range(min=0, max=1, step=0.05)
size_var = alt.variable(bind=year_slider, value=0.2, name="sample")

c = alt.Chart(cars).mark_circle().encode(
    x='Miles_per_Gallon:Q',
    y='Horsepower:Q',
    size=size_var,
).add_variable(size_var)

c

I'd like to get something like this working and then add on the arithmetic functionality that @mattijn described above.

mattijn · 2021-11-14T22:03:21Z

Hmm, per https://vega.github.io/vega-lite/docs/parameter.html, I don't think an expression reference is supported as input for an encoding channel. Within an encoding channel it can only be included as a predicate and for data extents (but then as param and not as expr).

ChristopherDavisUCI · 2021-11-14T23:38:31Z

@mattijn Ah, thank you, very interesting!

Playing around on the Vega editor, I think something like this is allowed, where the "expr" has been wrapped as a "value":

{
  "params": [
    { "name": "barHeight", "value": 5}
  ],
  "data": {
    "values": [
      {"a": "A"}, {"a": "B"}, {"a": "C"}
    ]
  },
  "mark": {
    "type": "bar"
  },
  "encoding": {
    "x": {"field": "a", "type": "nominal"},
    "y": {"value": {"expr": "barHeight"}}
  }
}

I guess on the Altair side, we should allow something like alt.value(bar_variable). Do you think we should allow further abbreviations? Maybe not, since we don't allow y = 5, for example (only y = alt.value(5))?
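
As a rough illustration of what alt.value(bar_variable) would need to emit, the sketch below builds the same spec as plain dictionaries. Param and value_channel are hypothetical names used only for this example; they are not Altair classes or functions.

```python
from dataclasses import dataclass


@dataclass
class Param:
    """Stand-in for a named variable parameter (hypothetical, not Altair's class)."""

    name: str
    value: object = None


def value_channel(arg):
    """Wrap a literal as {"value": ...}, or a Param as {"value": {"expr": name}}."""
    if isinstance(arg, Param):
        return {"value": {"expr": arg.name}}
    return {"value": arg}


bar_height = Param(name="barHeight", value=5)
spec = {
    "params": [{"name": bar_height.name, "value": bar_height.value}],
    "data": {"values": [{"a": "A"}, {"a": "B"}, {"a": "C"}]},
    "mark": {"type": "bar"},
    "encoding": {
        "x": {"field": "a", "type": "nominal"},
        "y": value_channel(bar_height),
    },
}
```

The resulting spec matches the JSON above; a literal such as value_channel(5) still produces the ordinary {"value": 5} form.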

mattijn · 2021-11-15T09:06:35Z

Ah yes I see, that makes sense. So then you can eventually do the same for datum and field. Something like this:

import altair as alt
from vega_datasets import data

cars = data.cars.url

mpg_slider = alt.binding_range(min=0, max=50, step=1)
rule_mpg = alt.variable(bind=mpg_slider, value=0.2, name="mpg_rule")

dots= alt.Chart(cars).mark_circle().encode(
    x='Miles_per_Gallon:Q',
    y='Horsepower:Q'
)

yrule = alt.Chart().mark_rule(strokeDash=[12, 6], size=2).encode(
    y=alt.datum(rule_mpg)  # <-- how to infer type? eg. alt.Y(alt.datum(rule_mpg), type='quantitative') ?
).add_variable(rule_mpg)

dots + yrule

Open the Chart in the Vega Editor

And:

import altair as alt
from vega_datasets import data

cars = data.cars.url

x_options = alt.binding_select(options=["Horsepower", "Acceleration"])
x_var = alt.variable(bind=x_options, value="Horsepower", name="var_x")

dots = alt.Chart(cars).mark_circle().encode(
    x=alt.field(x_var),  # <--- not existing yet on altair side and how to infer type?
    y='Horsepower:Q'
).add_variable(x_var)

Open the Chart in the Vega Editor (not possible, as raised in vega/vega-lite#7365)

jakevdp · 2021-11-15T13:34:42Z

One piece of context: the reason we require y=alt.value(5) rather than y=5 is because values may be strings, and otherwise y='x' would cause ambiguity between 'x' as a value and 'x' as a field name.
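
The ambiguity can be shown with a toy dispatcher. This is a simplified sketch, not Altair's shorthand parser: Value, TYPE_CODES, and channel_def are names invented for the illustration.

```python
TYPE_CODES = {"Q": "quantitative", "N": "nominal", "O": "ordinal", "T": "temporal"}


class Value:
    """Explicit marker for a constant (stand-in for the role alt.value plays)."""

    def __init__(self, value):
        self.value = value


def channel_def(arg):
    """Turn an encoding argument into a channel definition dict."""
    if isinstance(arg, Value):
        return {"value": arg.value}
    if isinstance(arg, str):
        # Bare strings are always read as field shorthand, e.g. "date:T".
        field, sep, code = arg.partition(":")
        out = {"field": field}
        if sep:
            out["type"] = TYPE_CODES[code]
        return out
    raise TypeError("wrap constants in Value(...) to avoid ambiguity")


as_field = channel_def("x")         # the column named "x"
as_value = channel_def(Value("x"))  # the constant string "x"
```

With the explicit wrapper, the same string can mean either a field or a constant without guessing; accepting bare constants would make the string case undecidable.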

mattijn · 2021-11-22T22:06:37Z

Gentle ping @domoritz, we have a question regarding #2517 (comment). Do you know if something changed or is missing related to vega-util or vega-embed since VL5?
Edit: or any hints how to debug deeper?

ChristopherDavisUCI · 2021-11-22T22:09:19Z

Thanks @mattijn, I also asked @domoritz on Slack and he suggested it should be something more basic than what I thought. Probably just a version needs to be updated. I will investigate tonight.

domoritz · 2021-11-22T22:30:12Z

Could you generate a standalone HTML page where you get the error with vega-util? I can take a look.

ChristopherDavisUCI · 2021-11-22T23:09:03Z

Thank you! Does this count as a standalone html page? link on Google drive

I produced it by making a Jupyter notebook with the error and then downloading that notebook as an html file.

I can definitely do more debugging myself also, now that I know it should be an out-of-date reference.

mattijn · 2021-11-23T07:50:17Z

I used this piece of code to debug. It works OK in JupyterLab (started from the command line with jupyter-lab), but not in Jupyter Notebook (started with jupyter notebook). It is based on Altair's spec_to_html() function:

from altair.utils.html import spec_to_html
from IPython.display import display, HTML
import json

dct = json.loads("""{
  "data": {
    "values": [
      {"a": "A", "b": 28},{"a": "B", "b": 55},{"a": "C", "b": 43},{"a": "D", "b": 91},
      {"a": "F", "b": 53},{"a": "G", "b": 19},{"a": "H", "b": 87},{"a": "I", "b": 52}
    ]
  },
  "mark": "bar",
  "encoding": {
    "x": {"field": "a", "type": "nominal"},
    "y": {"field": "b", "type": "quantitative"}
  },
  "height": 200,
  "width": 200
}
"""
)
# dct  # this seems ok

h = spec_to_html(
    spec=dct,
    mode='vega-lite',
    vega_version='5.21.0', 
    vegaembed_version='6.20.2', 
    vegalite_version='5.1.1',
    requirejs=False,
    base_url='https://cdn.jsdelivr.net/npm',
    fullhtml=True,
    template='standard'
)

# print(h)  # seems valid HTML

display(HTML(h))  # this doesn't work

For Jupyter Notebook I get this error in the console log:

VM48:17 Uncaught ReferenceError: vegaEmbed is not defined
    at <anonymous>:17:8
    at b (main.min.js?v=24786799f8766c957f062e6b8f0ea89d00055170b051b7d318e97aedfb60dea047c35f2dfd92a435f058e984b092f71bc1fc7d28aacf0c330637e94e2ac72e5d:2)
    at Pe (main.min.js?v=24786799f8766c957f062e6b8f0ea89d00055170b051b7d318e97aedfb60dea047c35f2dfd92a435f058e984b092f71bc1fc7d28aacf0c330637e94e2ac72e5d:2)
    at S.fn.init.append (main.min.js?v=24786799f8766c957f062e6b8f0ea89d00055170b051b7d318e97aedfb60dea047c35f2dfd92a435f058e984b092f71bc1fc7d28aacf0c330637e94e2ac72e5d:2)
    at OutputArea._safe_append (main.min.js?v=24786799f8766c957f062e6b8f0ea89d00055170b051b7d318e97aedfb60dea047c35f2dfd92a435f058e984b092f71bc1fc7d28aacf0c330637e94e2ac72e5d:59205)
    at OutputArea.append_display_data (main.min.js?v=24786799f8766c957f062e6b8f0ea89d00055170b051b7d318e97aedfb60dea047c35f2dfd92a435f058e984b092f71bc1fc7d28aacf0c330637e94e2ac72e5d:59412)
    at OutputArea.append_output (main.min.js?v=24786799f8766c957f062e6b8f0ea89d00055170b051b7d318e97aedfb60dea047c35f2dfd92a435f058e984b092f71bc1fc7d28aacf0c330637e94e2ac72e5d:59092)
    at OutputArea.handle_output (main.min.js?v=24786799f8766c957f062e6b8f0ea89d00055170b051b7d318e97aedfb60dea047c35f2dfd92a435f058e984b092f71bc1fc7d28aacf0c330637e94e2ac72e5d:59003)
    at output (main.min.js?v=24786799f8766c957f062e6b8f0ea89d00055170b051b7d318e97aedfb60dea047c35f2dfd92a435f058e984b092f71bc1fc7d28aacf0c330637e94e2ac72e5d:60836)
    at Kernel._handle_output_message (main.min.js?v=24786799f8766c957f062e6b8f0ea89d00055170b051b7d318e97aedfb60dea047c35f2dfd92a435f058e984b092f71bc1fc7d28aacf0c330637e94e2ac72e5d:62597)

EDIT: Based on this line: https://github.com/altair-viz/altair/blob/8a8642b2e7eeee3b914850a8f7aacd53335302d9/altair/vega/v5/display.py#L73 I observe that the default parameter for template is universal. If I change the template parameter to universal, I get this error (only once per session, I've to restart Jupyter Notebook to see it again):

VM31:27 Uncaught Error loading script: Script error for "vega-util", needed by: vega-lite
http://requirejs.org/docs/errors.html#scripterror

Basically meaning I can reproduce it with the following HTML string:

from IPython.display import display, HTML
h = """<div id="vis"></div>
<script type="text/javascript">
  (function(spec, embedOpt){
    let outputDiv = document.currentScript.previousElementSibling;
    if (outputDiv.id !== "vis") {
      outputDiv = document.getElementById("vis");
    }
    const paths = {
      "vega": "https://cdn.jsdelivr.net/npm/vega@5.21.0?noext",
      "vega-lib": "https://cdn.jsdelivr.net/npm/vega-lib?noext",
      "vega-lite": "https://cdn.jsdelivr.net/npm/vega-lite@5.1.1?noext",
      "vega-embed": "https://cdn.jsdelivr.net/npm/vega-embed@6.20.2?noext",
    };

    function loadScript(lib) {
      return new Promise(function(resolve, reject) {
        var s = document.createElement('script');
        s.src = paths[lib];
        s.async = true;
        s.onload = () => resolve(paths[lib]);
        s.onerror = () => reject(`Error loading script: ${paths[lib]}`);
        document.getElementsByTagName("head")[0].appendChild(s);
      });
    }

    function showError(err) {
      outputDiv.innerHTML = `<div class="error" style="color:red;">${err}</div>`;
      throw err;
    }

    function displayChart(vegaEmbed) {
      vegaEmbed(outputDiv, spec, embedOpt)
        .catch(err => showError(`Javascript Error: ${err.message}<br>This usually means there's a typo in your chart specification. See the javascript console for the full traceback.`));
    }

    if(typeof define === "function" && define.amd) {
      requirejs.config({paths});
      require(["vega-embed"], displayChart, err => showError(`Error loading script: ${err.message}`));
    } else if (typeof vegaEmbed === "function") {
      displayChart(vegaEmbed);
    } else {
      loadScript("vega")
        .then(() => loadScript("vega-lite"))
        .then(() => loadScript("vega-embed"))
        .catch(showError)
        .then(() => displayChart(vegaEmbed));
    }
  })({"data": {"values": [{"a": "A", "b": 28}, {"a": "B", "b": 55}, {"a": "C", "b": 43}, {"a": "D", "b": 91}, {"a": "F", "b": 53}, {"a": "G", "b": 19}, {"a": "H", "b": 87}, {"a": "I", "b": 52}]}, "mark": "bar", "encoding": {"x": {"field": "a", "type": "nominal"}, "y": {"field": "b", "type": "quantitative"}}, "height": 200, "width": 200}, {"mode": "vega-lite"});
</script>

"""
display(HTML(h))

Only once per session, I've to restart Jupyter Notebook to see it again (not just restart the kernel)

jakevdp · 2021-11-23T13:21:31Z

The error is coming from requirejs. JupyterLab does not use requirejs, while Jupyter Notebook does, and the universal template is the only HTML output that attempts to support requirejs. I suspect the issue is in one of the Vega JavaScript sources: in requirejs mode, it specifies a dependency on vega-util. I'm not sure whether that's intended.

jakevdp · 2021-11-23T13:27:43Z

You can see the requirement declared at the top of the vega-lite v5 source: https://cdn.jsdelivr.net/npm/vega-lite@5.1:

!function(e,t){"object"==typeof exports&&"undefined"!=typeof module?t(exports,require("vega-util"),require("vega")) ...

Again, this problem will only arise in an environment where requirejs is loaded. If it's intended on the Vega-Lite side, it means that an extra library is necessary when creating vega-lite charts in a requireJS context. We could add this to the universal renderer:

const paths = {
      "vega": "https://cdn.jsdelivr.net/npm/vega@5.21.0?noext",
      "vega-lib": "https://cdn.jsdelivr.net/npm/vega-lib?noext",
      "vega-lite": "https://cdn.jsdelivr.net/npm/vega-lite@5.1.1?noext",
      "vega-embed": "https://cdn.jsdelivr.net/npm/vega-embed@6.20.2?noext",
      "vega-util": "https://cdn.jsdelivr.net/npm/vega-util?noext",
    };

But since it's not mentioned anywhere in the vega-embed documentation, I suspect this is a bug in the vega-lite.js build process. @domoritz might have a more definitive answer.

domoritz · 2021-11-23T20:39:13Z

Good catch. Since Vega exports all the utilities from Vega-Util, we try to replace all imports from Vega-Util with imports from Vega in the bundle. It should happen with https://github.com/vega/vega-lite/blob/12bee8b450c1176434b5166ca5f42515c41a78db/rollup.config.js#L36 but evidently does not here. I will fix this now.

mattijn · 2021-11-23T20:57:33Z

No typescript user, but here it seems mentioned as an external dependency? https://github.com/vega/vega-lite/blob/12bee8b450c1176434b5166ca5f42515c41a78db/rollup.config.js#L81

domoritz · 2021-11-23T21:19:14Z

I think I fixed the issue in vega/vega-lite#7823. It turns out rollup doesn't let us use both external and global. We have to alias the dependencies manually. The original goal was to reduce code duplication and import functionality from the Vega bundle instead of bundling it again.

domoritz · 2021-11-23T22:53:32Z

Can you try Vega-Lite 5.2.0?

ChristopherDavisUCI · 2021-11-24T00:21:53Z

I checked and it's working now in Jupyter Notebook! Thanks @domoritz for the very fast work!

domoritz · 2021-11-24T00:24:25Z

Thank y'all for isolating the issue. It helped tremendously.

ChristopherDavisUCI · 2021-11-24T13:44:37Z

@jakevdp Can I ask, how do you think selection/variable/parameter should exist in the Vega-Lite 5 version of Altair? Do you like the idea of all three existing, or should there only be selection and variable or maybe only parameter?

Currently as @mattijn pointed out above, there are .add_selection, .add_variable, and .add_parameter methods defined on Chart objects that all do the exact same thing. (The first two are aliases for .add_parameter.)

I have some ad hoc code right now to keep old examples from breaking. For example, if a selection has init specified, I change that to value. And if type='single' or type='multi' is specified, I change that to type='point'. And if a selection has empty specified, I have some code to move that into alt.condition or into transform_filter. Does that all sound okay, or is it a longterm headache to have this sort of code to fix one-off problems?
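
For concreteness, the first two rewrites could look roughly like the sketch below. upgrade_selection_kwargs is a hypothetical name, and the handling of empty is deliberately left out because it depends on where the selection is used.

```python
def upgrade_selection_kwargs(kwargs):
    """Translate Vega-Lite 4 style selection keywords to Vega-Lite 5 names.

    Hypothetical helper; covers only the `init` and `type` rewrites,
    not the `empty` handling.
    """
    out = dict(kwargs)
    if "init" in out:
        if "value" in out:
            raise ValueError("pass either 'init' or 'value', not both")
        out["value"] = out.pop("init")
    if out.get("type") in ("single", "multi"):
        out["type"] = "point"
    return out


old_interval = {"type": "interval", "init": {"x": [0, 10]}}
old_single = {"type": "single", "fields": ["Cylinders"]}
new_interval = upgrade_selection_kwargs(old_interval)
new_single = upgrade_selection_kwargs(old_single)
```

Keeping the translation in one function at least confines the one-off fixes to a single place.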

jakevdp · 2021-11-24T14:40:36Z

My feeling on this is we should stick to the language of the Vega-Lite schema; that is, use the name parameter, the method add_parameter, etc. For the 5.0 release, we could keep the previous selection functions around, but have them raise a DeprecationWarning mentioning the new syntax, and that the selection functionality will be removed in a future release. What do you think?
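
A sketch of that deprecation pattern using only the standard library. The Chart class here is a stand-in that models nothing but parameter bookkeeping, and the warning text is a placeholder.

```python
import warnings


class Chart:
    """Stand-in for a chart class; only the parameter bookkeeping is modelled."""

    def __init__(self):
        self.params = []

    def add_parameter(self, *params):
        self.params.extend(params)
        return self

    def add_selection(self, *params):
        # Old name kept as a thin wrapper that warns and delegates.
        warnings.warn(
            "add_selection is deprecated; use add_parameter instead.",
            DeprecationWarning,
            stacklevel=2,
        )
        return self.add_parameter(*params)


with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    chart = Chart().add_selection("zoom")
```

stacklevel=2 points the warning at the caller's line rather than at the wrapper itself.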

ChristopherDavisUCI · 2021-11-24T19:46:40Z

That sounds good to me and it should be pretty easy to implement!

ChristopherDavisUCI · 2021-11-24T22:10:00Z

@mattijn pointed out this old PR: #1629

Is this a good time to try out adding the code from there?

jakevdp · 2021-11-24T23:30:51Z

> Is this a good time to try out adding the code from there?

Probably not... I'm not entirely sure about the approach in that PR. The fundamental issue is that currently for schema objects, attribute access returns the attribute. That is,

import altair as alt
x = alt.X(field='x')
print(x.field)
# x

In order to have the semantics proposed there, attribute access would have to return a function that sets the attribute; that is, we'd have to change the API of Altair objects to be something more like this:

x = alt.X().field('x')
print(x.field)
# <bound method X.field of <__main__.X object at 0x7fbf5f349d00>>
print(x['field'])
# x

That's certainly doable, but it's a fundamental departure from Altair's current object model. We could discuss whether the advantages of that outweigh the disadvantages, but I think that's not something that should be done as part of this PR.

ChristopherDavisUCI · 2021-11-24T23:37:21Z

Okay cool. I'll ask no more questions until after Thanksgiving!

mattijn · 2021-11-25T15:35:37Z

I raised it primarily because of the parameter definition. When having a chart with a few sliders, you have to define them now as follows:

import altair as alt
import numpy as np

rad_slider = alt.binding_range(min=0, max=100, step=1)
rad_var = alt.variable(bind=rad_slider, value=0, name="radius")

rad2_slider = alt.binding_range(min=0, max=100, step=1)
rad_var2 = alt.variable(bind=rad2_slider, value=50, name="radius2")

theta_slider = alt.binding_range(min=-2*np.pi, max=2*np.pi)
theta_var = alt.variable(bind=theta_slider, value=-0.73, name="theta_single_arc")

theta_slider2 = alt.binding_range(min=-2*np.pi, max=2*np.pi)
theta2_var = alt.variable(bind=theta_slider2, value=0.73, name="theta2_single_arc")

corner_slider = alt.binding_range(min=0, max=50, step = 1)
corner_var = alt.variable(bind=corner_slider, value=0, name="cornerRadius")

pad_slider = alt.binding_range(min=0, max=np.pi/2)
pad_var = alt.variable(bind=pad_slider, value=0, name="padAngle")

I feel it is a bit verbose and wish there were a shorthand similar to #1629, so it can be done as a one-liner, e.g.

pad_var = alt.param(value=0, name='padAngle').bind(alt.range(min=0, max=np.pi/2))

But maybe too much for now.

ChristopherDavisUCI · 2021-11-26T18:51:39Z

If a parameter in Vega-Lite is defined by

{
  "name": "sel",
  "select": {"type": "point", "fields": ["Miles_per_Gallon"], "toggle": false}
}

what would be a good way to define that parameter in Altair? Is something like the following okay, or is it too verbose?

s = alt.selection_point(fields=["Miles_per_Gallon"], toggle=False)
p = alt.parameter(select=s)

As far as I can tell, there's no "Parameter" defined in the Vega-Lite schema. There is an

"anyOf": [
  {
    "$ref": "#/definitions/VariableParameter"
  },
  {
    "$ref": "#/definitions/SelectionParameter"
  }
]

jakevdp · 2021-11-27T14:58:12Z

Having to wrap alt.selection in alt.parameter seems a bit cumbersome. But I haven't thought deeply about this question.

jakevdp · 2021-11-27T14:58:46Z

Maybe alt.selection_single by itself should return the full parameter definition?
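
In dictionary terms, that would mean the selection function builds both the outer parameter and the inner "select" block in one call. A sketch, with selection_point written as a plain function returning a dict; the automatic naming scheme is an assumption made up for the example.

```python
import itertools

_counter = itertools.count(1)


def selection_point(name=None, **select_options):
    """Return a complete parameter definition as a plain dict (hypothetical)."""
    if name is None:
        # Assumed naming scheme, only so unnamed parameters stay distinct.
        name = f"param_{next(_counter)}"
    return {"name": name, "select": {"type": "point", **select_options}}


sel = selection_point(name="sel", fields=["Miles_per_Gallon"], toggle=False)
```

The value of sel is the same structure as the Vega-Lite parameter quoted above, without a separate wrapping call.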

ChristopherDavisUCI · 2021-11-28T01:24:58Z

I started a new Pull Request #2528 so that I could clean up some of the commit history. It uses the most recent suggestion of mostly following the terminology of Vega-Lite schema, but allowing selection_point etc to define parameters.

I'll probably close this PR in a day or two unless someone says I should leave it open.

Thanks and see you at #2528!

ChristopherDavisUCI added 2 commits November 9, 2021 07:55

Creation of v5 folder that is exactly duplicated from v4

e493443

Initial file updates for Vega-Lite v5.1

ed05aad

This was referenced Nov 9, 2021

WIP: update to Vega-Lite 5 #2516

Closed

Update to Vega-Lite 4.17.0 #2513

Merged

Move height/width/view from layer subcharts to parent layer

bfcd0c4

ChristopherDavisUCI and others added 3 commits November 10, 2021 18:04

Fix inconsistent output

146f394

Definitions related to VariableParameter

84e1c1c

Merge branch 'altair-viz:master' into samplev51

4cdc160

Merge branch 'altair-viz:master' into samplev51

a457f18

domoritz mentioned this pull request Nov 23, 2021

feat: smaller bundles by making vega libraries external vega/vega-lite#7823

Merged

Update to Vega-Lite 5.2.0

7a125c1

ChristopherDavisUCI mentioned this pull request Nov 28, 2021

WIP: update to Vega-Lite 5.2 #2528

Merged

ChristopherDavisUCI closed this Dec 2, 2021

ChristopherDavisUCI deleted the samplev51 branch March 25, 2022 17:54

ChristopherDavisUCI mentioned this pull request Nov 23, 2022

Add method based attribute setting to encoding classes for Altair 5 #2592

Closed
