bar base and timeline #2626

nicolaskruchten · 2020-07-08T19:28:43Z

I have added tests (if submitting a new feature or correcting a bug) or
modified existing tests.
For a new feature, I have added documentation examples in an existing or
new tutorial notebook (please see the doc checklist as well).
I have added a CHANGELOG entry if fixing/changing/adding anything substantial.

nicolaskruchten · 2020-07-09T17:37:29Z

@emmanuelle ready for review

emmanuelle · 2020-07-14T09:53:55Z

packages/python/plotly/plotly/express/_chart_types.py

@@ -357,6 +358,53 @@ def bar(
 bar.__doc__ = make_docstring(bar, append_dict=_cartesian_append_dict)


+def timeline(
+    data_frame=None,
+    x_start=None,


If these arguments are required, they should not be keyword arguments but positional ones, I think?

we could think of defaults like providing a global start or end parameter which would be the one for all. But I think the easiest solution is to make the arguments positional.

I would really like to keep this function consistent with all the others, which have the data_frame argument come first. I agree that in general required arguments should come first and be positional, but in this case I prefer consistency. px.pie() sort of works the same way: you need to provide either values or names otherwise you get an empty figure.

To align with something like pie I could just have x be unbound if x_start and x_end aren't provided instead of raising (i.e. we would render an empty figure just like px.pie(df)) but I think this would be less helpful for users.

I see your point, but preferring px consistency over Python conventions is not 100% satisfying: when I see a keyword argument I don't expect the function to raise with its default value, and None in particular usually means optional. You are right that pie has the same problem, and that if you use all the default values for the other functions, the result is an empty figure, so not very interesting. I'd be happy with a modification of the docstring for x_start and x_end mentioning that they are compulsory.

> To align with something like pie I could just have x be unbound if x_start and x_end aren't provided instead of raising (i.e. we would render an empty figure just like px.pie(df)) but I think this would be less helpful for users.

but this would be consistent :-)
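For readers following the trade-off, here is a minimal sketch of the pattern under discussion: every argument keeps a None default so that data_frame can stay first, and the function raises a clear error when the required ones are missing. The name timeline_like and its return value are hypothetical and are not the actual px.timeline implementation.

```python
def timeline_like(data_frame=None, x_start=None, x_end=None):
    """Hypothetical stand-in for a px-style function with required keyword arguments."""
    # Collect the names of required arguments that were left at their None default.
    missing = [
        name
        for name, value in (("x_start", x_start), ("x_end", x_end))
        if value is None
    ]
    if missing:
        # Raise early with a message that names what is missing.
        raise ValueError(
            "Both x_start and x_end are required; missing: " + ", ".join(missing)
        )
    # A real function would build a figure here; the sketch just echoes its inputs.
    return {"data_frame": data_frame, "x_start": x_start, "x_end": x_end}
```

Calling timeline_like() with no arguments raises ValueError, while timeline_like(df, x_start="Start", x_end="Finish") goes through, which is the behavior the docstring would then need to spell out.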

emmanuelle · 2020-07-14T09:57:26Z

packages/python/plotly/plotly/express/_core.py

+            "Both x_start and x_end must refer to data convertible to datetimes."
+        )
+
+    # note that we are not adding any columns to the data frame here, so no risk of overwrite


not sure I understand here; there's no risk of overwrite because the dataframe is copied?

This is saying that we're not adding a new column to the DF, we're just overwriting an existing known key that we control. If we were to add a new column to the DF called x or base we would run the risk of clobbering an existing column that was mapped to something like color and things would break.

Although... as I write this, I realized that actually this exact situation breaks in this case: if someone maps color="x_end" we will end up clobbering their column... same problem as we encounter in sunburst/treemap when we mess with the DF :)

thanks for the explanations. It's indeed a similar problem as the sunburst / treemap case.
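The clobbering case can be illustrated with plain pandas. This is only a sketch, assuming the end column is overwritten in place with the duration; the helper name overwrite_end_with_duration and the column names are made up for the example, and the conversion to milliseconds is a stand-in for whatever the real code does.

```python
import pandas as pd

df = pd.DataFrame(
    {
        "Task": ["A", "B"],
        "Start": pd.to_datetime(["2020-01-01", "2020-01-03"]),
        "Finish": pd.to_datetime(["2020-01-02", "2020-01-06"]),
    }
)


def overwrite_end_with_duration(frame, x_start, x_end):
    """Hypothetical helper: replace the end column with the duration in milliseconds."""
    out = frame.copy()
    out[x_end] = (out[x_end] - out[x_start]).dt.total_seconds() * 1000
    return out


processed = overwrite_end_with_duration(df, "Start", "Finish")

# Anything else mapped to "Finish" (for example color="Finish") now sees
# durations rather than the original end dates.
finish_still_dates = pd.api.types.is_datetime64_any_dtype(processed["Finish"])
```

Here finish_still_dates is False: no new column was added, but the user's original "Finish" values are gone from the processed frame.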

emmanuelle · 2020-07-14T10:01:42Z

Just a comment about the arguments x_start and x_end, which should be positional IMO, otherwise ready to go!

emmanuelle · 2020-07-14T13:12:25Z

For the record about positional vs keyword arguments, since px functions have only keyword arguments, here is the list of functions which raise when called without any argument (the others return an empty figure):

import plotly.express as px
for func_name in px.__all__[:-8]:
    try:
        fig = getattr(px, func_name)()
    except Exception:
        print(func_name, "error")

which results in

scatter_mapbox error
scatter_matrix error
density_mapbox error
line_mapbox error
parallel_coordinates error
parallel_categories error
timeline error
imshow error

nicolaskruchten · 2020-07-14T13:16:03Z

Ah that's a good list to have, thanks for raising this issue. Raising with defaults isn't the best thing, I agree, but at least we should ensure that there is a clear error message in each case.

The problem the way I see it is that in some cases (not timeline) we can't make arguments required because it's like "either A or B is required" so we need to give them default None, but if neither is specified we need to either render an empty figure or raise. I don't know that empty figure is strictly better than raising because it's less helpful to the user.

nicolaskruchten · 2020-07-14T13:39:46Z

Follow-up: all these functions have reasonable error messages except for line_mapbox and scatter_mapbox which blindly read lat/lon
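To check the error messages themselves rather than just whether a call raises, the loop above could be extended along these lines. This is a sketch: probe_no_arg_calls is a hypothetical helper, and fake_px is a stand-in namespace so the example runs without plotly installed.

```python
from types import SimpleNamespace


def probe_no_arg_calls(namespace, names):
    """Call each named callable with no arguments and record any exception raised."""
    failures = {}
    for name in names:
        func = getattr(namespace, name)
        try:
            func()
        except Exception as exc:
            # Keep the exception type and message so unclear errors stand out.
            failures[name] = f"{type(exc).__name__}: {exc}"
    return failures


# Stand-in functions: one returns quietly, one raises with a clear message.
def _ok(data_frame=None):
    return "empty figure"


def _raises(data_frame=None, x_start=None, x_end=None):
    raise ValueError("Both x_start and x_end are required")


fake_px = SimpleNamespace(pie=_ok, timeline=_raises)
report = probe_no_arg_calls(fake_px, ["pie", "timeline"])
```

With plotly installed, the same helper should work as probe_no_arg_calls(px, px.__all__[:-8]), mirroring the slice used in the loop above.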

amundhov · 2020-10-29T11:57:15Z

packages/python/plotly/plotly/express/_core.py

+
+    try:
+        x_start = pd.to_datetime(args["data_frame"][args["x_start"]])
+        x_end = pd.to_datetime(args["data_frame"][args["x_end"]])


There is some (for me) unexpected behavior when the start and/or end times are plain numbers (parseable by pandas to_datetime). The default in pandas is to interpret them as nanoseconds, whereas the subsequent time delta in process_dataframe_timeline is converted to milliseconds.

If you try to plot something small such as [1, 2, 10], timeline will happily convert it and end up with all zeros and an empty plot since 1ns, 2ns and 10ns are all less than 1ms.

Should I create a bug report / enhancement for this issue?
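A small pandas-only reproduction of the unit mismatch, as a sketch: the start and end values are made up, and floor division by one millisecond is used here as a stand-in for the truncating conversion to milliseconds, which is an assumption about how the real conversion behaves.

```python
import pandas as pd

start = pd.Series([1, 2, 10])
end = pd.Series([2, 10, 20])

# Default: plain integers are read as nanoseconds since the epoch.
dur_default = pd.to_datetime(end) - pd.to_datetime(start)
ms_default = (dur_default // pd.Timedelta(milliseconds=1)).tolist()

# Explicit unit: the same integers read as milliseconds.
dur_explicit = pd.to_datetime(end, unit="ms") - pd.to_datetime(start, unit="ms")
ms_explicit = (dur_explicit // pd.Timedelta(milliseconds=1)).tolist()
```

ms_default comes out as [0, 0, 0] because every duration is shorter than one millisecond, while ms_explicit is [1, 8, 10]. Converting numeric columns with an explicit unit before passing them in is one possible workaround.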

nicolaskruchten requested a review from emmanuelle July 8, 2020 19:28

nicolaskruchten mentioned this pull request Jul 8, 2020

px.NO_COLOR #2614

Merged

4 tasks

nicolaskruchten added this to the 4.9.0 milestone Jul 8, 2020

nicolaskruchten force-pushed the timeline branch 4 times, most recently from 7bcc167 to 22fb60e Compare July 9, 2020 17:36

nicolaskruchten added 2 commits July 13, 2020 12:22

bar base and timeline

fc64079

ff docs

43493cb

nicolaskruchten force-pushed the timeline branch from 040c8fd to 43493cb Compare July 13, 2020 16:22

emmanuelle reviewed Jul 14, 2020

clarify docstring

19a0544

nicolaskruchten merged commit 8f32b7d into master Jul 14, 2020

nicolaskruchten deleted the timeline branch July 20, 2020 14:35

amundhov reviewed Oct 29, 2020
