fix: convert column type to str during dual read/write #19701

hughhhh · 2022-04-13T19:48:40Z

SUMMARY

Users are blocked when trying to save a dataset on a query/table that has an ARRAY column in it. Due the python_type returning a list instead of a str for the type. To fix this i've added a util function to check if the type is a list and convert it to a string like this ARRAY

BEFORE/AFTER SCREENSHOTS OR ANIMATED GIF

TESTING INSTRUCTIONS

ADDITIONAL INFORMATION

Has associated issue:
Required feature flags:
Changes UI
Includes DB Migration (follow approval process in SIP-59)
- Migration is atomic, supports rollback & is backwards-compatible
- Confirm DB migration upgrade and downgrade tested
- Runtime estimates and downtime expectations provided
Introduces new feature or API
Removes existing feature or API

eschutho · 2022-04-13T19:52:04Z

This is great @hughhhh. Can you write a quick unit test for it?

codecov · 2022-04-13T19:54:04Z

Codecov Report

Merging #19701 (8269591) into master (c8304a2) will decrease coverage by 0.00%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master   #19701      +/-   ##
==========================================
- Coverage   66.51%   66.51%   -0.01%     
==========================================
  Files        1684     1684              
  Lines       64559    64584      +25     
  Branches     6626     6626              
==========================================
+ Hits        42941    42955      +14     
- Misses      19923    19934      +11     
  Partials     1695     1695

Flag	Coverage Δ
hive	`52.67% <16.66%> (-0.02%)`	⬇️
javascript	`51.15% <ø> (ø)`
mysql	`81.93% <100.00%> (-0.03%)`	⬇️
postgres	`81.97% <100.00%> (-0.03%)`	⬇️
presto	`52.52% <16.66%> (-0.02%)`	⬇️
python	`82.41% <100.00%> (-0.03%)`	⬇️
sqlite	`81.75% <100.00%> (-0.03%)`	⬇️
unit	`47.73% <100.00%> (-0.02%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
superset/connectors/sqla/utils.py	`91.34% <100.00%> (+0.52%)`	⬆️
superset/sql_lab.py	`78.90% <0.00%> (-2.74%)`	⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update c8304a2...8269591. Read the comment docs.

ktmud · 2022-04-13T20:26:52Z

Should we save the original database type here? The type field specifically used Text as storage so the originally intention is probably to allow storing the full schema of complex types such as ARRAY and STRUCT?

What does the old column model save?

hughhhh · 2022-04-13T22:11:05Z

Should we save the original database type here? The type field specifically used Text as storage so the originally intention is probably to allow storing the full schema of complex types such as ARRAY and STRUCT?

What does the old column model save?

I'm pretty sure it was still ARRAY, but this is only a patch i'm pretty sure there other types we'd have to map to strings later

ktmud · 2022-04-14T01:27:32Z

superset/connectors/sqla/utils.py

+    if column_type.python_type == list:
+        return "ARRAY"
+    if column_type.python_type == dict:
+        return "JSON"


Can it also be a STRUCT? I guess it probably depends on the database.

so i'm following the pattern from the sqlalchemy.sql.sqltypes

Should we save the original database type here? The type field specifically used Text as storage so the originally intention is probably to allow storing the full schema of complex types such as ARRAY and STRUCT?

I agree, using the original native type is the ideal solution here.

villebro · 2022-04-14T08:47:43Z

@hughhhh @ktmud I've opened up an alternative solution for this issue that reuses existing code that's used when creating physical datasets: #19714

sadpandajoe · 2022-04-18T22:35:00Z

🏷️ preset:2022.15

convert column type in function

c0ba42e

pull-request-size bot added the size/S label Apr 13, 2022

hughhhh changed the title ~~fix: convert column type in function~~ fix: convert column type to str during dual read/write Apr 13, 2022

eschutho requested review from ktmud and betodealmeida April 13, 2022 19:50

hughhhh force-pushed the fix-ds-save-array branch from bb732bc to f7c1bb7 Compare April 14, 2022 01:23

ktmud reviewed Apr 14, 2022

View reviewed changes

udpate with test

8269591

hughhhh force-pushed the fix-ds-save-array branch from f7c1bb7 to 8269591 Compare April 14, 2022 01:29

villebro mentioned this pull request Apr 14, 2022

fix: create virtual table with exotic type #19714

Merged

9 tasks

hughhhh closed this Apr 14, 2022

superset-github-bot bot added the preset:2022.15 label Apr 18, 2022

mistercrunch deleted the fix-ds-save-array branch March 26, 2024 16:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: convert column type to str during dual read/write #19701

fix: convert column type to str during dual read/write #19701

hughhhh commented Apr 13, 2022

eschutho commented Apr 13, 2022

codecov bot commented Apr 13, 2022 •

edited

Loading

ktmud commented Apr 13, 2022 •

edited

Loading

hughhhh commented Apr 13, 2022

ktmud Apr 14, 2022

hughhhh Apr 14, 2022

villebro Apr 14, 2022

villebro commented Apr 14, 2022

sadpandajoe commented Apr 18, 2022

fix: convert column type to str during dual read/write #19701

fix: convert column type to str during dual read/write #19701

Conversation

hughhhh commented Apr 13, 2022

SUMMARY

BEFORE/AFTER SCREENSHOTS OR ANIMATED GIF

TESTING INSTRUCTIONS

ADDITIONAL INFORMATION

eschutho commented Apr 13, 2022

codecov bot commented Apr 13, 2022 • edited Loading

Codecov Report

ktmud commented Apr 13, 2022 • edited Loading

hughhhh commented Apr 13, 2022

ktmud Apr 14, 2022

Choose a reason for hiding this comment

hughhhh Apr 14, 2022

Choose a reason for hiding this comment

villebro Apr 14, 2022

Choose a reason for hiding this comment

villebro commented Apr 14, 2022

sadpandajoe commented Apr 18, 2022

codecov bot commented Apr 13, 2022 •

edited

Loading

ktmud commented Apr 13, 2022 •

edited

Loading