Plugin does not open parquet files on Mac M1 #83
Comments
I've been having issues with the extension since yesterday also. The issue is the same for me as in other reports: "the file takes forever to load". I don't know if it eventually loads or not, since I just close it if it doesn't load in a few seconds. I'm using pandas to check out my files as a workaround, but this extension is very much missed :)
@akelsch, does it make sense to specify how to write your parquet files? I would have thought once they are
I'm developing on:
Thanks @dvirtz for your work on this great extension.
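For anyone else relying on the pandas workaround mentioned above, here is a minimal sketch of what it could look like. The helper name `preview_parquet` and its `rows` parameter are made up for illustration and are not part of the extension; the sketch assumes pandas is installed together with a Parquet engine such as pyarrow.

```python
from pathlib import Path


def preview_parquet(path, rows=5):
    """Return the first `rows` rows of a Parquet file as a DataFrame.

    Hypothetical helper for illustration only, not part of the extension.
    """
    path = Path(path)
    if not path.is_file():
        raise FileNotFoundError(f"No such Parquet file: {path}")
    # Imported lazily so a missing pandas install only fails when a file is read.
    import pandas as pd

    # pandas delegates the actual decoding to an installed engine
    # (pyarrow or fastparquet).
    frame = pd.read_parquet(path)
    return frame.head(rows)
```

Calling `print(preview_parquet("some-file.parquet"))` from a Python shell would then show the first few rows, which is usually enough to confirm that the file itself is readable and that the problem sits in the viewer.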
Oh, so the issue is not caused by Databricks but rather appeared at the same time as our migration (what a coincidence). I am also on Mac M1, and downgrading the plugin to version 2.2.2 does actually let me open the above parquet file. 2.3.0 seems to be the first breaking version for me, and 2.3.2 does not work either. It looks like some form of regression on Mac M1 only, though, because it is working on my Windows machine.
Edit: Logs from Mac machine:
I am using the default "parquets" backend.
Nice find. Exactly the same behavior I'm experiencing: 2.2.2 is the last working version. Is this
PS: Anecdotal but perhaps useful, this week in a python project I couldn't install arrow8.0 in a python3.7 venv (
Also having similar issues:
Was playing around with ChatGPT a bit, updated Python and pip, and installed PyArrow; that doesn't seem to help with the error either...
Python 3.11.4
pip 23.2.1
pyarrow.__version__ 12.0.1
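Since the problem seems to depend on the machine and on installed versions, a small script that collects those details could make reports like the one above easier to compare. This is only a sketch: the function name `environment_report` and the default package list are assumptions chosen for illustration, and it uses nothing beyond the Python standard library.

```python
import platform
from importlib import metadata


def environment_report(packages=("pyarrow", "pandas")):
    """Collect interpreter, platform, and package versions for a bug report."""
    report = {
        "python": platform.python_version(),
        "system": platform.system(),
        # Typically "arm64" on Apple Silicon when Python runs natively.
        "machine": platform.machine(),
    }
    for name in packages:
        try:
            report[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            # Record missing packages explicitly instead of failing.
            report[name] = None
    return report


for key, value in environment_report().items():
    print(f"{key}: {value}")
```

Pasting the printed lines into a comment gives the same kind of information as the version list above, plus the CPU architecture, which matters here because the regression was only seen on Mac M1.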
mkay, I got some slight progress:
git clone ...
brew install pipenv
npm i
npm run test
It kind of worked for a few seconds in tests. I'm gonna dig a bit more tomorrow.
error will only be reported when using arrow backend Fixes #83
🎉 This issue has been resolved in version 2.3.3 🎉
The release is available on:
Your semantic-release bot 📦🚀
Hi @dvirtz,
our team has encountered an issue with vscode-parquet-viewer after we migrated our Apache Spark platform from Azure HDInsight to Azure Databricks. It turns out we cannot open parquet files with your plugin anymore. The tab in VSCode keeps loading forever. I have attached a sample file to this issue. Its contents look like this (converted using a different plugin):
I would be really grateful if you could fix this issue as your plugin was the go-to choice to work with parquet files in our team.
Thanks!
Edit: Looks like this is unrelated to Databricks, similar to #81 but only happening on Mac M1
part-00000-tid-8369997251915174921-9ae69399-852b-4480-8335-60e65202bcc8-36-1-c000.snappy.parquet.zip