-
-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support XML as input format #9459
Comments
I believe XML is not a hot language anymore, but still widely use so I believe adds lots of value. |
It's still massively used in production throughout the world. In my work there's not a single month (or dare I say week) without encountering tabular data presented as XML. |
This would indeed be very helpful! For now, I would use pandas' function What are alternatives? |
@MariusMerkleQC It seems to be essentially a small wrapper around doc = lxml.etree(...)
nodes = doc.xpath(...)
df = pd.DataFrame(nodes) |
Then it should be relatively easy to bring this to |
As it is not supported yet, I just used the library ElementTree to parse the |
Would definitely love to have native xml support in polars. Not hard to add but annoying when coming from pandas. |
polars isn't going to implement an xml reader based on python's xml reader it'd have to be rust. I can't say whether or not the maintainers want the extra binary size |
It seems that calamine (used by fastexcel in the Perhaps something could be done with quick-xml if Calamine integration happens at the Rust level.
|
With XML reading I'd like to see #13063 HTML reading too. |
Problem description
It would be great to be able to read XML into Polars DataFrame, similarly to what pandas offers (https://pandas.pydata.org/docs/reference/api/pandas.read_xml.html).
The text was updated successfully, but these errors were encountered: