rust_xlsxwriter Roadmap #1

jmcnamara · 2022-09-24T20:19:08Z

jmcnamara · 2022-09-24T20:23:25Z

Initial performance data:

$ hyperfine target/release/examples/app_perf_test ./c_perf_test "python py_perf_test.py"
Benchmark 1: target/release/examples/app_perf_test
  Time (mean ± σ):     447.2 ms ±   8.3 ms    [User: 402.8 ms, System: 39.4 ms]
  Range (min … max):   431.5 ms … 460.7 ms    10 runs

Benchmark 2: ./c_perf_test
  Time (mean ± σ):     362.5 ms ±   6.4 ms    [User: 305.2 ms, System: 53.0 ms]
  Range (min … max):   353.2 ms … 371.9 ms    10 runs

Benchmark 3: python py_perf_test.py
  Time (mean ± σ):      2.899 s ±  0.023 s    [User: 2.787 s, System: 0.088 s]
  Range (min … max):    2.868 s …  2.934 s    10 runs

Summary
  './c_perf_test' ran
    1.23 ± 0.03 times faster than 'target/release/examples/app_perf_test'
    8.00 ± 0.16 times faster than 'python py_perf_test.py'

Or in other words, the C version is the fastest and if we take that as 1 then the rust version is 1.2x (or 20%) slower and the Python version is 8x slower.

The Rust version is ~6.5x faster than the Python version.

$ hyperfine target/release/examples/app_perf_test "python py_perf_test.py"
Benchmark 1: target/release/examples/app_perf_test
  Time (mean ± σ):     450.8 ms ±   5.5 ms    [User: 406.8 ms, System: 39.1 ms]
  Range (min … max):   443.5 ms … 459.1 ms    10 runs

Benchmark 2: python py_perf_test.py
  Time (mean ± σ):      2.942 s ±  0.040 s    [User: 2.821 s, System: 0.090 s]
  Range (min … max):    2.877 s …  3.014 s    10 runs

Summary
  'target/release/examples/app_perf_test' ran
    6.53 ± 0.12 times faster than 'python py_perf_test.py'

pickfire · 2023-01-11T06:19:55Z

Is polars support planned? I saw the python xlsxwriter have pandas support, I wonder if the rust xlsxwriter have any plans to support polars.

jmcnamara · 2023-01-11T09:24:50Z

I wonder if the rust xlsxwriter have any plans to support polars.

Good suggestion. That was something that I was thinking about. I wrote the initial xlsxwriter integration into Pandas. I'll have a look in their GitHub issues/requests and see if there is any planned work.

claudiofsr · 2023-04-14T21:52:43Z

Is it possible to use rayon or std thread scope (parallelism) to do each worksheet and at the end add in the final workbook?

This code was just a vain attempt:
https://github.com/claudiofsr/rust-sped/blob/master/src/excel_alternative.rs

jmcnamara · 2023-04-14T22:10:35Z

Is it possible to use rayon or std:: thread ::scope (parallelism) to do each worksheet and at the end add in the final workbook?

It wouldn't be easy. I've thought a good bit about this in the past in relation to the other language version of the library. The main issue is that the xlsx file format has a lot of interlinked "relationships" stored in .rel files. Worksheet strings are also stored in a shared hash table and referenced by id. These, more or less, need to be worked out sequentially and/or with some locking.

However, I would like the library to have the best performance possible (within the limits of the design and file format) so I'll take a look at what can be done.

Update: some backend parallelism was added in v0.44.0

Christoph-AK · 2023-05-12T12:57:14Z

Hi and thanks for the library! This is a really aweswome upgrade to the previous binding to the C lib.
Just wanting to drop I would be really happy to see the tables functionality added. Would love to use that.

jmcnamara · 2023-05-12T13:22:29Z

I would be really happy to see the tables functionality added

That will be the next major feature after I complete more of the chart feature.

jmcnamara · 2023-05-26T23:46:56Z

@Christoph-AK see #41 for initial table support.

Update: completed in v0.40.0

plus1xp-everyday · 2023-07-23T06:32:21Z

Hi.. Thanks for the library!! I was in need of an active excel writer library in rust for my new project and this project seems to be most active and promising.

My project may depend on a lot of existing excel templates. I understand from the readme that currently editing an existing excel file is not supported. Any chances that this could be added in the future?

jmcnamara · 2023-07-23T20:16:50Z

Any chances that this could be added in the future?

Unfortunately no, I don’t plan to tackle reading or rewriting XLSX files.

The XLSX structure is difficult to parse and rewrite for anything beyond basic data (and even that it can be hard for elements like dates).

Instead I’m going to concentrate my efforts to try give Rust a best in class XLSX writing library.

For additional context here is a reply that I gave to a similar request to the Python version of the library: jmcnamara/XlsxWriter#653 (comment)

Hopefully someone will step up at some point to combine one of the Rust XLSX readers with rust_xlsxwriter for a templating/rewriting solution.

jmcnamara · 2023-08-20T20:09:50Z

I've uploaded a new crate called polars_excel_writer for serializing Polars dataframes into Excel Xlsx files using rust_xlsxwriter as a backend engine.

It provides two interfaces for writing a Polars Rust dataframe to an Excel Xlsx file:

ExcelWriter a simple Excel serializer that implements the Polars SerWriter trait to write a dataframe to an Excel Xlsx file. This is similar to the CsvWriter interface.
PolarsXlsxWriter a more configurable Excel serializer that resembles the interface options provided by the Polars Python write_excel() dataframe method. There is still work in progress for this interface.

One useful feature of PolarsXlsxWriter is that you can mix Polars and rust_xlsxwriter code to access Excel features not available in the current interface.

jmcnamara · 2023-11-21T21:20:46Z

I've added support for Conditional Formatting. See Working with Conditional Formats in the docs. #58

jmcnamara · 2023-12-09T09:52:40Z

I have added support for Serde serialization in v0.57.0. See Working with Serde in the rust_xlsxwriter docs and the discussion thread #61.

Some additional serialisation features and helpers will be added in upcoming releases.

Expurple · 2024-02-01T12:03:39Z

Do you have any plans regarding reaching version 1.0? You seem to make a new 0.x release every week or so. Many of them don't have any breaking changes, but still require me to manually bump the version in Cargo.toml to make sure that I don't miss any new bug fixes. It would be nice to actually utilize semantic versioning and indicate non-breaking releases with 1.x bumps. This way, my app would be able to depend on the major version and automatically receive library updates. Note that I'm not asking for any new stability commitments, you can release 2.0 as soon as you want to make a change. This model would already be more comfortable than a breaking bump on every release.

In theory, I assume that major numbers would also make the life easier for library authors who depend on rust_xlsxwriter. Currently, they have to either pin it to a very narrow minor version or define a range with an explicit upper bound and bump it every week.

Christoph-AK · 2024-02-01T12:05:55Z

Obligatory mention of https://github.com/obi1kenobi/cargo-semver-checks - might be worth including.

jmcnamara · 2024-02-01T13:14:58Z

Do you have any plans regarding reaching version 1.0?

I plan to release a 1.x.x version once the feature set is ~ 100% of the Python feature set. Based on the task list above the current feature set is 25/33 features and based on ported integration test cases it is ~ 700/900. I would hope to get to 1.0.0 by the end of the year. Some of the remaining tasks are reasonable big though.

You seem to make a new 0.x release every week or so.

That will probably continue through this year (with an upcoming pause of 1-2 months while I work on some of the other language libraries/features).

Many of them don't have any breaking changes, but still require me to manually bump the version in Cargo.toml to make sure that I don't miss any new bug fixes.

Yes. Some, or many, of those could be patch level releases but most contain a reasonable level of new functionality.

The semver docs say:

Minor version Y (x.Y.z | x > 0) MUST be incremented if new, backward compatible functionality is introduced to the public API. It MUST be incremented if any public API functionality is marked as deprecated. It MAY be incremented if substantial new functionality or improvements are introduced within the private code. It MAY include patch level changes. Patch version MUST be reset to 0 when minor version is incremented.

I am usually in the "MAY" category and sometimes in the "MUST".

Note that I'm not asking for any new stability commitments, you can release 2.0 as soon as you want to make a change. This model would already be more comfortable than a breaking bump on every release.

I think I would end up incrementing a large number of major versions as well. I don't know if that would be better or worse for the end user.

Anyway, overall I think you (and others) will just need to bear with me for the next year or so. The downside is that there will be several more bumps in minor versions but the (hopefully) upside is that there will be new substantive features added on a regular basis.

jmcnamara · 2024-02-26T08:41:30Z

I have released rust_xlsxwriter v0.63.0 with support for embedding images into worksheets. See the Embedded Images example in the docs.

This can be useful if you are building up a spreadsheet of products with a column of images of each product. Embedded images move with the cell so they can be used in worksheet tables or data ranges that will be sorted or filtered.

This functionality is the equivalent of Excel's menu option to insert an image using the option to "Place in Cell" which is available in Excel 365 versions from 2023 onwards. I was a frequently requested feature for Excel and for the xlsxwriter variants.

jmcnamara · 2024-07-19T23:36:21Z

I've added support for adding VBA Macros to rust_xlsxwriter using files extracted from Excel files. This isn't very useful and it is also a little kludgy but it is a reasonably popular feature of the Python library and it has utility in some circumstance.

Also, this lays some of the groundwork for adding cell comments (now called Notes by Excel).

Explanation

An Excel xlsm file is structurally the same as an xlsx file except that it contains an additional vbaProject.bin binary file containing VBA functions and/or macros.

Unlike other components of an xlsx/xlsm file this data isn't stored in an XML format. Instead the functions and macros as stored as a pre-parsed binary format. As such it wouldn't be feasible to programmatically define macros and create a vbaProject.bin file from scratch (at least not in the remaining lifespan and interest levels of the author).

Instead, as a workaround, the Rust vba_extract utility is used to extract vbaProject.bin files from existing xlsm files which can then be added to rust_xlsxwriter files.

See Working with VBA Macros.

Here is an example:

use rust_xlsxwriter::{Button, Workbook, XlsxError};

fn main() -> Result<(), XlsxError> {
    // Create a new Excel file object.
    let mut workbook = Workbook::new();

    // Add the VBA macro file.
    workbook.add_vba_project("examples/vbaProject.bin")?;

    // Add a worksheet and some text.
    let worksheet = workbook.add_worksheet();

    // Widen the first column for clarity.
    worksheet.set_column_width(0, 30)?;

    worksheet.write(2, 0, "Press the button to say hello:")?;

    // Add a button tied to a macro in the VBA project.
    let button = Button::new()
        .set_caption("Press Me")
        .set_macro("say_hello")
        .set_width(80)
        .set_height(30);

    worksheet.insert_button(2, 1, &button)?;

    // Save the file to disk. Note the `.xlsm` extension. This is required by
    // Excel or it raise a warning.
    workbook.save("macros.xlsm")?;

    Ok(())
}

Output:

jmcnamara · 2024-07-26T09:28:31Z

I've added support for cell Notes (previously called Comments) in v0.72.0.

See https://docs.rs/rust_xlsxwriter/latest/rust_xlsxwriter/struct.Note.html

Here is an example:

use rust_xlsxwriter::{Note, Workbook, XlsxError};

fn main() -> Result<(), XlsxError> {
    // Create a new Excel file object.
    let mut workbook = Workbook::new();

    // Add a worksheet to the workbook.
    let worksheet = workbook.add_worksheet();

    // Widen the first column for clarity.
    worksheet.set_column_width(0, 16)?;

    // Write some data.
    let party_items = [
        "Invitations",
        "Doors",
        "Flowers",
        "Champagne",
        "Menu",
        "Peter",
    ];
    worksheet.write_column(0, 0, party_items)?;

    // Create a new worksheet Note.
    let note = Note::new("I will get the flowers myself").set_author("Clarissa Dalloway");

    // Add the note to a cell.
    worksheet.insert_note(2, 0, &note)?;

    // Save the file to disk.
    workbook.save("notes.xlsx")?;

    Ok(())
}

And the output:

I didn't port some of the Python Note/Comment features such as note positioning since they weren't widely used and Excel's implementation tends to surprise people. If people ask of them I'll add them. The infrastructure is already in place.

Note, in versions of Excel prior to Office 365 Notes were referred to as "Comments". The name Comment is now used for a newer style threaded comment and Note is used for the older non threaded version. The newer Threaded Comments are unlikely to be added to rust_xlsxwriter due to fact that it relies on company specific metadata to identify the comment author.

As an aside the internal traits that I had put in place for other worksheet objects (images, charts, buttons) made this feature relatively easy to add. I really like this aspect of Rust where some of the abstractions can give very clean and easy to maintain/refactor code. Overall I really enjoy using Rust as a language.

jmcnamara · 2024-08-23T23:50:26Z

I've uploaded version v0.74.0 of rust_xlsxwriter which adds methods to format cells separately from the data writing functions.

In Excel the data in a worksheet cell is comprised of a type, a value and a format. When using rust_xlsxwriter the type is inferred and the value and format are generally written at the same time using methods like Worksheet::write_with_format().

However, if required you can now write the data separately and then add the format using the new methods like Worksheet::set_cell_format(),Worksheet::set_range_format() and Worksheet::set_range_format_with_border().

For example you can now create border formatting like this with a single method call:

This has always been a heavily requested feature in the Python version but due to some different design decisions it was never easy to implement.

jmcnamara · 2024-09-02T20:07:40Z

I've released v0.75.0 of rust_xlsxwriter which removes the dependency on the regex.rs crate for smaller binary sizes. The only non-optional dependency is now zip.rs.

An example of the size difference is shown below for one of the sample apps:

`app_hello_world`	v0.74.0	v0.75.0
Debug	9.2M	4.2M
Release	3.4M	1.6M

See the discussion at #108

jmcnamara · 2024-09-12T15:07:12Z

Version v0.76.0 of rust_xlsxwriter is out with support for adding Textbox shapes to worksheets.

See the documentation for Shape.

This is the last of the "Larger functionality" features. We are probably on track to be feature complete with, or beyond, the Python version by the end of the year.

jmcnamara · 2024-09-13T12:29:42Z

Folks, I am looking for some input on "constant_memory" mode for rust_xlsxwriter: #111

jmcnamara · 2024-09-17T23:06:02Z

Released version v0.77.0 of rust_xlsxwriter with support for Chartsheets.

A Chartsheet in Excel is a specialized type of worksheet that doesn't have cells but instead is used to display a single chart. It supports worksheet display options such as headers and footers, margins, tab selection and print properties.

Chartsheets aren't widely used these days (as far as I can see) but end users sometimes request this feature.

jmcnamara · 2024-09-21T16:49:29Z

I have an initial working version of the "constant memory" mode on the constant_memory branch (see #111 ). It currently has limited functionality but there is enough to allow me to benchmark the potential savings. I'm reposting the results here for a wider audience and hopefully some feedback/testing.

The memory usage profile is effectively flat (as designed):

Cells	Standard - Size (MB)	Constant Memory - Size (MB)	Standard - Time (s)	Constant Memory - Time (s)
100,000	16.213	0.021	0.101	0.088
200,000	32.405	0.021	0.214	0.179
300,000	52.794	0.021	0.335	0.276
400,000	64.793	0.021	0.443	0.369
500,000	76.792	0.021	0.564	0.468
600,000	105.569	0.021	0.673	0.564
700,000	117.567	0.021	0.768	0.669
800,000	129.567	0.021	0.874	0.799
900,000	141.566	0.021	1.002	0.862
1,000,000	153.565	0.021	1.081	1.022

Which looks like this:

Similarly to the Python version the performance is also slightly better (5-15%) in this mode. Lower time is better. However for numeric only/heavy data (which is the case in practice) the performance is more of less the same.

The tests were run like this:

./target/release/examples/app_memory_test 4000
./target/release/examples/app_memory_test 4000 --constant-memory

So the initial results are good. I'll continue with the functionality.

It would be good to have a few other eyes on this in #111.

hackers267 · 2024-09-23T05:48:11Z

How about the serde sub-struct flattening support?

jmcnamara · 2024-09-23T07:54:16Z

How about the serde sub-struct flattening support?

@hackers267 It is probably worth opening a feature request for that.

The main problem is how would a sub-structure be flattened in the 2D cell matrix of a worksheet. The obvious approach would be to flatten the sub-structure into a a string in a cell, but that might not be a great solution for some (most?) users.

So it is probably best to kick off a Feature Request issue with an example of your data structure and also an Excel example of what you think the output should look like. Anyone else who is facing this issue can then add their opinion/suggestion.

Christoph-AK · 2024-09-23T08:01:42Z

I have an initial working version of the "constant memory" mode on the constant_memory branch (see #111 ).

Wow, this looks incredibly promising. Are there reasons not to use this as default, or maybe switch to it intelligently if only supported features are used?

hackers267 · 2024-09-23T09:16:31Z

How about the serde sub-struct flattening support?

@hackers267 It is probably worth opening a feature request for that.

The main problem is how would a sub-structure be flattened in the 2D cell matrix of a worksheet. The obvious approach would be to flatten the sub-structure into a a string in a cell, but that might not be a great solution for some (most?) users.

So it is probably best to kick off a Feature Request issue with an example of your data structure and also an Excel example of what you think the output should look like. Anyone else who is facing this issue can then add their opinion/suggestion.

@jmcnamara I'm dealing with a scenario where in the generated excel, there are some fixed columns like name, age, gender, etc. and some unfixed columns like 2022 results, 2023 results, January 2024 results... Until results of the month. In this, the column that is fixed can be represented by the field of Struct, while the column that changes can be represented by the sub-struct of the flatten of serde, which can be of HashMap type. I think this is one of the application scenarios for sub-structure.

jmcnamara · 2024-09-23T16:28:28Z

Wow, this looks incredibly promising. Are there reasons not to use this as default, or maybe switch to it intelligently if only supported features are used?

@Christoph-AK There are a number of restrictions around "constant memory" mode which don't make it suitable for all use cases. Such as:

Data needs to be written in strict row/column order. Once row n has been written it is no longer possible to write to any row that is n - 1. This is generally okay when you are converting large datasets that are structured in that way but even then they are often structured based on columns (like Pandas or Polars).
Some Excel functionality like merged ranges, tables and array formulas write across multiple rows in one go and therefore they move the row number forward and don't allow the user to update previous rows. I will fix this in the upstreamed version using a lookahead buffer for the merge ranges and array formula cases. Tables may take a bit longer to resolve since it contains more edges cases. I fixed all of these cases.
Cell formatting that is separate from the data won't be supported.
~~Embedded (but not inserted) images need to adhere to the row by row order~~. Fixed
Worksheet::autofit() doesn't work unless you run it at the end of each row.
The "constant memory" mode works by writing data to disk instead of keeping it in memory and this means that each worksheet in constant memory mode consumes a filehandle. This has been been an issue with users of the Python/C versions who wanted to create workbooks with thousands of worksheets and ran into the ulimit for open files. Not a common occurrence but still one that happened enough for it to be reported.
On systems where the disk is loaded in memory there isn't any memory saving. :-) This has also been reported a few times.

So in general there are a few too many gotchas to turn it on as the default. In fact it will probably be behind a feature flag since it will require an additional dependency on the tempfile crate.

However, for use cases where it makes sense it will be a big efficiency gain. Also, it is possible to turn it on on a worksheet by worksheet basis so users can mix and match worksheets as required.

jmcnamara · 2024-09-23T16:43:34Z

I'm dealing with a scenario where in the generated excel, there are some fixed columns like name, age, gender, etc. and some unfixed columns like 2022 results, 2023 results, January 2024 results... Until results of the month. In this, the column that is fixed can be represented by the field of Struct, while the column that changes can be represented by the sub-struct of the flatten of serde, which can be of HashMap type

@hackers267 Thanks for the description. The HashMap type at least may be feasible to support. Open a feature request and put in a sample struct and some example data and I will look at what can be done.

jmcnamara · 2024-09-30T23:30:51Z

Released v0.78.0 with support for "constant memory" mode to reduce memory usage when writing large worksheets.

The constant_memory mode works by flushing the current row of data to disk when the user writes to a new row of data. This limits the overhead to one row of data stored in memory. Once this happens it is no longer possible to write to a previous row since the data in the Excel file must be in row order. As such this imposes the limitation of having to structure your code to write in row by row order. The benefit is that the required memory usage is very low,and effectively constant, regardless of the amount of data written.

@jpmckinney You asked about this a while back. If you are still interested you can check it out.

jmcnamara · 2024-10-04T19:39:41Z

I've released v0.79.0 of rust_xlsxwriter with support for files larger than 4GB.

The rust_xlsxwriter library uses the zip.rs crate to provide the zip container for the xlsx file that it generates. The size limit for a standard zip file is 4GB for the overall container or for any of the uncompressed files within it. Anything greater than that requires ZIP64 support. In practice this would apply to worksheets with approximately 150 million cells, or more.

See Workbook::use_zip_large_file().

jmcnamara · 2024-11-01T08:08:52Z

For anyone who is interested I wanted to highlight a new wasm wrapper for rust_xlsxwriter that looks fairly complete even in the initial stages: https://github.com/estie-inc/wasm-xlsxwriter

Update: the performance of this looks good: #38 (comment)

brunoargolo · 2024-11-13T21:13:06Z

Hi @jmcnamara,

I currently use nodejs/exceljs to write very large xlsx files. I'm looking into potentially faster alternatives, including this lib.

I have created a quick benchmark comparing different language/lib combinations https://github.com/brunoargolo/excel-writing-multilanguage-benchmark

I'm a 100% beginner in Rust, GO and Python. So I might have royally fudged something.

I was hoping Rust would prove significantly faster than node but its pretty much a tie.

If you find any mistakes in my approach please let me know, I tried to make the repo as easy to reproduce as possible so anyone can test it.

Node and Go are using the streaming function of their respective libs, on python I used the constant memory mode on rust I tried both low_memory and constant memory with similar results.

Language + Lib	1 sheet	9 sheets
Nodejs with exceljs	5s	41s
Go with excelize	5s	30s
Python3 with xlsxwriter	26s	236s
Rust with rust_xlsxwriter	5s	40s

jmcnamara · 2024-11-13T23:42:01Z

I tried to make the repo as easy to reproduce as possible so anyone can test it.

Thanks. That is helpful.

Overall, I don't see anything wrong with the benchmarks although the rust version does a bit too much type conversion. A real world example would probably use serde_json, or similar, and do the type conversion as part of the read step. However, this adds at most a 10% overhead to the write phase. Also, in this particular case the add_worksheet_with_constant_memory() method should be 5-10% faster.

So overall, with some fine tuning and you could get 1.1-1.2 times better performance but that would still be in the same ballpark as the nodejs/exceljs solution, so probably not worth the effort of changing.

I also crosschecked with the examples/app_perf_test.rs program for a similar number of cells and the runtime was approximately the same. For comparison could you run this command from the root of the rust_xlsxwriter repo and give me the best of 3 results for comparison:

cargo run --example app_perf_test --release 140240

brunoargolo · 2024-11-14T14:44:08Z

I really appreciate you taking the time to go over this, thank you!

My guess is we're probably hitting the limits of what is viable in any language... I imagine we're not going to see 2x or 3x faster no matter the language/lib combination.

The one thing that might be significantly faster could be writing to XLSB format, but it is probably a ridiculous amount of work...

Do you still have any major performance related items on the roadmap or do you think this is mostly as good as it gets plus or minus a few percentage points?

jmcnamara · 2024-11-14T15:17:27Z

My guess is we're probably hitting the limits of what is viable in any language... I imagine we're not going to see 2x or 3x faster no matter the language/lib combination.

Agreed. The bottlenecks come down to IO/writing the worksheet.xml files and zipping them and other supporting files into a zip container. So most implementations will probably hit the same performance floor. Notwithstanding that I'm impressed with the ExcelJS performance.

Do you still have any major performance related items on the roadmap or do you think this is mostly as good as it gets plus or minus a few percentage points?

There may be some small (< 10%) IO/writing gains still to be made but most of the low hanging fruit have been picked. A potentially bigger gain would probably come from a threaded/parallelized zip library.

The C library gets a 20-30% performance improvement from using the Milo Yip DTOA library library for float to string conversion but I didn't get a similar speed increase from the ryu library except in very large files.

It is also possible that a properly parallelized backend worksheet assembly using rayon or similar might improve multiple worksheet writing. I have thread::scope parallelization in the backend but someone with more multithreaded rust experience might be able to do better:
https://github.com/jmcnamara/rust_xlsxwriter/blob/main/src/packager.rs#L122-L135

But to take a step back it is worth noting that the Rust version is at almost 90% of the C library performance so any 2-3x gain is unlikely without a radically different approach.

The one thing that might be significantly faster could be writing to XLSB format, but it is probably a ridiculous amount of work...

Yes. I maintained a library for the similar XLS format and it wasn't fun to maintain/extend. So for me that would be a non-runner. :-)

P.S. I would still be interested in the app_perf_test performance, see above, if you get a chance to run it.

Christoph-AK · 2024-11-14T15:26:09Z

I wonder if a different allocator can give more performance in this benchmark.

Combining all approaches (allocator, faster serialisation and floats, better parallelization) would probably approach the go time.

I think garbage collected languages like go and js have an advantage vs default rust here as the gc can easier reuse allocated memory, and go does have an advantage when it comes to easily throwing multithreading at everything.

Of cause all of those characteristics can be programmed in rust with some care, so there shouldn't really be a language other then c thats inherently faster.

It might be interesting to throw the js, go and rust versions into flamegraphs and watch where each spend their time.

jmcnamara · 2024-12-07T17:28:45Z

I've released v0.80.0 or rust_xlsxwriter with an improvement/fix for heap memory usage in constant_memory mode. This is a recommended upgrade for anyone using that mode/feature. See #120 for more information.

Zcb991 · 2025-01-28T08:41:07Z

Are there any plans to support batch writing of multiple lines? Will it be much faster if it supports batch writing of multiple rows?

Zcb991 · 2025-01-28T08:46:26Z

Are there any plans to support batch writing of multiple lines? Will it be much faster if it supports batch writing of multiple rows?是否有任何计划支持多行写作？如果支持多行的批处理写作，会更快吗？

fn write_to_excel(
    file_name: &str,
    headers: &[String],
    data: &[Vec<String>],
) -> Result<(), XlsxError> {
    let mut workbook = Workbook::new();
    let worksheet = workbook.add_worksheet_with_constant_memory();
    worksheet.write_row(0, 0, headers)?;
    for (row_index, row) in data.iter().enumerate() {
        worksheet.write_row((row_index + 1) as u32, 0, row)?;

        // worksheet.write_multi_row(row, col, data)?;  ??? like this.
    }
    workbook.save(file_name)?;
    Ok(())
}

jmcnamara · 2025-01-28T09:53:15Z

Are there any plans to support batch writing of multiple lines?

The Worksheet::write_row_matrix() method does this.

Will it be much faster if it supports batch writing of multiple rows?

No, it is just syntactic sugar around a loop. It is effectively the same as the code you show above. Batching won't help performance here.

See the Performance section of the rust_xlsxwrier docs for some options on increasing performance.

jmcnamara mentioned this issue Sep 24, 2022

Rust port of xlsxwriter jmcnamara/XlsxWriter#910

Closed

jmcnamara changed the title ~~Roadmap~~ rust_xlsxwriter Roadmap Sep 24, 2022

jmcnamara mentioned this issue Sep 24, 2022

Rust port of libxlsxwriter jmcnamara/libxlsxwriter#379

Closed

kevinmatthes mentioned this issue Nov 3, 2022

question: editing existing files planned? #10

Closed

jmcnamara mentioned this issue Nov 7, 2022

Don't depend on tempfile #12

Closed

jmcnamara mentioned this issue Jan 11, 2023

Add DataFrame.write_excel pola-rs/polars#5568

Closed

jmcnamara mentioned this issue Apr 17, 2023

feature request: Change Rc to Arc #29

Closed

jmcnamara added the roadmap Implementation of a feature on the roadmap label Apr 20, 2023

Repository owner deleted a comment from claudiofsr Apr 22, 2023

jmcnamara mentioned this issue Feb 3, 2024

Is it possible for us to add comments to cells? #79

Closed

jmcnamara closed this as completed Jul 26, 2024

jmcnamara reopened this Jul 26, 2024

Expurple mentioned this issue Aug 26, 2024

feature request: row/column walkers (would you accept a PR?) #109

Closed

jmcnamara mentioned this issue Nov 22, 2024

Help wanted: create a Wasm example/tutorial using rust_xlsxwriter #38

Closed

rust_xlsxwriter Roadmap #1

rust_xlsxwriter Roadmap #1

Comments

jmcnamara commented Sep 24, 2022 • edited Loading

Phase 1: Basic functionality

Phase 2: Medium functionality

Phase 3: Larger functionality

jmcnamara commented Sep 24, 2022 • edited Loading

Initial performance data:

pickfire commented Jan 11, 2023

jmcnamara commented Jan 11, 2023

claudiofsr commented Apr 14, 2023 • edited by jmcnamara Loading

jmcnamara commented Apr 14, 2023 • edited Loading

Christoph-AK commented May 12, 2023

jmcnamara commented May 12, 2023

jmcnamara commented May 26, 2023 • edited Loading

plus1xp-everyday commented Jul 23, 2023

jmcnamara commented Jul 23, 2023 • edited Loading

jmcnamara commented Aug 20, 2023

jmcnamara commented Nov 21, 2023 • edited Loading

jmcnamara commented Dec 9, 2023

Expurple commented Feb 1, 2024

Christoph-AK commented Feb 1, 2024

jmcnamara commented Feb 1, 2024 • edited Loading

jmcnamara commented Feb 26, 2024

jmcnamara commented Jul 19, 2024

Explanation

jmcnamara commented Jul 26, 2024 • edited Loading

jmcnamara commented Aug 23, 2024

jmcnamara commented Sep 2, 2024

jmcnamara commented Sep 12, 2024 • edited Loading

jmcnamara commented Sep 13, 2024

jmcnamara commented Sep 17, 2024

jmcnamara commented Sep 21, 2024

hackers267 commented Sep 23, 2024

jmcnamara commented Sep 23, 2024

Christoph-AK commented Sep 23, 2024

hackers267 commented Sep 23, 2024

jmcnamara commented Sep 23, 2024 • edited Loading

jmcnamara commented Sep 23, 2024

jmcnamara commented Sep 30, 2024

jmcnamara commented Oct 4, 2024

jmcnamara commented Nov 1, 2024 • edited Loading

brunoargolo commented Nov 13, 2024

jmcnamara commented Nov 13, 2024 • edited Loading

brunoargolo commented Nov 14, 2024

jmcnamara commented Nov 14, 2024 • edited Loading

Christoph-AK commented Nov 14, 2024 • edited Loading

jmcnamara commented Dec 7, 2024

Zcb991 commented Jan 28, 2025

Zcb991 commented Jan 28, 2025

jmcnamara commented Jan 28, 2025 • edited Loading

jmcnamara commented Sep 24, 2022 •

edited

Loading

jmcnamara commented Sep 24, 2022 •

edited

Loading

claudiofsr commented Apr 14, 2023 •

edited by jmcnamara

Loading

jmcnamara commented Apr 14, 2023 •

edited

Loading

jmcnamara commented May 26, 2023 •

edited

Loading

jmcnamara commented Jul 23, 2023 •

edited

Loading

jmcnamara commented Nov 21, 2023 •

edited

Loading

jmcnamara commented Feb 1, 2024 •

edited

Loading

jmcnamara commented Jul 26, 2024 •

edited

Loading

jmcnamara commented Sep 12, 2024 •

edited

Loading

jmcnamara commented Sep 23, 2024 •

edited

Loading

jmcnamara commented Nov 1, 2024 •

edited

Loading

jmcnamara commented Nov 13, 2024 •

edited

Loading

jmcnamara commented Nov 14, 2024 •

edited

Loading

Christoph-AK commented Nov 14, 2024 •

edited

Loading

jmcnamara commented Jan 28, 2025 •

edited

Loading