-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Parallel performance #12
Comments
You may check the following:
If the above two tests do not change the situation, please send me the mtx and rhs files and I will test them. For the current version (20230325), oparm[27] (for row mode, i.e., Ax=b) and oparm[28] (for column mode, i.e., (A^T)x=b) reflect the solving algorithms. Their meaning is a little complex. They are two-digit numbers in decimalism. The least significant digit is the solving algorithm for L, and the other digit is for U. 0 means sequential and 1,2,3 mean the corresponding parallel algorithms. For example, if oparm[27]=20, it means the row mode solving uses sequential algorithm for Ly=b and uses the 2nd parallel algorithm for Ux=y. |
Thank you for the answer, it took some time to check all I could. My Windows test program is not in Visual Studio (Delphi), and uses the DLL through the C interface. Also may I ask if CKTSO uses custom created threads or it is based on a ready-made solution like OpenMP? Anyway I modified the benchmark program so it can read unsorted MTX files too. It may be useful for you too. On this page you can find the modified benchmark VS project, if interested. Thank you for any suggestions. |
Thank you for providing the detailed information. I first provide some quick answers. After I try your cases, I will answer other questions. |
My main problem is, that checking oparm[27] after CKTSO_Solve, it is zero even with the example matrix add20.mtx |
If you need my test program , I attached it. Maybe you can inspect the dll calls and how cktso.dll works with that. |
Dear Mr.Chen, |
Thank you so much for providing your test cases. I have tried your cases. Here are some notes for the observations.
|
Dear Mr.Chen,
we have tested your CKTSO matrix solver in our circuit simulation software and generally liked the performance. We tried it on AMD/Intel Windows platforms, with CSR formatted matrices (row-major).
However we did not achieve multi-thread performance improvement over single-thread mode, only slowdown.
We did transient simulations - many refactorization and solve calls.
We think we are using the library as recommended in the user guide.
To check that we are doing everything properly, could you help in these:
Thank you and best regards,
Gergely
The text was updated successfully, but these errors were encountered: