Replies: 1 comment
-
Hi Bartek, I agree with both of these points. Keep in mind that DaCe itself is a programming model and optimization framework, and both of these optimizations can be implemented with SDFGs :-) We just don't have everything automated, so it must be implemented in the SDFG. However, the Regarding 512-bit reads, we have the Gearbox node, which lets you easily convert between 512-bit and lower widths. There is also an effort to integrate this into the streaming memory transformation. (disclaimer: I no longer work at the university, and will only have time to contribute basic maintenance in the future, so the rest of the team are responsible for larger upgrades) |
Beta Was this translation helpful? Give feedback.
-
Hello,
@definelicht please take a look at this document: https://www.xilinx.com/html_docs/xilinx2020_1/vitis_doc/methodologyacceleratingapplications.html#yfz1555544741101
![image](https://user-images.githubusercontent.com/47795888/153566476-a0eba0f8-d127-4a4a-b77f-70493f995a27.png)
Also all input should be 512 bit width if possible:
![image](https://user-images.githubusercontent.com/47795888/153567032-6fefd450-14a3-4cb6-8c09-c2e48ea0dccf.png)
https://github.com/Xilinx/Vitis_Accel_Examples/blob/master/performance/axi_burst_performance/details.rst
I think FPGA kernels are missing some performance because of no cache buffers. If not, please prove me wrong :)
Take care,
BR
Beta Was this translation helpful? Give feedback.
All reactions