These variants are implemented mainly for LANPR to run smoothly with its double precision internal calculations.
Matrix multiply function needs a new __SSE2__ implementation.
Differential D5499
Several double variants for BLI_math
Authored by YimingWu (NicksBest) on Aug 16 2019, 5:24 AM.
Updated two functions, mul_m4_m4m4_db_uniq() and mul_m4db_m4db_m4fl_uniq(), for __SSE2__ support.

__m128d is double[2] instead of double[4], but __m256d needs AVX support. I removed the SIMD instructions for those functions and left the optimization to the compiler. The double version of the matrix multiply is not frequently called, so there should be little performance impact from this.

This patch has an approved status, but it has not been committed yet, so I'm assuming the other reviewers are blocking. Updating the reviewers list to reflect this, so that the patch status still shows as Needs Review. Though in this case we can probably just drop this?
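
For context on the SIMD comment above: a 128-bit SSE2 register (__m128d) holds only two doubles, so one row of a double[4][4] matrix does not fit in a single register the way a float row fits in __m128. A plain scalar loop, left to the compiler to optimize, might look like the minimal sketch below. The function name mul_m4db_sketch is hypothetical and is not the function from this patch; the indexing order and the requirement that the output must not alias either input are assumptions made for illustration.

```c
#include <assert.h>

/* Hypothetical sketch, not the code from this patch.
 * Multiplies two 4x4 double matrices with plain scalar loops and
 * leaves any vectorization to the compiler.
 * Assumption: r must not alias a or b, because r is written
 * while a and b are still being read. */
void mul_m4db_sketch(double r[4][4], const double a[4][4], const double b[4][4])
{
  assert((const void *)r != (const void *)a);
  assert((const void *)r != (const void *)b);

  for (int i = 0; i < 4; i++) {
    for (int j = 0; j < 4; j++) {
      double sum = 0.0;
      for (int k = 0; k < 4; k++) {
        /* Assumed indexing convention: r[i][j] = sum over k of b[i][k] * a[k][j]. */
        sum += b[i][k] * a[k][j];
      }
      r[i][j] = sum;
    }
  }
}
```

Multiplying by the identity matrix on either side should return the other matrix unchanged, which is a quick sanity check for a routine like this.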