I want to create a histogram of vertical gradients in an 8 bit gray image. The vertical distance to calculate the gradient can be specified. I already managed to speed up another part of my code using Intrinsics, but it does not work here. The code runs without exception if the _mm_store_si128 is commented out. When it is not commented, I get an access violation.
You can find the question and the code listed here:
[url removed, login to view]
This should be very easy for programmers experienced with C++ and Intrinsics.
Please bid only if you know the solution already.
You should provide:
1. suggest a working bugfix to the code I provided
2. provide a second solution based on AVX/AVX2