icc -masm=intel generating invalid/unrecognized assembly?
Hello,I am building code that contains some asm() blocks written following Intel assembly syntax. When I compile this code for the MIC with -masm=intel, I get an assembler error, which is not related...
View ArticleEfficient branching on double vector comparison using intrinsics?
Hi,I want to check the range of a vector of double-precision variables, in order to branch to a slow path on exceptional out-of-range cases. My code looks like the following: // if(any(!(x < 4.)...
View ArticleQuad precision architecture one day?
Hello.I have a dream in which the IEEE-754 quad precision is implemented in hardware in order to allow fast extended precision computation in a portable way instead of the current 80-bit extended...
View ArticleHow to convert _mm512 to float
Is there an easy way to extract component 0 from _mm512 vector ?Looking at assembly of _mm512_reduce_gmin_ps it really computes an _mm512 (of course), which is then passed to scalar operations.I tried...
View Article_mm512_mul_epi32 not working?
#include <immintrin.h> #include <zmmintrin.h> // not needed but put here to show it is indeed included ... __m512i a,b,c; a = _mm512_mul_epi32(b,c); produces this error: undefined...
View Articlecast __m512 to __m512d
Hey all, simple question: How does the cast operation _mm512_castps_pd work?A __m512 data type holds 16 floats i.e. 16 elements. Contrary to that a __m512d data type can only hold 8 elements -- so what...
View ArticleHow to test a mask register for any non-zeros?
Dear all,I want to test a mask register for any non-zero values. I am not able to test this on my own MIC for a few days so I decided to ask here if this is possible to do efficiently, i.e. not via...
View Article