Is there an easy way to extract component 0 from _mm512 vector ?
Looking at assembly of _mm512_reduce_gmin_ps it really computes an _mm512 (of course), which is then passed to scalar operations.
I tried doing
static inline float _mm512_get_first_ps(_mm512 v)
{
return v.__m512_f32[0] ;
}
but this does not work..