| From: | "Amonson, Paul D" <paul(dot)d(dot)amonson(at)intel(dot)com> |
|---|---|
| To: | Nathan Bossart <nathandbossart(at)gmail(dot)com> |
| Cc: | Bruce Momjian <bruce(at)momjian(dot)us>, Alvaro Herrera <alvherre(at)alvh(dot)no-ip(dot)org>, Andres Freund <andres(at)anarazel(dot)de>, "pgsql-hackers(at)lists(dot)postgresql(dot)org" <pgsql-hackers(at)lists(dot)postgresql(dot)org>, "Shankaran, Akash" <akash(dot)shankaran(at)intel(dot)com> |
| Subject: | RE: Proposal for Updating CRC32C with AVX-512 Algorithm. |
| Date: | 2024-08-26 18:44:55 |
| Message-ID: | BL1PR11MB530433DCF3F3EC3EFC5A3430DC8B2@BL1PR11MB5304.namprd11.prod.outlook.com |
| Lists: | pgsql-hackers |
> I'm curious about where exactly the regression is coming from. Is it possible
> that your build for the SSE 4.2 tests was using it unconditionally, i.e.,
> optimizing away the function pointer?
I am calling the SSE 4.2 implementation directly; I am not even building the pg_sse42_*_choose.c file with the AVX512 choice. As best I can tell, the only added overhead when the input is <256 bytes is one extra function call, one extra int64 conditional test, and of course a JMP instruction to skip the AVX512 implementation.
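
For illustration only (this is not code from the patch), the overhead described above can be sketched as a small length-based dispatcher. All names here (`sketch_crc32c_dispatch`, `sketch_crc32c_sse42`, `sketch_crc32c_avx512`, `SKETCH_AVX512_MIN_LEN`) are hypothetical, and both "implementations" are stand-ins that share a portable bitwise CRC32C loop, so the sketch shows only the control flow, not the SIMD code:

```c
#include <stddef.h>
#include <stdint.h>

/* Assumed threshold, taken from the "<256 bytes" figure in the message. */
#define SKETCH_AVX512_MIN_LEN 256

/* Counts how often the (stand-in) AVX-512 path is taken; for demonstration. */
static int sketch_avx512_calls = 0;

/* Portable bitwise CRC32C update (reflected polynomial 0x82F63B78). */
static uint32_t
sketch_crc32c_bitwise(uint32_t crc, const void *data, size_t len)
{
    const unsigned char *p = data;

    while (len-- > 0)
    {
        crc ^= *p++;
        for (int i = 0; i < 8; i++)
            crc = (crc >> 1) ^ (0x82F63B78u & (0u - (crc & 1u)));
    }
    return crc;
}

/* Stand-in for the SSE 4.2 routine (hypothetical name). */
static uint32_t
sketch_crc32c_sse42(uint32_t crc, const void *data, size_t len)
{
    return sketch_crc32c_bitwise(crc, data, len);
}

/* Stand-in for the AVX-512 routine (hypothetical name). */
static uint32_t
sketch_crc32c_avx512(uint32_t crc, const void *data, size_t len)
{
    sketch_avx512_calls++;
    return sketch_crc32c_bitwise(crc, data, len);
}

/*
 * Dispatcher: short inputs pay for this extra call, one length comparison,
 * and a branch before reaching the SSE 4.2 code.
 */
static uint32_t
sketch_crc32c_dispatch(uint32_t crc, const void *data, size_t len)
{
    if (len < SKETCH_AVX512_MIN_LEN)
        return sketch_crc32c_sse42(crc, data, len);
    return sketch_crc32c_avx512(crc, data, len);
}
```

A caller would start from `0xFFFFFFFF` and XOR the result with `0xFFFFFFFF` at the end, as is conventional for CRC32C. Whether the compiler inlines the short path or keeps it as a separate call is exactly the kind of detail that would decide how much of this overhead shows up in a benchmark.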
Paul