From: | Arjan van de Ven <arjan(at)linux(dot)intel(dot)com> |
---|---|
To: | Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, Japin Li <japinli(at)hotmail(dot)com> |
Cc: | pgsql-hackers(at)postgresql(dot)org, mark(dot)dilger(at)enterprisedb(dot)com, andres(at)anarazel(dot)de, pgsql-hackers(at)lists(dot)postgresql(dot)org |
Subject: | Re: [PATCH v2] src/port/snprintf.c: Optimize the common base=10 case in fmtint |
Date: | 2021-10-27 22:18:13 |
Message-ID: | 834df8cf-80dc-bcf9-1a0c-653eed463428@linux.intel.com |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-hackers |
On 10/26/2021 6:39 PM, Tom Lane wrote:
> Japin Li <japinli(at)hotmail(dot)com> writes:
>> Why do we need likely() for base=10, however, base=16 and base=8 don't need?
>
> Yeah, I was a little unconvinced about that too. I concur with writing
> it as an if/else chain instead of a switch, but I'm not sure that likely()
> adds anything to that.
fair enough:
[PATCH v3] src/port/snprintf.c: Optimize the division away in fmtint
fmtint() turns an integer into a string for a given base, and to do this
it does a divide/modulo operation iteratively.
The only possible base values are 8, 10 and 16
On just about any CPU, generic divides are a pretty expensive operation, generally
10x to 20x or more expensive than adds or multiplies.
By special casing the base values, the compiler (gcc or other) can (and will)
replace the divide by a multiply with 0xcccccccccccccccd (for base 10) or bitops
for base 8 and 16, yielding a lot faster code.
I considered a switch statement, but since base 10 is the most common by far,
I implemented it as a series of if/else statements for simplicity.
Even though this only shows up in the database creation phase of pgbench and not so much
during the normal run time, the optimization is simple and high value enough that
in my opinion it's worth doing
diff --git a/src/port/snprintf.c b/src/port/snprintf.c
index 7c21429369..547a59d4a0 100644
--- a/src/port/snprintf.c
+++ b/src/port/snprintf.c
@@ -1076,11 +1076,31 @@ fmtint(long long value, char type, int forcesign, int leftjust,
else
{
/* make integer string */
- do
- {
- convert[sizeof(convert) - (++vallen)] = cvt[uvalue % base];
- uvalue = uvalue / base;
- } while (uvalue);
+
+ /*
+ * Special case each of the possible base values (8, 10, 16) to avoid an
+ * expensive divide operation
+ * (the compiler will use a multiply, shift or boolean ops for this)
+ */
+ if (base == 10) {
+ do
+ {
+ convert[sizeof(convert) - (++vallen)] = cvt[uvalue % 10];
+ uvalue = uvalue / 10;
+ } while (uvalue);
+ } else if (base == 16) {
+ do
+ {
+ convert[sizeof(convert) - (++vallen)] = cvt[uvalue % 16];
+ uvalue = uvalue / 16;
+ } while (uvalue);
+ } else if (base == 8) {
+ do
+ {
+ convert[sizeof(convert) - (++vallen)] = cvt[uvalue % 8];
+ uvalue = uvalue / 8;
+ } while (uvalue);
+ }
}
zeropad = Max(0, precision - vallen);
From | Date | Subject | |
---|---|---|---|
Next Message | Mark Dilger | 2021-10-27 22:21:43 | Re: CREATEROLE and role ownership hierarchies |
Previous Message | Bossart, Nathan | 2021-10-27 21:38:49 | Re: [PATCH v2] use has_privs_for_role for predefined roles |