From: | Claudio Freire <klaussfreire(at)gmail(dot)com> |
---|---|
To: | Larry White <ljw1001(at)gmail(dot)com> |
Cc: | obartunov(at)gmail(dot)com, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, Bruce Momjian <bruce(at)momjian(dot)us>, Peter Geoghegan <pg(at)heroku(dot)com>, Stephen Frost <sfrost(at)snowman(dot)net>, Robert Haas <robertmhaas(at)gmail(dot)com>, Kevin Grittner <kgrittn(at)ymail(dot)com>, "pgsql-hackers(at)postgresql(dot)org" <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: jsonb format is pessimal for toast compression |
Date: | 2014-08-14 19:24:34 |
Message-ID: | CAGTBQpasTRmU0wJ113DTvD6qH46NDVdVKYAPjociH0AHkUDDJA@mail.gmail.com |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-hackers |
On Thu, Aug 14, 2014 at 3:49 PM, Larry White <ljw1001(at)gmail(dot)com> wrote:
> I attached a json file of approximately 513K. It contains two repetitions of
> a single json structure. The values are quasi-random. It might make a decent
> test case of meaningfully sized data.
I have a 59M in plain SQL (10M compressed, 51M on-disk table size)
collection of real-world JSON data.
This data is mostly counters and anciliary info stored in json for the
flexibility, more than anything else, since it's otherwise quite
structured: most values share a lot between each other (in key names)
but there's not much redundancy within single rows.
Value length stats (in text format):
min: 14
avg: 427
max: 23239
If anyone's interested, contact me personally (I gotta anonimize the
info a bit first, since it's production info, and it's too big to
attach on the ML).
From | Date | Subject | |
---|---|---|---|
Next Message | Robert Haas | 2014-08-14 19:25:43 | Re: TODO : Allow parallel cores to be used by vacuumdb [ WIP ] |
Previous Message | Robert Haas | 2014-08-14 19:20:31 | Re: minor typo in pgbench doc (2) |