From: | "Mkrtchyan, Tigran" <tigran(dot)mkrtchyan(at)desy(dot)de> |
---|---|
To: | "Graeme B(dot) Bell" <graeme(dot)bell(at)nibio(dot)no> |
Cc: | Steve Crawford <scrawford(at)pinpointresearch(dot)com>, "Wes Vaske (wvaske)" <wvaske(at)micron(dot)com>, pgsql-performance <pgsql-performance(at)postgresql(dot)org> |
Subject: | Re: New server: SSD/RAID recommendations? |
Date: | 2015-07-07 10:28:18 |
Message-ID: | 1253229687.3798367.1436264898828.JavaMail.zimbra@desy.de |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-performance |
Thanks for the Info.
So if RAID controllers are not an option, what one should use to build
big databases? LVM with xfs? BtrFs? Zfs?
Tigran.
----- Original Message -----
> From: "Graeme B. Bell" <graeme(dot)bell(at)nibio(dot)no>
> To: "Steve Crawford" <scrawford(at)pinpointresearch(dot)com>
> Cc: "Wes Vaske (wvaske)" <wvaske(at)micron(dot)com>, "pgsql-performance" <pgsql-performance(at)postgresql(dot)org>
> Sent: Tuesday, July 7, 2015 12:22:00 PM
> Subject: Re: [PERFORM] New server: SSD/RAID recommendations?
> Completely agree with Steve.
>
> 1. Intel NVMe looks like the best bet if you have modern enough hardware for
> NVMe. Otherwise e.g. S3700 mentioned elsewhere.
>
> 2. RAID controllers.
>
> We have e.g. 10-12 of these here and e.g. 25-30 SSDs, among various machines.
> This might give people idea about where the risk lies in the path from disk to
> CPU.
>
> We've had 2 RAID card failures in the last 12 months that nuked the array with
> days of downtime, and 2 problems with batteries suddenly becoming useless or
> suddenly reporting wildly varying temperatures/overheating. There may have been
> other RAID problems I don't know about.
>
> Our IT dept were replacing Seagate HDDs last year at a rate of 2-3 per week (I
> guess they have 100-200 disks?). We also have about 25-30 Hitachi/HGST HDDs.
>
> So by my estimates:
> 30% annual problem rate with RAID controllers
> 30-50% failure rate with Seagate HDDs (backblaze saw similar results)
> 0% failure rate with HGST HDDs.
> 0% failure in our SSDs. (to be fair, our one samsung SSD apparently has a bug
> in TRIM under linux, which I'll need to investigate to see if we have been
> affected by).
>
> also, RAID controllers aren't free - not just the money but also the management
> of them (ever tried writing a complex install script that interacts work with
> MegaCLI? It can be done but it's not much fun.). Just take a look at the
> MegaCLI manual and ask yourself... is this even worth it (if you have a good
> MTBF on an enterprise SSD).
>
> RAID was meant to be about ensuring availability of data. I have trouble
> believing that these days....
>
> Graeme Bell
>
>
> On 06 Jul 2015, at 18:56, Steve Crawford <scrawford(at)pinpointresearch(dot)com> wrote:
>
>>
>> 2. We don't typically have redundant electronic components in our servers. Sure,
>> we have dual power supplies and dual NICs (though generally to handle external
>> failures) and ECC-RAM but no hot-backup CPU or redundant RAM banks and...no
>> backup RAID card. Intel Enterprise SSD already have power-fail protection so I
>> don't need a RAID card to give me BBU. Given the MTBF of good enterprise SSD
>> I'm left to wonder if placing a RAID card in front merely adds a new point of
>> failure and scheduled-downtime-inducing hands-on maintenance (I'm looking at
>> you, RAID backup battery).
>
>
>
> --
> Sent via pgsql-performance mailing list (pgsql-performance(at)postgresql(dot)org)
> To make changes to your subscription:
> http://www.postgresql.org/mailpref/pgsql-performance
From | Date | Subject | |
---|---|---|---|
Next Message | Graeme B. Bell | 2015-07-07 10:38:10 | Re: New server: SSD/RAID recommendations? |
Previous Message | Graeme B. Bell | 2015-07-07 10:22:00 | Re: New server: SSD/RAID recommendations? |