Does anyone have recommendations for hardware and/or OS to work with
around 5TB datasets?
The data is for analysis, so there is virtually no inserting besides a
big bulk load. Analysis involves full-database aggregations - mostly
basic arithmetic and grouping. In addition, much smaller subsets of data
would be pulled and stored to separate databases.
I have been working with datasets no bigger than around 30GB, and that
(I'm afraid to admit) has been in MSSQL.
Thanks,
Adam