From: | Viji V Nair <viji(at)fedoraproject(dot)org> |
---|---|
To: | Jeff Janes <jeff(dot)janes(at)gmail(dot)com> |
Cc: | pgsql-performance(at)postgresql(dot)org |
Subject: | Re: Distributed/Parallel Computing |
Date: | 2009-10-06 06:51:06 |
Message-ID: | 84c89ac10910052351vdd470f8x199e3cfca91f898f@mail.gmail.com |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-performance |
Hi Jeff,
These are bulk updates of GIS data and OLTP. For example, we are running
some sqls to remove specific POIs those are intersecting with others, for
such exercise we need to compare and remove the data form diffrent tables
including the 20M data tables.
Apart form these there are bulk selects (read only) which are coming form
the client systems also.
Thanks
Viji
On Tue, Oct 6, 2009 at 8:10 AM, Jeff Janes <jeff(dot)janes(at)gmail(dot)com> wrote:
> On Mon, Oct 5, 2009 at 12:11 PM, Viji V Nair <viji(at)fedoraproject(dot)org>
> wrote:
> > Hi Team,
> >
> > This question may have asked many times previously also, but I could not
> > find a solution for this in any post. any help on the following will be
> > greatly appreciated.
> >
> > We have a PG DB with PostGIS functions. There are around 100 tables in
> the
> > DB and almost all the tables contains 1 million records, around 5 table
> > contains more than 20 million records. The total DB size is 40GB running
> on
> > a 16GB, 2 x XEON 5420, RAID6, RHEL5 64bit machines, the questions is
> >
> > 1. The geometry calculations which we does are very complex and it is
> taking
> > a very long time to complete. We have optimised PG config to the best,
> now
> > we need a mechanism to distribute these queries to multiple boxes. What
> is
> > best recommended way for this distributed/parallel deployment. We have
> tried
> > PGPOOL II, but the performance is not satisfactory. Going for a try with
> > GridSQL
>
> What is the nature of the transactions being run? Are they primarily
> read-only other than bulk updates to the GIS data, are they OLTP in
> regards to the GIS data, or are they transactional with regards to
> other tables but read-only with respect to the GIS?
>
> Jeff
>
From | Date | Subject | |
---|---|---|---|
Next Message | keshav upadhyaya | 2009-10-06 07:28:01 | What is the role of #fsync and #synchronous_commit in configuration file . |
Previous Message | Jeff Janes | 2009-10-06 02:40:50 | Re: Distributed/Parallel Computing |