From: | "Nathan Barnett" <nbarnett(at)cellularphones(dot)com> |
---|---|
To: | "'Timothy H(dot) Keitt'" <keitt(at)nceas(dot)ucsb(dot)edu>, <pgsql-general(at)postgresql(dot)org> |
Subject: | RE: Statistical Analysis |
Date: | 2000-07-24 20:52:52 |
Message-ID: | 71975481CD04D4118E57004033A2596E0DF949@ip205.82.136.216.in-addr.arpa |
Lists: | pgsql-general |
Tim,
Hmm... this might just work, because I could filter on myrandfunc() <
.2 and then apply a LIMIT for 10% of the rows or whatnot. That would almost
guarantee the exact number of rows.
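The oversample-then-LIMIT idea can be sketched as follows. This is only an illustration, under the assumption that Python's built-in sqlite3 module stands in for PostgreSQL; the 10,000-row table and the registered myrandfunc() are hypothetical and exist only for this example.

```python
import random
import sqlite3

random.seed(7)  # fixed seed so the sketch is repeatable

conn = sqlite3.connect(":memory:")
# Hypothetical stand-in for the thread's myrandfunc(): floats in [0, 1).
conn.create_function("myrandfunc", 0, random.random)

conn.execute("CREATE TABLE mytable (id INTEGER PRIMARY KEY)")
conn.executemany(
    "INSERT INTO mytable (id) VALUES (?)",
    [(i,) for i in range(10_000)],
)

total = conn.execute("SELECT count(*) FROM mytable").fetchone()[0]
target = total // 10  # the 10% we actually want

# Keep roughly 20% of the rows, then cut the result off at the target size.
rows = conn.execute(
    "SELECT id FROM mytable WHERE myrandfunc() < 0.2 LIMIT ?",
    (target,),
).fetchall()
sample_ids = [r[0] for r in rows]

# Row count obtained, and the largest id that made it into the sample.
print(len(sample_ids), max(sample_ids))
```

One caveat worth checking on a real table: LIMIT without an ORDER BY keeps the first rows that pass the filter, so the sample tends to come from the early part of the scan rather than from the whole table. The row count is also only "almost" guaranteed, since an unlucky run could pass fewer rows than the target.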
-----------------
Nathan Barnett
-----Original Message-----
From: keitt(at)ulysses(dot)nceas(dot)ucsb(dot)edu
[mailto:keitt(at)ulysses(dot)nceas(dot)ucsb(dot)edu]On Behalf Of Timothy H. Keitt
Sent: Monday, July 24, 2000 3:41 PM
To: Nathan Barnett
Subject: Re: [GENERAL] Statistical Analysis
You would need to add a pseudorandom number function to postgresql. If
your function returns numbers on [0, 1), then you could do:
select * from mytable where myrandfunc() < 0.1;
and get back (asymptotically) 10% of the rows. If you want exactly n
randomly chosen rows, it's a bit more expensive computationally.
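As a rough sketch of the two cases above, the following uses Python's built-in sqlite3 module as a stand-in for PostgreSQL (an assumption made only so the example runs anywhere). The table contents, the value of n, and the ORDER BY approach to the exact-n case are illustrative choices, not something specified in this message.

```python
import random
import sqlite3

random.seed(42)  # fixed seed so the sketch is repeatable

conn = sqlite3.connect(":memory:")
# Register a zero-argument SQL function returning a float in [0, 1),
# playing the role of the user-supplied myrandfunc().
conn.create_function("myrandfunc", 0, random.random)

conn.execute("CREATE TABLE mytable (id INTEGER PRIMARY KEY, val REAL)")
conn.executemany(
    "INSERT INTO mytable (id, val) VALUES (?, ?)",
    [(i, float(i)) for i in range(10_000)],
)

# Approximate sample: each row is kept independently with probability 0.1,
# so the row count is close to 10% but varies from run to run.
approx_rows = conn.execute(
    "SELECT * FROM mytable WHERE myrandfunc() < 0.1"
).fetchall()

# Exact-size sample (one possible approach): give every row a random sort
# key and keep the first n. This touches and sorts the whole table, which
# is where the extra cost comes from.
n = 100
exact_rows = conn.execute(
    "SELECT * FROM mytable ORDER BY myrandfunc() LIMIT ?",
    (n,),
).fetchall()

print(len(approx_rows), len(exact_rows))
```

The filter needs only a single pass with no sort, which is why it suits the "fairly accurate results, quickly" goal; the ORDER BY variant trades that speed for an exact row count.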
Another more involved approach would be to implement random cursors.
This would be great for bootstrapping analysis.
Tim
Nathan Barnett wrote:
>
> I am having to perform a large data analysis query fairly frequently and the
> execution time is not acceptable, so I was looking at doing a statistical
> sample of the data to get fairly accurate results. Is there a way to
> perform a query on a set number of random rows instead of the whole dataset?
> I have looked through the documentation for a function that would do this,
> but I have not seen any. If this is a RTFM type question, then feel free to
> tell me so and point me in the right direction because I just haven't been
> able to find any info on it.
>
> Thanks ahead of time.
>
> ---------------
> Nathan Barnett
--
Timothy H. Keitt
National Center for Ecological Analysis and Synthesis
735 State Street, Suite 300, Santa Barbara, CA 93101
Phone: 805-892-2519, FAX: 805-892-2510
http://www.nceas.ucsb.edu/~keitt/