![]() | This is an archive of past discussions about Cumulative distribution function. Do not edit the contents of this page. If you wish to start a new discussion or revive an old one, please do so on the current talk page. |
Archive 1 |
This page had a redirect from distribution function, which I've now made into its own article describing a related but distinct concept in physics. I'll try to modify the pages pointing here through that redirect so that the net change in the wikipedia is minimal.SMesser 16:12, 24 Feb 2005 (UTC)
I originally created the redirect cumulative density function in March to point to this article. Why? A simple google test for cumulative density function shows 41,000 hits while cumulative distribution function shows 327,000 hits. Michael Hardy's contention is that "cumulative density" is patent nonsense (see deletion log) and a redirect shouldn't exist.
Regardless of the correctness of "cumulative density", there still is a significant usage of it in reference to this article and its content. "Cumulative density function" is even used in a doctoral thesis. Hardly patent nonsense.
Even if "cumulative density function" is incorrect, someone still may look for it, find nothing, and create an article paralleling this article. If you don't buy the "it's not patent nonsense, or even just nonsense" then I invoke (from WP:R#When should we delete a redirect?) that it increases accidental linking and therefore should not be deleted.
Michael, if you have a problem with the correctness of "cumulative density" then by all means add a section here or change the redirect to an article and explain it there. Either way, cumulative density function needs to be a valid link. Cburnett 14:42, 14 December 2005 (UTC)
Please be consistent! In Probability theory the integral of the "probability density function" "PDF" is called "cumulative density function" CDF or simply "distribution function". Thus the adjective cumulative.
See http://mathworld.wolfram.com/DistributionFunction.html
The term "Cumulative distribution function" is nonsense because it implies the integral of the integral of the PDF. Utterly nonsense! Please correct this link! User:lese 4 Nov 2007.
The word "cumulative distribution function" is used in many elementary books. It is a pretty stupid term, but we are stuck with it. The best we can do is acknowledge that the term is out there, that is should simply be "distribution function" and that it's definition MUST be with <= or else many tables, software routines, etc will be incorrectly used. —Preceding unsigned comment added by Jmsteele (talk • contribs)
"Note that in the definition above, the "less or equal" sign, '≤' could be replaced with "strictly less" '<'. This would yield a different function, but either of the two functions can be readily derived from the other. The only thing to remember is to stick to either definition as mixing them will lead to incorrect results. In English-speaking countries the convention that uses the weak inequality (≤) rather than the strict inequality (<) is nearly always used."
Surely it doesn't matter at all! Since the probability of one single value is 0, hence the two interval boundaries can be included or excluded.
Please consider some very important distributions: The Binomial, Poisson, Hypergeometric. You simply MUST use the definition F(x) = P(X <= x) or else all software packages and all tables will be misundestood. PS I am a professor of statistics, so give me some slack here. This is not a matter of delta functions it is a matter of sums of coin flips ... very basic stuff.
I completely disagree with "It is conventional to use a capital F for a cumulative distribution function, in contrast to the lower-case f used for probability density functions and probability mass functions." From all the literature I have read, is the cumulative distribution function and is used for probability density/mass functions. Where's the reference to make such a bold claim that F and f are convention? See the probit article which uses for the inverse to cdf. -- Thoreaulylazy 19:13, 3 October 2006 (UTC)
====This is a collapsed disctintion. One uses Phi for the normal distribution and phi for the normal density. These are reseved symbols for these purposes --- see any statistics book. One uses F and f fo the generic distributions and densities, but these are not reserved. In many books and papers one will find G g , H h etc. Each time the capital representing distribution and the lower case the density.
I've been looking for a better algorithm to generate a random value based on an arbitrary CDF (better than the one I wrote). For example, if one would like to obtain a random value with a "flat" distribution, one can use the 'rand()' function in C's math.h . However, I wrote this function to use an arbitrary function to generate the random value:
// xmin and xmax are the range of outputs you want // ymin and ymax are the actual limits of the function you want // function is a function pointer that points to the CDF long double randfunc(double xmin, double xmax, long double (*function)(long double), double ymin, double ymax) { long double val; while(1) { if( (ymax-ymin)*( rand()/((long double)RAND_MAX + 1)) + ymin < function(val= ((xmax-xmin)*( rand()/((long double)RAND_MAX + 1)) + xmin))) { return val; } } }
I was trying to find a way to do it faster/better. If anyone knows of anything.. let me know. Fresheneesz 07:53, 27 December 2006 (UTC)
Hello,
i removed the comment that probability distribution function is the same as CDF, which i assert to be wrong. My reference is "Probability and Statistics for Engineering and the Sciences" pp 140 (J. Devore) . The PDF is the same as the probability density function, not the CDF. The CDF is the integral of the PDF, not the PDF itself.
Please comment. 129.78.208.4 05:28, 12 March 2007 (UTC)
I would also like to see probability distribution function removed as an alternative name to CDF as it only confuses readers since probability distribution function can refer to the probability mass/density fxn (http://cnx.org/content/m13466/latest/ http://mathworld.wolfram.com/Probability.html ) or cumulative dist fxn depending on the author. If it is included as an alternative name, then I think the different interpretations need to be pointed out in the same paragraph.Wiki me (talk) 17:31, 3 March 2009 (UTC)
Sixth edition, page 147:
Fifth edition, page 145:
So Devore agrees with everyone else that pdf stands for probability density function.
It appears that our anonymous user who cites Devore simply saw the words "probability density function (pdf)", made a big flying leap to the conclusion that the "d" in "pdf" stands for "distribution" rather than "density", and thought Devore was saying the probability density function is the same thing as the probability distribution function. Lousy reading comprehension. Michael Hardy (talk) 20:26, 3 March 2009 (UTC)
The formula after 'If X is a discrete random variable, then it attains values x1, x2, ... with probability pi = P(xi),' has the final sum of p(xi), why would this not be the sum of pi since we have already introduced pi? Also, I think pi = P(xi) should be introduced as pi = P(X=xi) for clarity? Chrislawrence5 17:31, 16 April 2007 (UTC)
I'll preface my comment by saying that I am not a mathematician, so I may be off base. However, should there be some sort of reminder statement that the limits (especially the lower limit, ) of integration for the expression
should also be compatible with the range of applicability of ? Some distributions are not defined over the entire range of . I was scratching my head confirming the CDF for the Pareto distribution starting with the PDF and couldn't get the listed answer until I realized this. Perhaps this would be obvious to some, but I suggest it to others who are more up on this stuff as a possible point of clarification. I will defer this change, however, to someone who is more of an authority on this. --Lacomj (talk) 21:07, 20 December 2008 (UTC)
What is the reasons to define F(x)=P[X<=x], why not F(x)=P[X>=x]? just convention or what? Jackzhp (talk) 15:24, 2 March 2009 (UTC)
There is a formula expressing CDF as an expectation of an indicator function:
Maybe it should be included somewhere, as it is probably not so obvious for people without statistical background. The formula is particularly useful for numerical estimation of function F (in which case we simply replace expectation with sample mean), and in analysis of limiting properties of estimated cdf through the use of central limit theorem. —Preceding unsigned comment added by Stpasha (talk • contribs) 00:03, 19 June 2009 (UTC)
![]() | This is an archive of past discussions about Cumulative distribution function. Do not edit the contents of this page. If you wish to start a new discussion or revive an old one, please do so on the current talk page. |
Archive 1 |