如何在Perl中计算给定正态分布的点的概率？-解网

问：

Perl 中是否有一个包可以让你计算每个给定点的概率分布高度。例如，这可以在 R 中以这种方式完成：

> dnorm(0, mean=4,sd=10)
> 0.03682701

也就是说，点 x=0 落入正态分布的概率为 0.0368，均值 = 4，sd=10。我看了一下 Statistics：:D istribution，但它并没有给出很多函数来做到这一点。

Perl R 统计概率

use strict; use warnings;

use Math::SymbolicX::Statistics::Distributions qw/normal_distribution/;

my $norm = normal_distribution(qw/mean sd/);
print $norm->value(mean => 4, sd => 10, x => 0), "\n";

# curry it with the parameter values
$norm->implement(mean => 4, sd => 10);
print $norm->value(x => 0),"\n"; # prints the same as above

该模块中的 normal_distribution（）函数是函数的生成器。$norm将是可以修改的 Math：：Symbolic （：：Operator）对象。例如，使用 implement，在上面的示例中，它将两个参数变量替换为常量。

然而，请注意，正如 Dirk 所指出的，你可能想要正态分布的累积函数。或者更一般地说，是一定范围内的积分。

不幸的是，Math：：Symbolic 不能以符号方式进行积分。因此，您必须求助于Math：：Integral：：Romberg之类的数值积分。（或者，在 CPAN 中搜索错误函数的实现。这可能很慢，但仍然很容易做到。将以下内容添加到上面的代码片段中：

use Math::Integral::Romberg 'integral';

my ($int_sub) = $norm->to_sub(); # compile to a faster Perl sub
print $int_sub->(0),"\n";  # same number as above

print "p=" . integral($int_sub, -100., 0) . "\n";
# -100 is an arbitrary, small number

这应该给你 ~0.344578258389676 来自 Dirk 的答案。

1赞 Jouni K. Seppänen 9/5/2009 #4

正如其他人所指出的，您可能想要累积分布函数。这可以通过误差函数（按均值平移，按正态分布的标准差缩放）获得，该函数存在于标准数学库中，并且可以通过 Math：：Libm 在 Perl 中访问。

3赞 Eonwe 10/25/2010 #5

如果你真的想要密度函数，为什么不直接使用它：

$pi = 3.141593;
$x = 2.02;
$mean = 2;
$sd = .24;
print 1/($sd * sqrt(2*$pi)) * exp(-($x-$mean)**2 / (2 * $sd**2));

它给出的 1.65649768474891 与 R 中的 dnorm 大致相同。

2赞 Jonathan Ledlie 3/20/2012 #6

我不认为 Jouni 是完全正确的。这似乎给出了一个合理的 PDF 版本（如果您只想要一个特定的 x-y 点，请提取循环的中间部分）：

!/usr/bin/perl

use strict;
use Getopt::Std;
use POSIX qw(ceil floor);

# Usage
# Outputs normal density function given a mean and sd
# -s standard deviation
# -m mean
# -n normalization factor (multiply result by this amount), optional

my %para = ();
getopts('s:m:n:', \%para);
if (!exists ($para{'s'}) || !exists ($para{'m'})) {
   die ("mean and standard deviation required");
}

my $norm = 1.0;
if (exists ($para{'n'})) {
   $norm = $para{'n'};
}

my $sd = $para{'s'};
my $mean = $para{'m'};

my $start = floor($mean - ($sd * 5));
my $end = ceil($mean + ($sd * 5));

my $pi = 3.141593;

my $var = $sd**2;

for (my $x = $start; $x < $end; $x+=0.1) {
    my $e = exp( -1 * (($x-$mean)**2) / (2*$var));
    my $d = sqrt($var) * sqrt(2*$pi);
    my $y = 1.0/$d*$e * $norm;
    printf ("%5.5f %5.5f\n", $x, $y);
}

1赞 Kit 8/27/2013 #7

使用 Perl 的 Statistics：:D istributions，您可以通过以下方式实现此目的：

#!/usr/bin/perl

use strict; use warnings;
use Statistics::Distributions qw(uprob);

my $x       = 0;
my $mean    = 4;
my $stdev   = 10;

print "Height of probablility distribution at point $x = "
    . (1-uprob(($x-$mean)/$stdev))."\n";

“点 0 处的概率分布高度 = 0.34458”的结果

上一个：在数据帧上：在 R 中写入文件和命名绑定向量

下一个：在 R 中，列表中所有数据帧中一列的加权平均值

如何在Perl中计算给定正态分布的点的概率？

How can I compute the probability at a point given a normal distribution in Perl?

评论

评论

评论