Python中生成随机数

时间 2019-11-12

原文原文链接

目录html

今天在一个公众号上看到了一篇有关Python基础的文章，其中提到了Numpy模块中有关生成随机数的使用；这才联想到，本身对这一块也不熟悉，每次想要验证某个函数的功能时，老是有些机械的去手动输入一些数据，显得有些low。所以，总结这篇文章供本身学习，若是也有帮到你，那也是很好的。python

在此声明，本文中部分观点或内容来源于其余博文，会在文末给出相应的引用连接，文中再也不具体说明来源何处。数组

本文主要介绍：app

random模块
numpy.random的使用

# 版本信息
Python 3.6.8
Numpy 1.14.5

开始写的时候才发现，其中又涉及到Python中自带的random模块，查看源码后发现，这TM工程量很庞大啊；然而本着学习的态度，就是干！！！dom

写到后面发现，这仅仅是本人学习过程当中做的笔记，并不必定可以给人很好的阅读感觉；若是你也想学习Python中有关随机数的生成，强烈推荐去阅读源码以及文档；函数

random:学习

源码：import randomflex

文档：文档-python3.6-random网站

Numpy.random:spa

源码：from numpy import random

文档：文档-v1.14-numpy.random

1. random模块

1.1 设置随机种子

下面介绍如何设置随机种子、获取当前随机数环境状态、设置当前随机数环境状态；

1.1.1 设置随机种子

import random
random.seed(5)  # 设置随机数，用于保证每次随机生成获得的数据是一致的

1.1.2 获取当前随机数环境状态

import random 

state = random.getstate()  # 获取当前随机数环境状态

1.1.3 设置当前随机数环境状态

import random

state = random.getstate()  # 获取当前随机数环境状态，用于下次使用

random.setstate(state)  # 设置当前随机数环境状态

示例：

import random

def example_1():
    num1 = random.random()  # 获得一个随机数
    num2 = random.random()  # 获得一个随机数
    print("example_1: ", num1)
    print("example_1: ", num2)
    # 能够发现，每次执行该部分代码，num1, num2这两个随机数老是和以前生成的不同

def example_2():
    random.seed(5) # 设置随机种子，能够获得相同的随机数序列
    num3 = random.random() 
    num4 = random.random()
    print("example_2: ", num3)
    print("example_2: ", num4)  # num3与num4这两个随机数不相同；可是执行屡次后能够发现，这个值老是固定不变的，这就可以理解random.seed()的做用了。
    
    state = random.getstate()  # 获取当前随机数的状态，并返回；也就是说当前随机数位于num4
    return state

def example_3(state):
    num5 = random.random()  # 因为example_2中已经设置随机种子；此时会按照随机数序列继续返回随机数
    print("example_3: ", num5) # 新的随机数
    random.setstate(state)  # 设置随机数环境状态
    num6 = random.random()  # 会发现num2=num3，这是由于将随机数环境状态重置到“未生成随机数num5”前的状态。
    print("example_3: ", num6)
    num7 = random.random()
    print("example_3: ", num7)  # 会获得新的随机数

if __name__ == "__main__":
    example_1()
    state = example_2()
    example_3(state)

example_1:  0.3469160199002712
example_1:  0.9869904001422904
example_2:  0.6229016948897019
example_2:  0.7417869892607294
example_3:  0.7951935655656966
example_3:  0.7951935655656966
example_3:  0.9424502837770503

1.2 random模块中的方法

# 这是有关Python中自带的random模块功能简介
# 
"""Random variable generators.

    integers
    --------
           uniform within range

    sequences
    ---------
           pick random element
           pick random sample
           pick weighted random sample
           generate random permutation

    distributions on the real line:
    ------------------------------
           uniform
           triangular
           normal (Gaussian)
           lognormal
           negative exponential
           gamma
           beta
           pareto
           Weibull

    distributions on the circle (angles 0 to 2pi)
    ---------------------------------------------
           circular uniform
           von Mises
```

1.3 使用：生成整形随机数

下面将介绍两个方法：randrange()、randint()

random.randrange(start, stop, step)

返回指定范围[start, stop)内，按照指定步长step递增的一个随机数；

示例：

import random

random.seed(5)  # 保证你和我执行的结果是一致的
number = random.randrange(0, 100, 2)
print(number)

random.randint(a, b)

返回指定范围[a, b]内的一个随机整数；

random.randint(self, a, b):
    return self.randrange(a, b+1)

# 其实调用的仍是random.randrange()方法，默认步长为1

示例：

import random

random.seed(5)
number = random.randint(0, 10)
print(number)

1.3 使用：生成序列随机数

都是给定序列，从序列中随机选取；

下面将介绍四种方法：choice()、shuffle()、sample()、choices()

random.choice(self, seq)

从给定序列seq中返回一个随机值，seq能够是列表、元组、字符串；

示例：

import random

random.seed(5) 

result1 = random.choice([1, 2, 3.2, 5, 9])
result2 = random.choice('A String')

print(result1)
print(result2)

9
r

random.shuffle(self, x, random=None)

对x进行乱序；

示例：

import random

random.seed(5)
a = [1, 2, 3, 5, 9]
random.shuffle(a)
print(a)

[1, 2, 5, 3, 9]

random.sample(self, population, k)

从序列population中随机取出k个数；population的类型能够是列表、元组、集合、字符串；

注意：不放回随机抽样，k要小于population的长度；

示例：

import random

random.seed(5)
a = [0, 1, 2, 3, 4, 5, 6, 7, 8, 9]
b = (0, 1, 2, 3, 4, 5, 6, 7, 8, 9)
c = "Hi random"
d = set([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])
print(random.sample(a, 6))
print(random.sample(b, 6))
print(random.sample(c, 6))
print(random.sample(d, 6))

[9, 4, 5, 6, 7, 8]
[0, 7, 3, 5, 9, 1]
['i', 'n', 'r', 'm', 'd', 'H']
[9, 3, 0, 5, 1, 8]

random.choices(self, population, weights=None, *, cum_weights=None, k=1):

从population中有放回的随机取k个数据；population的类型能够是列表、元组、字符串；

weights表示population中各元素的相对权重；

cum_weights表示从前日后元素的累计权重；

注意：不限制k的大小；

示例1：population的类型能够是列表、元组、字符串

random.seed(5)
a = [0, 1, 2, 3, 4, 5, 6, 7, 8, 9]
b = (0, 1, 2, 3, 4, 5, 6, 7, 8, 9)
c = "Hi random"

print(random.choices(a, k=3))
print(random.choices(b, k=3))
print(random.choices(c, k=3))

[6, 7, 7]
[9, 7, 9]
['H', 'a', 'm']

示例2：相对权重weights的使用

若是指定了权重序列，则会根据权重序列进行选择；若是给定相对权重序列，则最终会转化成累计权重序列；

也就是说对应元素权重越大，越有可能被选中；

import random

random.seed(5)
a = "random"
weights = [5, 1, 15, 2, 3, 2]
b = random.choices(a, weights=weights, k=1)
print(b)

['n']

1.4 使用：生成随机实值分布

random.random()  # 返回一个[0, 1)范围内的浮点数

1.4.1 均匀分布

random.uniform(self, a, b):

返回给定范围[a, b]或[a, b)内的一个浮点数；

计算方式：

a + (b-a) * self.random()

示例：

import random

random.seed(5)
number = random.uniform(2, 10)
print(number)

6.983213559117615

1.4.2 三角分布

random.triangular(self, low=0.0, high=1.0, mode=None):

返回给定范围[low, high]内的随机浮点数；mode默认为边界的中点，给出对称分布；

维基百科-Triangular distribution

不举示例了，由于只是出来一个在指定范围内的随机数；可是想要弄明白这个随机数是怎么计算的，须要源码；

1.4.3 正态分布

random.normalvariate(self, mu, sigma):

Args:
    mu: # 表示均值
    sigma: # 表示标准差

1.4.4 对数正太分布

random.lognormvariate(self, mu, sigma):
    return _exp(self.normalvariate(mu, sigma)) # 使用的仍是正态分布

Args:
    mu: # 均值
    sigma: # 标准差

1.4.5 指数分布

random.expovariate(self, lambd):
    return -_log(1.0 - self.random())/lambd

该方法返回一个随机数，值的大小与lambda有关；

假如咱们想要获得1000个样本，并且这1000个样本的平均值大体为：10，则须要设定lambda=1/10=0.1；

示例：

import random

random.seed(5)
a = []
for i in range(1000):
    number = random.expovariate(0.1)
    a.append(number)
print(sum(a)/len(a))

9.810064281255109 # 最后获得1000个输的平均值为10

1.4.6 von Mises 分布

random.vonmisesvariate(self, mu, kappa):

Args:
    mu: # mean angle, 以弧度表示，介于(0, 2*pi)之间
    kapp: # 浓度参数，大于或等于0；若是等于0，(0, 2*pi)的均匀随机角度

1.4.7 gamma分布

random.gammavariate(self, alpha, beta):
    
Args:
    alpha: # 大于0
    beta: # 大于0
# mean is alpha*beta, variance is alpha*beta**2

计算方式：

x ** (alpha - 1) * math.exp(-x / beta)
          pdf(x) =  --------------------------------------
                      math.gamma(alpha) * beta ** alpha

1.4.8 高斯分布

random.gauss(self, mu, sigma):
    
Args:
    mu: # mean
    sigma: # standard deviation
        
# 速度快于方法：random.normalvariate(self, mu, sigma):

1.4.9 贝塔分布

random.betavariate(self, alpha, beta):
    
Args: 
    alpha: # 大于0
    beta: # 大于0

Return:
    # 介于0,1之间

1.4.10 Pareto 分布

random.paretovariate(self, alpha):
    u = 1.0 - self.random()
    return 1.0 / u ** (1.0/alpha)

1.4.11 Weibull 分布

random.weibullvariate(self, alpha, beta):
    u = 1.0 - self.random()
    return alpha * (-_log(u)) ** (1.0/beta)

花了一上午的时间，终于将python自带的random模块大体搞明白了一些；然而，写到后面才发现，在平时的使用当中，根本用不了这么多，或许经常使用的如：random(), randrange(),randint(),choice(),shuffle(),sample()，后面的“分布”，可能不多用到；

random模块基本上完事了，后续不知道还会不会有再次补充、修改的机会；

接下来，继续学习Numpy模块中的random吧，但愿能够顺利些；

2. numpy.random

学习的过程按照numpy.random源码中的顺序来学习，参考文档-python3.6-numpy.random；

分红六类：

Utility functions：实用方法
Compatibility functions：兼容方法
Univariate distributions：单变量分布
Multivariate distributions：多变量分布
Standard distributions：标准分发
Internal functions：内部功能

========================
Random Number Generation
========================

==================== =========================================================
Utility functions
==============================================================================
random               Uniformly distributed values of a given shape.
bytes                Uniformly distributed random bytes.
random_integers      Uniformly distributed integers in a given range.
random_sample        Uniformly distributed floats in a given range.
random               Alias for random_sample
ranf                 Alias for random_sample
sample               Alias for random_sample
choice               Generate a weighted random sample from a given array-like
permutation          Randomly permute a sequence / generate a random sequence.
shuffle              Randomly permute a sequence in place.
seed                 Seed the random number generator.
==================== =========================================================

==================== =========================================================
Compatibility functions
==============================================================================
rand                 Uniformly distributed values.
randn                Normally distributed values.
ranf                 Uniformly distributed floating point numbers.
randint              Uniformly distributed integers in a given range.
==================== =========================================================

==================== =========================================================
Univariate distributions
==============================================================================
beta                 Beta distribution over ``[0, 1]``.
binomial             Binomial distribution.
chisquare            :math:`\\chi^2` distribution.
exponential          Exponential distribution.
f                    F (Fisher-Snedecor) distribution.
gamma                Gamma distribution.
geometric            Geometric distribution.
gumbel               Gumbel distribution.
hypergeometric       Hypergeometric distribution.
laplace              Laplace distribution.
logistic             Logistic distribution.
lognormal            Log-normal distribution.
logseries            Logarithmic series distribution.
negative_binomial    Negative binomial distribution.
noncentral_chisquare Non-central chi-square distribution.
noncentral_f         Non-central F distribution.
normal               Normal / Gaussian distribution.
pareto               Pareto distribution.
poisson              Poisson distribution.
power                Power distribution.
rayleigh             Rayleigh distribution.
triangular           Triangular distribution.
uniform              Uniform distribution.
vonmises             Von Mises circular distribution.
wald                 Wald (inverse Gaussian) distribution.
weibull              Weibull distribution.
zipf                 Zipf's distribution over ranked data.
==================== =========================================================

==================== =========================================================
Multivariate distributions
==============================================================================
dirichlet            Multivariate generalization of Beta distribution.
multinomial          Multivariate generalization of the binomial distribution.
multivariate_normal  Multivariate generalization of the normal distribution.
==================== =========================================================

==================== =========================================================
Standard distributions
==============================================================================
standard_cauchy      Standard Cauchy-Lorentz distribution.
standard_exponential Standard exponential distribution.
standard_gamma       Standard Gamma distribution.
standard_normal      Standard normal distribution.
standard_t           Standard Student's t-distribution.
==================== =========================================================

==================== =========================================================
Internal functions
==============================================================================
get_state            Get tuple representing internal state of generator.
set_state            Set state of generator.
==================== =========================================================

2.1 Utility functions：实用方法

numpy.random.random_sample(size=None)

返回范围[0.0, 1.0)的指定大小的随机数或随机数组；

Args:
    size: int 或 tuple of ints, 可选的
Returns:
    out: float 或 ndarray of floats

示例：

import numpy as np
np.random.seed(5)  # 设置随机数种子
a = np.random.random_sample(size=(1,))
b = np.random.random_sample(size=(2, 2))
print(a)
print(b)

[0.22199317]
[[0.87073231 0.20671916]
 [0.91861091 0.48841119]]

numpy.random.bytes(length)

返回指定长度的随机字节；

Args:
    length: int, Number of random bytes.
Return:
    out: str, String of length.

示例：

import numpy as np
np.random.seed(5)
a = np.random.bytes(length=10)
print(a)

b'c\x8b\xd48\xceH \x0e\xefO'

numpy.random.choice(a, size=None, replace=True, p=None)

从给定的一维数组中生成随机样本；

Args:
    a: 1-D array-like or int
        # 若是是1-D数组，则随机返回其中一个元素；
        # 若是是int，则随机返回range(a)中的一个元素；
    size: int or tuple of ints, optional
        # 输出维度大小，(m,n,k)
        # 若是不指定，则默认为(1,)
    replace: boolean, optional
        # 是否重复采样，默认重复采样
    p: 1-D array-like, optional
        # 与a中元素对应的几率，默认均匀分布
Return:
    samples: single item or ndarray
        # 生成的随机样本

示例：

import numpy as np
np.random.seed(5)
aa_milne_arr = ['pooh', 'rabbit', 'piglet', 'Christopher']
a = np.random.choice(aa_milne_arr, size=5, p=[0.5, 0.1, 0.1, 0.3])
print(a)

['pooh' 'Christopher' 'pooh' 'Christopher' 'pooh']

numpy.random.permutation(x)

对指定序列进行随机置换，返回新的序列；

Args:
    x: int or array_like
        # 若是x是int，则会根据range(x)返回一个随机序列；
        # 若是x是1-D数组，则会返回随机置换后的序列；
        # 若是x是multi-D数组，则会对行进行随机置换；

示例：

# 示例1
import numpy as np
np.random.seed(5)
seq = np.random.permutation(10)
print(seq)

[9 5 2 4 7 1 0 8 6 3]

# 示例2
import numpy as np
np.random.seed(5)
seq = np.random.permutation([1, 4, 9, 12, 15])
print(seq)

[15  1  4  9 12]

# 示例3
import numpy as np
np.random.seed(5)
arr = np.arange(9).reshape((3,3))
seq = np.random.permutation(arr)
print(seq)

[[0 1 2]
 [3 4 5]
 [6 7 8]]

numpy.random.shuffle(x)

对给定数组进行置换，无返回值；

Args:
    x: array_like
        # 须要置换的数组或列表
Return:
    None
        # 无返回值

示例：

# 示例1
import numpy as np
np.random.seed(5)
arr = np.arange(10)
np.random.shuffle(arr)
print(arr)

[9 5 2 4 7 1 0 8 6 3]

# 示例2
import numpy as np
np.random.seed(5)
arr = np.arange(9).reshape((3,3))
np.random.shuffle(arr)
print(arr)

[[0 1 2]
 [3 4 5]
 [6 7 8]]

2.2 Compatibility functions：兼容方法

numpy.random.rand(d0, d1, ..., dn)

从均匀分布中返回指定维度的随机样本

相似于numpy.random.random_sample()，后者的输出是一个元组；

numpy.random.randn(d0, d1, ..., dn)

从标准正态分布中返回指定维度的随机样本；

相似于numpy.random.standard_normal()，后者的输出是一个元组；

Args:
    d0, d1, ..., dn: int, optional
        # 返回随机数组的维度
Return:
    Z: ndarrau or float
        # 维度大小(d0, d1, ..., dn), 数据来自于标准正态分布

想要从\(N(\mu, \sigma^2)\)获得样本，使用：

sigma * np.random.randn(...) + mu

示例：

import numpy as np
np.random.seed(5)
num = np.random.randn()
arr = 2.5 * np.random.randn(2, 4) + 3  # 返回的数组知足N(3, 6.25)
print(num)
print(arr)

0.44122748688504143
[[2.17282462 9.07692797 2.36976968 3.2740246 ]
 [6.95620279 0.72691899 1.52090836 3.46900806]]

未完待续......

博主我的网站：https://chenzhen.online

Python中生成随机数

1. random模块

1.1 设置随机种子

1.1.1 设置随机种子

1.1.2 获取当前随机数环境状态

1.1.3 设置当前随机数环境状态

1.2 random模块中的方法

1.3 使用：生成整形随机数

random.randrange(start, stop, step)

random.randint(a, b)

1.3 使用：生成序列随机数

random.choice(self, seq)

random.shuffle(self, x, random=None)

random.sample(self, population, k)

random.choices(self, population, weights=None, *, cum_weights=None, k=1):

1.4 使用：生成随机实值分布

1.4.1 均匀分布

1.4.2 三角分布

1.4.3 正态分布

1.4.4 对数正太分布

1.4.5 指数分布

1.4.6 von Mises 分布

1.4.7 gamma分布

1.4.8 高斯分布

1.4.9 贝塔分布

1.4.10 Pareto 分布

1.4.11 Weibull 分布

2. numpy.random

2.1 Utility functions：实用方法

numpy.random.random_sample(size=None)

numpy.random.bytes(length)

numpy.random.choice(a, size=None, replace=True, p=None)

numpy.random.permutation(x)

numpy.random.shuffle(x)

2.2 Compatibility functions：兼容方法

numpy.random.rand(d0, d1, ..., dn)

numpy.random.randn(d0, d1, ..., dn)

Reference