DSC 212 — Probability and Statistics for Data Science
January 26, 2023
Lecture 6
6.1 Recap: Convergence in Probability
A sequence of random variables Xn converges in probability to X, written Xn →P X, if for every ε > 0,

limn→∞ P(|Xn − X| > ε) = 0.
Theorem 1 (Weak Law of Large Numbers (WLLN)). If Xi are i.i.d. random variables with mean EXi = µ, then the sample mean

X̄n = (1/n) Σ_{i=1}^n Xi    (6.1)

defines a new sequence of random variables, and X̄n →P µ, where µ is a constant.
Figure 6.1: Illustration of the distribution of X̄n. It concentrates around the mean µ.
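The concentration in Figure 6.1 can be checked with a short simulation (an illustrative sketch, not part of the notes; Uniform(0, 1) draws with µ = 0.5 are an assumed example):

```python
# WLLN sketch (assumed example): sample means of i.i.d. Uniform(0, 1)
# draws concentrate around mu = 0.5 as n grows.
import random

random.seed(0)

def sample_mean(n):
    """Average of n i.i.d. Uniform(0, 1) random variables."""
    return sum(random.random() for _ in range(n)) / n

for n in [10, 1000, 100000]:
    print(n, sample_mean(n))  # deviations from 0.5 shrink as n grows
```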
Note 1. If g: R → R is a measurable function, i.e., g(Xi) is a valid random variable, and Eg(Xi) = ḡ, then we have (1/n) Σ_{i=1}^n g(Xi) →P ḡ.
6.2 Central Limit Theorem (CLT)
Definition 1 (Convergence in Distribution). Let Xn be a sequence of random variables, let Fn be the CDF of Xn, and let F be the CDF of X. Then Xn ⇒ X (Xn converges in distribution to X) if limn→∞ Fn(t) = F(t) for all t at which F is continuous.
Note 2 . We assume nothing about Xi, except the existence of the mean and variance.
Remark 1. If Xn ⇒ X, then for any bounded continuous function g, Eg(Xn) → Eg(X).
Theorem 2 (Central Limit Theorem). If Xi are i.i.d. random variables with EXi = µ and Var(Xi) = σ², define

X̄n = (X1 + X2 + · · · + Xn)/n,    Zn = √n (X̄n − µ)/σ.

Then Zn ⇒ Z ∼ N(0, 1), i.e., Zn converges in distribution to Z, which has the Standard Normal Distribution.
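A quick empirical check of the theorem (an illustrative sketch, not from the lecture; Bernoulli(1/2) with µ = σ = 1/2 is an assumed example):

```python
# CLT sketch: Zn = sqrt(n)(X̄n − µ)/σ for Xi ~ Bernoulli(1/2) should be
# approximately N(0, 1); we compare the empirical CDF of Zn with Φ.
import random
from statistics import NormalDist

random.seed(1)
n, reps = 1000, 2000
mu, sigma = 0.5, 0.5

def z_n():
    s = sum(random.random() < 0.5 for _ in range(n))  # Binomial(n, 1/2) count
    return (s / n - mu) / (sigma / n ** 0.5)

zs = sorted(z_n() for _ in range(reps))
Phi = NormalDist().cdf
# Kolmogorov-type distance between the empirical CDF and Φ.
max_gap = max(abs((i + 1) / reps - Phi(z)) for i, z in enumerate(zs))
print(max_gap)  # small for large n and reps
```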
Remark 2. Hence any statistic of Zn can be approximated by the corresponding statistic of Z:

Eg(Zn) → Eg(Z) = ∫ g(t) (1/√(2π)) exp(−t²/2) dt.
Figure 6.2: Deviations around the mean. We multiply by √n to zoom into Figure 6.1.
With n = 125 programs and mean µ = 5 errors per program (and σ = √5, so that √125 · 0.5/√5 = 2.5):

P(average error per program ≤ 5.5) = P(X̄n ≤ 5.5) = P(√n (X̄n − µ)/σ ≤ √n (5.5 − µ)/σ)
≈ P(Z ≤ √n (5.5 − µ)/σ) = P(Z ≤ √125 (5.5 − 5)/√5)
= FZ(2.5) = ∫_{−∞}^{2.5} (1/√(2π)) exp(−t²/2) dt ≈ 0.9938.
Conclusion 1. With probability 0.9938, there are at most 5.5 errors per program on average.
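The value FZ(2.5) ≈ 0.9938 can be verified with the standard normal CDF from the Python standard library (a sketch; σ = √5 is the value implied by the computation, and `statistics.NormalDist` requires Python 3.8+):

```python
# Verify FZ(2.5) for the program-errors example; n = 125 and mu = 5 are
# from the notes, sigma = sqrt(5) is the value implied by FZ(2.5).
from statistics import NormalDist

n, mu, sigma = 125, 5.0, 5 ** 0.5
z = n ** 0.5 * (5.5 - mu) / sigma  # sqrt(125) * 0.5 / sqrt(5) = 2.5
p = NormalDist().cdf(z)
print(round(z, 6), round(p, 4))  # 2.5 0.9938
```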
Definition 2. Let Dn be the empirical average of n samples. To find a threshold t that Dn exceeds with probability at most 1%, solve

P(√n (Dn − µ)/σ < √n (t − µ)/σ) = 0.99,

which gives t ≥ µ + (σ/√n) · FZ⁻¹(0.99).
Example 2. Toss an unbiased (P(Heads) = 1/2) coin 1000 times.
(1) What is the probability of seeing ≥ 600 Heads?
(2) E(total heads) = 500. Find a t such that

total heads ∈ [500 − t, 500 + t]

with probability 99%, or 99.99%.
Xi ∼ Bernoulli(1/2),    Y = Σ_{i=1}^{1000} Xi ∼ Binomial(1000, 1/2).    (6.2)
P(Y = k) = (1000 choose k) · 2^(−1000)

P({Y ≥ 600}) = Σ_{k=600}^{1000} (1000 choose k) · 2^(−1000)    (unwieldy to compute by hand)
Instead, standardize using µ = 1/2 and σ = 1/2 for Xi ∼ Bernoulli(1/2):

P(Y ≥ 600) = P(Y/1000 ≥ 0.6)    (6.3)
= P(√1000 (Y/1000 − 0.5)/σ ≥ √1000 (0.6 − 0.5)/σ)    (6.4)
= P(√1000 (Y/1000 − 0.5)/σ ≥ 2√10)    (6.5)
≈ P(Z ≥ 2√10)    (6.6)
= 1 − FZ(2√10) ≈ 10^(−10).    (6.7)
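For comparison, the exact binomial tail and the CLT approximation can both be computed numerically (a sketch, not from the notes; `math.comb` requires Python 3.8+):

```python
# Exact Binomial(1000, 1/2) tail P(Y >= 600) versus the CLT value 1 − FZ(2√10).
from math import comb
from statistics import NormalDist

exact = sum(comb(1000, k) for k in range(600, 1001)) / 2 ** 1000
approx = 1 - NormalDist().cdf(2 * 10 ** 0.5)  # 1 − FZ(2√10)
print(exact, approx)  # both on the order of 1e-10
```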
Remark 4. FZ(−t) = 1 − FZ(t) for any PDF symmetric about 0. See Figure 6.3 for an illustration.
The event Y ∈ [500 − t, 500 + t] is the same as −(t/50)√10 ≤ (Y − 500)/(5√10) ≤ (t/50)√10, since σ√1000 = 5√10. Hence

P(Y ∈ [500 − t, 500 + t])    (6.8)
≈ P(Z ∈ [−(t/50)√10, (t/50)√10])    (6.9)
= P(Z ≤ (t/50)√10) − P(Z ≤ −(t/50)√10)    (6.10)
= FZ((t/50)√10) − FZ(−(t/50)√10)    (6.11)
= 2 FZ((t/50)√10) − 1,    (6.12)

using FZ(−(t/50)√10) = 1 − FZ((t/50)√10).    (6.13)
Figure 6.3: Probability of a symmetric distribution around the origin.
Setting 2 FZ((t/50)√10) − 1 = 0.99 or 0.9999 gives

FZ((t/50)√10) = 0.995 or 0.99995,
t = 5√10 · FZ⁻¹(0.995) or t = 5√10 · FZ⁻¹(0.99995).
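The final step can be carried out numerically with the inverse normal CDF from the standard library (a sketch; `statistics.NormalDist` requires Python 3.8+):

```python
# Solve 2 FZ((t/50)√10) − 1 = q for t, i.e. t = 5√10 · FZ⁻¹((1 + q)/2).
from statistics import NormalDist

inv = NormalDist().inv_cdf
for q in (0.99, 0.9999):
    t = 5 * 10 ** 0.5 * inv((1 + q) / 2)
    print(q, round(t, 1))
```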
6.3 Delta Method
Suppose √n (X̄n − µ)/σ ⇒ N(0, 1). Let g be a differentiable function with g′(µ) ≠ 0. Then,

√n (g(X̄n) − g(µ)) / (σ |g′(µ)|) ⇒ N(0, 1).    (6.14)
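A simulation can illustrate the delta method (an illustrative sketch with assumed example choices: Xi ∼ Uniform(0, 1), so µ = 1/2, σ² = 1/12, and g(x) = x² with g′(µ) = 1):

```python
# Delta method sketch: sqrt(n)(g(X̄n) − g(µ)) / (σ |g'(µ)|) should be
# approximately N(0, 1) for large n.
import random
from statistics import mean, stdev

random.seed(2)
n, reps = 2000, 1000
mu, sigma = 0.5, (1 / 12) ** 0.5  # Uniform(0, 1): mean 1/2, variance 1/12
g, dg = (lambda x: x * x), 1.0    # g(x) = x^2, g'(mu) = 2 * mu = 1

def stat():
    xbar = sum(random.random() for _ in range(n)) / n
    return n ** 0.5 * (g(xbar) - g(mu)) / (sigma * abs(dg))

samples = [stat() for _ in range(reps)]
print(mean(samples), stdev(samples))  # close to 0 and 1
```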
2023-02-25