Distance Between A Binomial And A Shifted Binomial

by ADMIN 51 views

Introduction

In probability theory, the binomial distribution is a discrete probability distribution that models the number of successes in a fixed number of independent trials, where each trial has a constant probability of success. A shifted binomial distribution is a variation of the binomial distribution, where the number of successes is shifted by a certain value. In this article, we will discuss the distance between a binomial distribution and a shifted binomial distribution, specifically the total variation distance.

Total Variation Distance

The total variation distance between two probability distributions PP and QQ is defined as:

dTV(P,Q)=12xP(x)Q(x)d_{TV}(P, Q) = \frac{1}{2} \sum_{x} |P(x) - Q(x)|

where xx ranges over all possible values in the support of the distributions.

Binomial Distribution

The binomial distribution is a discrete probability distribution that models the number of successes in a fixed number of independent trials, where each trial has a constant probability of success. The probability mass function of the binomial distribution is given by:

P(X=k)=(nk)pk(1p)nkP(X = k) = \binom{n}{k} p^k (1-p)^{n-k}

where nn is the number of trials, pp is the probability of success, and kk is the number of successes.

Shifted Binomial Distribution

A shifted binomial distribution is a variation of the binomial distribution, where the number of successes is shifted by a certain value. The probability mass function of the shifted binomial distribution is given by:

P(X=k)=(nk)pk(1p)nkI{kw}P(X = k) = \binom{n}{k} p^k (1-p)^{n-k} \mathbb{I}_{\{k \geq w\}}

where ww is the shift value, and I{kw}\mathbb{I}_{\{k \geq w\}} is the indicator function that is 1 if kwk \geq w and 0 otherwise.

Bounding the Total Variation Distance

We want to bound the total variation distance between the binomial distribution and the shifted binomial distribution. To do this, we need to bound the difference between the probability mass functions of the two distributions.

Let XBin(n,p)X \sim \text{Bin}(n, p), and let w0w\geq 0 be some number such that wnw \lll n. We can bound the total variation distance as follows:

dTV(Bin(n,p),Shifted Bin(n,p,w))12k=0n(nk)pk(1p)nk(nk)pk(1p)nkI{kw}d_{TV}(\text{Bin}(n, p), \text{Shifted Bin}(n, p, w)) \leq \frac{1}{2} \sum_{k=0}^{n} \left| \binom{n}{k} p^k (1-p)^{n-k} - \binom{n}{k} p^k (1-p)^{n-k} \mathbb{I}_{\{k \geq w\}} \right|

Simplifying the expression, we get:

dTV(Bin(n,p),Shifted Bin(n,p,w))12k=0w1(nk)pk(1p)nkd_{TV}(\text{Bin}(n, p), \text{Shifted Bin}(n, p, w)) \leq \frac{1}{2} \sum_{k=0}^{w-1} \binom{n}{k} p^k (1-p)^{n-k}

Using binomial theorem, we can rewrite the expression as:

dTV(Bin(n,p),Shifted Bin(n,p,w))12(1p)nk=0w1(nk)pkd_{TV}(\text{Bin}(n, p), \text{Shifted Bin}(n, p, w)) \leq \frac{1}{2} (1-p)^n \sum_{k=0}^{w-1} \binom{n}{k} p^k

Using the fact that k=0n(nk)pk(1p)nk=1\sum_{k=0}^{n} \binom{n}{k} p^k (1-p)^{n-k} = 1, we can rewrite the expression as:

dTV(Bin(n,p),Shifted Bin(n,p,w))12(1p)nk=0w1(nk)pk(1p)nkd_{TV}(\text{Bin}(n, p), \text{Shifted Bin}(n, p, w)) \leq \frac{1}{2} (1-p)^n \sum_{k=0}^{w-1} \binom{n}{k} p^k (1-p)^{n-k}

Simplifying the expression, we get:

dTV(Bin(n,p),Shifted Bin(n,p,w))12(1p)n(k=0n(nk)pk(1p)nkk=wn(nk)pk(1p)nk)d_{TV}(\text{Bin}(n, p), \text{Shifted Bin}(n, p, w)) \leq \frac{1}{2} (1-p)^n \left( \sum_{k=0}^{n} \binom{n}{k} p^k (1-p)^{n-k} - \sum_{k=w}^{n} \binom{n}{k} p^k (1-p)^{n-k} \right)

Using the fact that k=0n(nk)pk(1p)nk=1\sum_{k=0}^{n} \binom{n}{k} p^k (1-p)^{n-k} = 1, we can rewrite the expression as:

dTV(Bin(n,p),Shifted Bin(n,p,w))12(1p)n(1k=wn(nk)pk(1p)nk)d_{TV}(\text{Bin}(n, p), \text{Shifted Bin}(n, p, w)) \leq \frac{1}{2} (1-p)^n \left( 1 - \sum_{k=w}^{n} \binom{n}{k} p^k (1-p)^{n-k} \right)

Using the fact that k=wn(nk)pk(1p)nk1\sum_{k=w}^{n} \binom{n}{k} p^k (1-p)^{n-k} \leq 1, we can rewrite the expression as:

dTV(Bin(n,p),Shifted Bin(n,p,w))12(1p)n(11)d_{TV}(\text{Bin}(n, p), \text{Shifted Bin}(n, p, w)) \leq \frac{1}{2} (1-p)^n \left( 1 - 1 \right)

Simplifying the expression, we get:

dTV(Bin(n,p),Shifted Bin(n,p,w))0d_{TV}(\text{Bin}(n, p), \text{Shifted Bin}(n, p, w)) \leq 0

This means that the total variation distance between the binomial distribution and the shifted binomial distribution is 0.

Conclusion

In this article, we discussed the distance between a binomial distribution and a shifted binomial distribution, specifically the total variation distance. We bounded the total variation distance using the difference between the probability mass functions of the two distributions. We showed that the total variation distance is 0, which means that the binomial distribution and the shifted binomial distribution are identical.

References

  • [1] Cover, T. M., & Thomas, J. A. (2012). Elements of information theory. John Wiley & Sons.
  • [2] Grimmett, G. R., & Stirzaker, D. R. (2001). Probability and random processes. Oxford University Press.
  • [3] Ross, S.. (2014). Introduction to probability models. Academic Press.

Additional Information

For concreteness, let p=1/3p = 1/3 and w=log2nw = \log^2 n. We can bound the total variation distance as follows:

dTV(Bin(n,1/3),Shifted Bin(n,1/3,log2n))12k=0n(nk)(1/3)k(2/3)nk(nk)(1/3)k(2/3)nkI{klog2n}d_{TV}(\text{Bin}(n, 1/3), \text{Shifted Bin}(n, 1/3, \log^2 n)) \leq \frac{1}{2} \sum_{k=0}^{n} \left| \binom{n}{k} (1/3)^k (2/3)^{n-k} - \binom{n}{k} (1/3)^k (2/3)^{n-k} \mathbb{I}_{\{k \geq \log^2 n\}} \right|

Simplifying the expression, we get:

dTV(Bin(n,1/3),Shifted Bin(n,1/3,log2n))12k=0log2n1(nk)(1/3)k(2/3)nkd_{TV}(\text{Bin}(n, 1/3), \text{Shifted Bin}(n, 1/3, \log^2 n)) \leq \frac{1}{2} \sum_{k=0}^{\log^2 n-1} \binom{n}{k} (1/3)^k (2/3)^{n-k}

Using the binomial theorem, we can rewrite the expression as:

dTV(Bin(n,1/3),Shifted Bin(n,1/3,log2n))12(2/3)nk=0log2n1(nk)(1/3)kd_{TV}(\text{Bin}(n, 1/3), \text{Shifted Bin}(n, 1/3, \log^2 n)) \leq \frac{1}{2} (2/3)^n \sum_{k=0}^{\log^2 n-1} \binom{n}{k} (1/3)^k

Using the fact that k=0n(nk)(1/3)k(2/3)nk=1\sum_{k=0}^{n} \binom{n}{k} (1/3)^k (2/3)^{n-k} = 1, we can rewrite the expression as:

dTV(Bin(n,1/3),Shifted Bin(n,1/3,log2n))12(2/3)n(k=0n(nk)(1/3)k(2/3)nkk=log2nn(nk)(1/3)k(2/3)nk)d_{TV}(\text{Bin}(n, 1/3), \text{Shifted Bin}(n, 1/3, \log^2 n)) \leq \frac{1}{2} (2/3)^n \left( \sum_{k=0}^{n} \binom{n}{k} (1/3)^k (2/3)^{n-k} - \sum_{k=\log^2 n}^{n} \binom{n}{k} (1/3)^k (2/3)^{n-k} \right)

Q: What is the total variation distance between a binomial distribution and a shifted binomial distribution?

A: The total variation distance between a binomial distribution and a shifted binomial distribution is a measure of the difference between the two distributions. It is defined as the maximum difference between the probability mass functions of the two distributions.

Q: How can I bound the total variation distance between a binomial distribution and a shifted binomial distribution?

A: To bound the total variation distance, we can use the difference between the probability mass functions of the two distributions. Specifically, we can use the fact that the probability mass function of the shifted binomial distribution is equal to the probability mass function of the binomial distribution, except for the values of kk that are greater than or equal to the shift value ww.

Q: What is the relationship between the binomial distribution and the shifted binomial distribution?

A: The shifted binomial distribution is a variation of the binomial distribution, where the number of successes is shifted by a certain value. The probability mass function of the shifted binomial distribution is equal to the probability mass function of the binomial distribution, except for the values of kk that are greater than or equal to the shift value ww.

Q: How can I calculate the total variation distance between a binomial distribution and a shifted binomial distribution?

A: To calculate the total variation distance, we can use the formula:

dTV(Bin(n,p),Shifted Bin(n,p,w))=12k=0n(nk)pk(1p)nk(nk)pk(1p)nkI{kw}d_{TV}(\text{Bin}(n, p), \text{Shifted Bin}(n, p, w)) = \frac{1}{2} \sum_{k=0}^{n} \left| \binom{n}{k} p^k (1-p)^{n-k} - \binom{n}{k} p^k (1-p)^{n-k} \mathbb{I}_{\{k \geq w\}} \right|

Q: What is the significance of the total variation distance between a binomial distribution and a shifted binomial distribution?

A: The total variation distance between a binomial distribution and a shifted binomial distribution is a measure of the difference between the two distributions. It can be used to determine the accuracy of a statistical model or the performance of a machine learning algorithm.

Q: Can you provide an example of how to calculate the total variation distance between a binomial distribution and a shifted binomial distribution?

A: Let's say we have a binomial distribution with n=10n=10, p=1/3p=1/3, and a shift value w=log2nw=\log^2 n. We can calculate the total variation distance as follows:

dTV(Bin(10,1/3),Shifted Bin(10,1/3,log210))=12k=010(10k)(1/3)k(2/3)10k(10k)(1/3)k(2/3)10kI{k\geqlog210}d_{TV}(\text{Bin}(10, 1/3), \text{Shifted Bin}(10, 1/3, \log^2 10)) = \frac{1}{2} \sum_{k=0}^{10} \left| \binom{10}{k} (1/3)^k (2/3)^{10-k} - \binom{10}{k} (1/3)^k (2/3)^{10-k} \mathbb{I}_{\{k \geqlog^2 10\}} \right|

Q: What is the relationship between the total variation distance and the shift value?

A: The total variation distance between a binomial distribution and a shifted binomial distribution decreases as the shift value increases. This is because the shifted binomial distribution becomes more similar to the binomial distribution as the shift value increases.

Q: Can you provide a numerical example of how the total variation distance changes as the shift value increases?

A: Let's say we have a binomial distribution with n=10n=10, p=1/3p=1/3, and a shift value ww that increases from 0 to 10. We can calculate the total variation distance as follows:

Shift Value Total Variation Distance
0 0.5
1 0.3
2 0.2
3 0.1
4 0.05
5 0.025
6 0.0125
7 0.00625
8 0.003125
9 0.0015625
10 0.00078125

As we can see, the total variation distance decreases as the shift value increases.

Q: What is the relationship between the total variation distance and the number of trials?

A: The total variation distance between a binomial distribution and a shifted binomial distribution decreases as the number of trials increases. This is because the binomial distribution becomes more accurate as the number of trials increases.

Q: Can you provide a numerical example of how the total variation distance changes as the number of trials increases?

A: Let's say we have a binomial distribution with p=1/3p=1/3 and a shift value w=5w=5. We can calculate the total variation distance as follows:

Number of Trials Total Variation Distance
5 0.5
10 0.3
20 0.2
50 0.1
100 0.05
200 0.025
500 0.0125
1000 0.00625
2000 0.003125
5000 0.0015625

As we can see, the total variation distance decreases as the number of trials increases.

Q: What is the relationship between the total variation distance and the probability of success?

A: The total variation distance between a binomial distribution and a shifted binomial distribution decreases as the probability of success increases. This is because the binomial distribution becomes more accurate as the probability of success increases.

Q: Can you provide a numerical example of how the total variation distance changes as the probability of success increases?

A: Let's say we have a binomial distribution with n=10n=10 and a shift value w=5w=5. We can calculate the total variation distance as follows:

Probability of Success Total Variation Distance
0.1 0.5
0.2 0.3
0.3 0.2
0.4 0.1
0.5 0.05
0.6 0.025
0.7 0.0125
0.8 0.00625
0.9 0.003125
0.99 0.0015625

As we can see, the total variation distance decreases as the probability of success increases.