All Flashcards
What is the formula for the mean of the sampling distribution of the difference between two means?
μ(x̄1 - x̄2) = μ1 - μ2
What is the formula for the standard deviation of the sampling distribution of the difference between two means?
How do you calculate the standard deviation of the sampling distribution when population standard deviations are unknown?
Estimate using sample standard deviations:
What is the formula to calculate the Z-score for the difference in sample means?
.
What is the formula to calculate the degrees of freedom for the t-distribution when comparing two independent means?
The degrees of freedom can be approximated using Welch's t-test formula, which is complex and often calculated by statistical software. A conservative approach is to use the smaller of n1-1 and n2-1.
Explain the concept of the sampling distribution of the difference between two means.
It represents the distribution of differences in sample means (x̄1 - x̄2) from repeated samples. It allows us to make inferences about the true difference in population means (μ1 - μ2).
Explain the importance of the Central Limit Theorem (CLT) in the context of the difference between two means.
The CLT allows us to assume the sampling distribution of the difference in sample means is approximately normal when sample sizes are large (n ≥ 30 for both samples), even if the populations are not normally distributed.
Explain why variances are added when calculating the standard deviation of the difference between two means.
Variances are added because they represent the squared deviations from the mean, and adding them combines the variability from both distributions. Taking the square root then returns the combined standard deviation.
Explain the impact of sample size on the standard deviation of the sampling distribution.
Larger sample sizes decrease the standard deviation of the sampling distribution, leading to more precise estimates of the true difference in population means.
Explain the importance of checking conditions (normality or large sample size) before performing inference on two means.
Checking conditions ensures that the sampling distribution is approximately normal, which is a requirement for using normal-based statistical tests. Failing to check can lead to invalid conclusions.
Define sampling distribution of the difference between two means.
The distribution of all possible differences in sample means (x̄1 - x̄2) that we would get if we repeated the study many times.
What is the Central Limit Theorem (CLT)?
If sample sizes are large enough (typically n ≥ 30 for both samples), then the sampling distribution of the difference in sample means will be approximately normal, regardless of the shape of the original population distributions.
Define population mean.
The average value of a variable in the entire population.
Define population standard deviation.
A measure of the spread or variability of data points around the population mean.
Define sample mean.
The average value of a variable calculated from a sample of the population.