Think about a cricket player’s average runs in 10 matches, you add the scores and divide to get the average. That’s the basic idea of statistics: turning numbers into useful information. From statistics class 9 and statistics class 10 basics to advanced concepts in statistics class 11, the subject helps us understand data better. Whether it’s learning statistics formulas, checking a p-value in statistics, or knowing the father of statistics, this branch of maths is everywhere. In this article, we’ll explore the meaning of statistics, important topics, formulas, and resources to make learning simple.
Statistics is a branch of mathematics dealing with data and its interpretation. Statistics is the science of counting, averages, estimates, and probability. Statistics is of two types, namely, Descriptive Statistics and Inferential Statistics.
Descriptive Statistics is about organizing and summarizing the collected data to understand its meaning and the purpose of it.
Inferential Statistics is about predicting and drawing a conclusion from the organized data.
For example, In a set of marks of a student population, descriptive statistics is finding the average marks, the range of marks from highest to lowest, etc. while inferential statistics is about giving a conclusion like comparing this set of marks with the previous test marks to find the improvement.
The interpretation of data using statistics is mainly based on standard formulas. From mean, median, and mode to variance and probability, these statistics formulas help in analyzing and understanding different types of data.
Any bit of information is called data. For example, the average marks you scored in an exam or the number of goals in a football match are data points. In simple words, statistics meaning is about collecting and organizing such data to draw conclusions.
Data can be classified into two types:
Quantitative Data – Numeric data that can be counted or measured (e.g., height, marks, temperature).
Qualitative Data – Non-numeric data that describes qualities or characteristics (e.g., colors, opinions, feedback).
The collection of data is the first step in statistics. It can be done through:
Surveys – Asking individuals questions to collect information.
Experiments – Collecting data under specific controlled conditions.
Observational Studies – Collecting data by simply observing without interference.
The data once collected must be arranged or organized in a way so that inferences or conclusions can be made out from it.
The following are the ways for representation of data
The frequency of a data point represents the number of times it appears in a dataset. To make data more readable, we often use different types of frequency distributions in statistics, such as ungrouped distribution, ungrouped frequency distribution, and grouped frequency distribution.
1- Ungrouped Distribution
In an ungrouped distribution, all the values are written individually, separated by commas, without organizing them into classes.
Example:
Marks obtained (out of 100) by 30 Class XI students:
$10, 20, 36, 92, 95, 40, 50, 56, 60, 70,$
$92, 88, 80, 70, 72, 70, 36, 40, 36, 40,$
$92, 40, 50, 50, 56, 60, 70, 60, 60, 88$
This is called an ungrouped distribution because the data is simply listed without further organization.
2- Ungrouped Frequency Distribution
Instead of repeating values, we can show how many times each value occurs. For example, in the dataset above, 4 students scored 70 marks, so the frequency of 70 is 4.
This can be shown in a table called an Ungrouped Frequency Distribution:
Marks | Number of Students |
---|---|
10 | 1 |
20 | 1 |
36 | 3 |
40 | 4 |
50 | 3 |
56 | 2 |
60 | 4 |
70 | 4 |
72 | 1 |
80 | 1 |
88 | 2 |
92 | 3 |
95 | 1 |
Total | 30 |
3- Grouped Frequency Distribution
To make large data more manageable, we can arrange it into class intervals (ranges) and record the number of values in each range.
Example:
Class Interval | 10-25 | 25-40 | 40-55 | 55-70 | 70-85 | 85-100 |
---|---|---|---|---|---|---|
Number of Students | 2 | 3 | 7 | 6 | 6 | 6 |
Here, the class width is $15$ (for example, $25 - 10 = 15$, $70 - 55 = 15$). The width can be chosen based on convenience.
It is often convenient to have a single number that represents the whole data. Such a number is called a Measure of Central Tendency. This value usually lies near the middle of the data and helps in understanding the overall pattern. The three most common measures of central tendency are Mean, Median, and Mode.
The mean of given values is the sum of all observations divided by the total number of observations.
If we have $n$ values $x_1, x_2, x_3, \ldots, x_n$, then the mean $\bar{x}$ (read as “x-bar”) is: $\bar{x} = \frac{x_1 + x_2 + \cdots + x_n}{n}$
Example: To calculate the mean marks of 50 students, add the marks of all 50 students and divide by 50.
If $n$ observations are $x_1, x_2, x_3, \ldots, x_n$, then: $\bar{x} = \frac{x_1+x_2+\cdots+x_n}{n} = \frac{1}{n} \sum_{i=1}^n x_i$
If values are $x_1, x_2, x_3, \ldots, x_n$ with respective frequencies $f_1, f_2, f_3, \ldots, f_n$, then: $\bar{x} = \frac{f_1x_1+f_2x_2+f_3x_3+\cdots+f_nx_n}{f_1+f_2+f_3+\cdots+f_n} = \frac{\sum_{i=1}^n f_i x_i}{\sum_{i=1}^n f_i}$
The median is the middle value of data when arranged in ascending or descending order. It divides the dataset into two equal halves.
Example:
Data: $65, 55, 89, 56, 35, 14, 56, 55, 87, 45, 92$
Arranged order: $14, 35, 45, 55, 55, 56, 56, 65, 87, 89, 92$
The middle value is $56$, so the median = 56.
If $n$ is even, the median is the average of the two middle values.
If the number of observations is $n$:
If $n$ is odd: $\text{Median} = \left(\frac{n+1}{2}\right)^{th} \text{ observation}$
If $n$ is even: $\text{Median} = \frac{\text{Value of } \left(\frac{n}{2}\right)^{th} \text{ observation } + \text{Value of } \left(\frac{n}{2}+1\right)^{th} \text{ observation}}{2}$
Example:
Data: $1, 11.5, 6, 7.2, 4, 8, 9, 10, 6.8, 8.3, 2, 2, 10, 1$
Ordered: $1, 1, 2, 2, 4, 6, 6.8, 7.2, 8, 8.3, 9, 10, 10, 11.5$
Here $n = 14$. Median = average of 7th and 8th values = $(6.8 + 7.2)/2 = 7$.
Let total frequency = $N$.
If $N$ is odd: Median = the observation with cumulative frequency $\geq \frac{N+1}{2}$.
If $N$ is even: Median = average of the values corresponding to cumulative frequencies $\geq \frac{N}{2}$ and $\geq \frac{N}{2}+1$.
The mode is the most frequently occurring value in a dataset.
Example: Data: $65, 55, 89, 56, 35, 14, 56, 55, 87, 45, 92, 55$
Here, $55$ occurs most often, so the mode = 55.
Central tendency helps us find a single value that represents the whole data, while measures of dispersion tell us how spread out the data is. Common measures of dispersion include range, mean deviation, variance, standard deviation, and coefficients of variation.
The range is the simplest measure of dispersion in statistics. It represents the difference between the largest and smallest observations in a dataset. It gives a quick idea of how spread out the data is but does not consider the distribution of values in between.
$\text{Range} = x_{\max} - x_{\min}$
Mean deviation measures the average distance of all observations from a central value (mean, median, or any chosen value). It provides a simple understanding of data spread in statistics class 10, 11, or 12.
Ungrouped Data: $\text{M.D.}(a) = \frac{1}{n} \sum_{i=1}^n |x_i - a|$
Mean Deviation about Mean: $\text{M.D.}(\bar{x}) = \frac{1}{n} \sum_{i=1}^n |x_i - \bar{x}|$
Frequency Data: $\text{M.D.}(\bar{x}) = \frac{\sum_{i=1}^n f_i |x_i - \bar{x}|}{\sum_{i=1}^n f_i}$
Variance is a fundamental concept in statistics class 11 and 12 that measures the average of the squares of deviations from the mean. It tells us how far each observation is from the mean and is a key step in calculating standard deviation.
Ungrouped Data: $\sigma^2 = \frac{1}{n} \sum_{i=1}^n (x_i - \bar{x})^2$
Frequency Data: $\sigma^2 = \frac{1}{N} \sum_{i=1}^n f_i (x_i - \bar{x})^2$
Standard deviation is the positive square root of variance and gives a measure of dispersion in the same units as the data. It is widely used in inferential statistics and helps compare the spread of different datasets.
Ungrouped Data: $\sigma = \sqrt{\frac{1}{n} \sum_{i=1}^n (x_i - \bar{x})^2}$
Frequency Data: $\sigma = \sqrt{\frac{1}{N} \sum_{i=1}^n f_i (x_i - \bar{x})^2}$
Coefficients of dispersion are unit-free measures of variability, which allow comparison of the spread between datasets of different scales. They are often used in statistics class 10 and 11 for analyzing data consistency.
Shows relative spread using range.
Formula: $\text{C.R.} = \frac{x_{\max} - x_{\min}}{x_{\max} + x_{\min}}$
Expresses mean deviation relative to the mean for a normalized measure of spread.
Formula: $\text{C.M.D.} = \frac{\text{M.D.}}{\bar{x}}$
Measures the relative standard deviation, giving insight into data variability independent of units.
Formula: $\text{C.S.D.} = \frac{\sigma}{\bar{x}}$
Expresses standard deviation as a percentage of the mean. It is widely preferred in statistics for comparing datasets.
Formula: $\text{C.V.} = \frac{\sigma}{\bar{x}} \times 100, \quad \bar{x} \neq 0$
This section lists all the key formulas used in statistics, from mean, median, and mode to variance, standard deviation, and coefficients of dispersion. These formulas help summarize, analyze, and interpret data efficiently in both Class 11 and 12.
Concept | Definition | Formula |
---|---|---|
Mean (Ungrouped Data) | The average of all observations; represents the central value of the dataset. | $\bar{x} = \frac{x_1 + x_2 + \cdots + x_n}{n} = \frac{1}{n} \sum_{i=1}^n x_i$ |
Mean (Ungrouped Frequency Distribution) | Weighted average where each observation occurs with a certain frequency. | $\bar{x} = \frac{\sum_{i=1}^n f_i x_i}{\sum_{i=1}^n f_i}$ |
Mean (Grouped Frequency Distribution) | Average using class midpoints for grouped data. | $\bar{x} = \frac{\sum_{i=1}^n f_i m_i}{\sum_{i=1}^n f_i}, \quad m_i = \frac{\text{Lower boundary} + \text{Upper boundary}}{2}$ |
Median (Ungrouped Data) | The middle value when data is arranged in order; divides data into two equal halves. | Odd $n$: $\text{Median} = \text{Value at } \frac{n+1}{2}^{th} \text{ position}$ Even $n$: $\text{Median} = \frac{\text{Value at } \frac{n}{2}^{th} + \text{Value at } (\frac{n}{2}+1)^{th}}{2}$ |
Median (Grouped Data) | Middle value estimated from cumulative frequencies in grouped distribution. | $\text{Median} = l + \frac{\frac{N}{2} - cf}{f} \times h$ |
Mode (Grouped Data) | Estimated value within the class having maximum frequency (modal class). | $\text{Mode} = l + \frac{f_1 - f_0}{2f_1 - f_0 - f_2} \times h$ |
Range | Difference between largest and smallest observation; simple measure of spread. | $\text{Range} = x_{\max} - x_{\min}$ |
Mean Deviation about Mean | Average of absolute deviations from the mean; measures spread. | $\text{M.D.}(\bar{x}) = \frac{1}{n} \sum_{i=1}^n$ |
Variance | Average of squared deviations from the mean; shows how far data spreads. | $\sigma^2 = \frac{1}{n} \sum_{i=1}^n (x_i - \bar{x})^2$ For frequency: $\sigma^2 = \frac{1}{N} \sum_{i=1}^n f_i (x_i - \bar{x})^2$ |
Standard Deviation | Positive square root of variance; widely used measure of spread. | $\sigma = \sqrt{\frac{1}{n} \sum_{i=1}^n (x_i - \bar{x})^2}$ Frequency: $\sigma = \sqrt{\frac{1}{N} \sum_{i=1}^n f_i (x_i - \bar{x})^2}$ |
Coefficient of Range | Normalized measure of dispersion independent of units. | $\text{C.R.} = \frac{x_{\max} - x_{\min}}{x_{\max} + x_{\min}}$ |
Coefficient of Mean Deviation | Mean deviation expressed relative to the mean; unit-free. | $\text{C.M.D.} = \frac{\text{M.D.}}{\bar{x}}$ |
Coefficient of Standard Deviation | Standard deviation relative to the mean; useful for comparing datasets. | $\text{C.S.D.} = \frac{\sigma}{\bar{x}}$ |
Coefficient of Variation (C.V.) | Standard deviation expressed as percentage of mean; more consistent than other measures. | $\text{C.V.} = \frac{\sigma}{\bar{x}} \times 100$ |
This section covers the important Statistics topics in Class 11, including data collection, presentation, measures of central tendency, and dispersion as per NCERT and JEE Main syllabus.
Introduction to statistics |
Statistics and Probability |
Measures of Central Tendency (Mean, Median, Mode) |
Measures of Dispersion (Range, Variance, Standard Deviation) |
Coefficient of Dispersion |
Here you’ll find the best books and resources that explain Statistics concepts clearly, with examples and practice problems to strengthen your basics.
Book Title | Author / Publisher | Description |
---|---|---|
NCERT Mathematics Class 11 | NCERT | Official textbook covering Statistics concepts and exercises. |
Mathematics for Class 11 | R.D. Sharma | Detailed explanations and solved problems on Statistics. |
Objective Mathematics | R.S. Aggarwal | Multiple-choice questions and practice on Class 11 Statistics. |
Arihant All-In-One Mathematics | Arihant | Comprehensive approach with solved and practice questions. |
NCERT Class 11 Mathematics book provides detailed explanations, solved examples, and exercises that form the core foundation of Statistics.
NCERT Maths Solutions for Class 11th Chapter 15 - Statistics
NCERT Maths Exemplar Solutions for Class 11th Chapter 15 - Statistics
This section provides subject-specific NCERT materials such as notes, exemplar problems, and detailed solutions across different subjects, helping students strengthen their concepts along with related topics.
This section brings you practice questions designed to test your understanding of formulas and concepts, helping you prepare for both school exams and competitive tests.
Frequently Asked Questions (FAQs)
The two main types are:
Descriptive Statistics: Summarizing and presenting data (mean, median, graphs).
Inferential Statistics: Drawing conclusions and making predictions from data.
Statistics is the branch of mathematics that deals with collecting, organizing, analyzing, and interpreting data to make meaningful conclusions or predictions.
The most common measures are Mean, Median, Mode, Variance, Standard Deviation, and Probability. These help in understanding the spread and central tendency of data.
Data is the raw information (like marks of students), while Statistics is the method of organizing and analyzing that data to find patterns or results.
Mean: Average of all values.
Median: Middle value when data is arranged.
Mode: Value that occurs most often.
p value in statistics is used in distributions like normal distribution to validate the hypothesis of the analysis.