Statistics: Definitions, Types, Formulas, Applications & Importance

Statistics: Definitions, Types, Formulas, Applications & Importance

Komal MiglaniUpdated on 18 Sep 2025, 05:42 PM IST

Think about a cricket player’s average runs in 10 matches, you add the scores and divide to get the average. That’s the basic idea of statistics: turning numbers into useful information. From statistics class 9 and statistics class 10 basics to advanced concepts in statistics class 11, the subject helps us understand data better. Whether it’s learning statistics formulas, checking a p-value in statistics, or knowing the father of statistics, this branch of maths is everywhere. In this article, we’ll explore the meaning of statistics, important topics, formulas, and resources to make learning simple.

What is Statistics?

Statistics is a branch of mathematics dealing with data and its interpretation. Statistics is the science of counting, averages, estimates, and probability. Statistics is of two types, namely, Descriptive Statistics and Inferential Statistics.

Descriptive Statistics

Descriptive Statistics is about organizing and summarizing the collected data to understand its meaning and the purpose of it.

Inferential Statistics

Inferential Statistics is about predicting and drawing a conclusion from the organized data.

For example, In a set of marks of a student population, descriptive statistics is finding the average marks, the range of marks from highest to lowest, etc. while inferential statistics is about giving a conclusion like comparing this set of marks with the previous test marks to find the improvement.

The interpretation of data using statistics is mainly based on standard formulas. From mean, median, and mode to variance and probability, these statistics formulas help in analyzing and understanding different types of data.

What is Data?

Any bit of information is called data. For example, the average marks you scored in an exam or the number of goals in a football match are data points. In simple words, statistics meaning is about collecting and organizing such data to draw conclusions.

Types of Data

Data can be classified into two types:

  1. Quantitative Data – Numeric data that can be counted or measured (e.g., height, marks, temperature).

  2. Qualitative Data – Non-numeric data that describes qualities or characteristics (e.g., colors, opinions, feedback).

Methods of Data Collection

The collection of data is the first step in statistics. It can be done through:

  1. Surveys – Asking individuals questions to collect information.

  2. Experiments – Collecting data under specific controlled conditions.

  3. Observational Studies – Collecting data by simply observing without interference.

Representation of Data

The data once collected must be arranged or organized in a way so that inferences or conclusions can be made out from it.

The following are the ways for representation of data

  1. Ungrouped distribution
  2. Ungrouped frequency distribution
  3. Grouped frequency distribution
JEE Main Highest Scoring Chapters & Topics
Focus on high-weightage topics with this eBook and prepare smarter. Gain accuracy, speed, and a better chance at scoring higher.
Download E-book

Frequency Distribution in Statistics

The frequency of a data point represents the number of times it appears in a dataset. To make data more readable, we often use different types of frequency distributions in statistics, such as ungrouped distribution, ungrouped frequency distribution, and grouped frequency distribution.

1- Ungrouped Distribution

In an ungrouped distribution, all the values are written individually, separated by commas, without organizing them into classes.

Example:
Marks obtained (out of 100) by 30 Class XI students:

$10, 20, 36, 92, 95, 40, 50, 56, 60, 70,$
$92, 88, 80, 70, 72, 70, 36, 40, 36, 40,$
$92, 40, 50, 50, 56, 60, 70, 60, 60, 88$

This is called an ungrouped distribution because the data is simply listed without further organization.

2- Ungrouped Frequency Distribution

Instead of repeating values, we can show how many times each value occurs. For example, in the dataset above, 4 students scored 70 marks, so the frequency of 70 is 4.

This can be shown in a table called an Ungrouped Frequency Distribution:

MarksNumber of Students
101
201
363
404
503
562
604
704
721
801
882
923
951
Total30

3- Grouped Frequency Distribution

To make large data more manageable, we can arrange it into class intervals (ranges) and record the number of values in each range.

Example:

Class Interval10-2525-4040-5555-7070-8585-100
Number of Students237666

Here, the class width is $15$ (for example, $25 - 10 = 15$, $70 - 55 = 15$). The width can be chosen based on convenience.

Measures of Central Tendency

It is often convenient to have a single number that represents the whole data. Such a number is called a Measure of Central Tendency. This value usually lies near the middle of the data and helps in understanding the overall pattern. The three most common measures of central tendency are Mean, Median, and Mode.

Mean

The mean of given values is the sum of all observations divided by the total number of observations.

If we have $n$ values $x_1, x_2, x_3, \ldots, x_n$, then the mean $\bar{x}$ (read as “x-bar”) is: $\bar{x} = \frac{x_1 + x_2 + \cdots + x_n}{n}$

Example: To calculate the mean marks of 50 students, add the marks of all 50 students and divide by 50.

Mean of Ungrouped Data

If $n$ observations are $x_1, x_2, x_3, \ldots, x_n$, then: $\bar{x} = \frac{x_1+x_2+\cdots+x_n}{n} = \frac{1}{n} \sum_{i=1}^n x_i$

Mean of Ungrouped Frequency Distribution

If values are $x_1, x_2, x_3, \ldots, x_n$ with respective frequencies $f_1, f_2, f_3, \ldots, f_n$, then: $\bar{x} = \frac{f_1x_1+f_2x_2+f_3x_3+\cdots+f_nx_n}{f_1+f_2+f_3+\cdots+f_n} = \frac{\sum_{i=1}^n f_i x_i}{\sum_{i=1}^n f_i}$

Median

The median is the middle value of data when arranged in ascending or descending order. It divides the dataset into two equal halves.

Example:
Data: $65, 55, 89, 56, 35, 14, 56, 55, 87, 45, 92$

Arranged order: $14, 35, 45, 55, 55, 56, 56, 65, 87, 89, 92$

The middle value is $56$, so the median = 56.

If $n$ is even, the median is the average of the two middle values.

Median of Ungrouped Data

If the number of observations is $n$:

  • If $n$ is odd: $\text{Median} = \left(\frac{n+1}{2}\right)^{th} \text{ observation}$

  • If $n$ is even: $\text{Median} = \frac{\text{Value of } \left(\frac{n}{2}\right)^{th} \text{ observation } + \text{Value of } \left(\frac{n}{2}+1\right)^{th} \text{ observation}}{2}$

Example:
Data: $1, 11.5, 6, 7.2, 4, 8, 9, 10, 6.8, 8.3, 2, 2, 10, 1$

Ordered: $1, 1, 2, 2, 4, 6, 6.8, 7.2, 8, 8.3, 9, 10, 10, 11.5$

Here $n = 14$. Median = average of 7th and 8th values = $(6.8 + 7.2)/2 = 7$.

Median of Ungrouped Frequency Distribution

Let total frequency = $N$.

  • If $N$ is odd: Median = the observation with cumulative frequency $\geq \frac{N+1}{2}$.

  • If $N$ is even: Median = average of the values corresponding to cumulative frequencies $\geq \frac{N}{2}$ and $\geq \frac{N}{2}+1$.

Mode

The mode is the most frequently occurring value in a dataset.

Example: Data: $65, 55, 89, 56, 35, 14, 56, 55, 87, 45, 92, 55$

Here, $55$ occurs most often, so the mode = 55.

Measures of Central Tendency and Dispersion

Central tendency helps us find a single value that represents the whole data, while measures of dispersion tell us how spread out the data is. Common measures of dispersion include range, mean deviation, variance, standard deviation, and coefficients of variation.

Range

The range is the simplest measure of dispersion in statistics. It represents the difference between the largest and smallest observations in a dataset. It gives a quick idea of how spread out the data is but does not consider the distribution of values in between.

$\text{Range} = x_{\max} - x_{\min}$

Mean Deviation

Mean deviation measures the average distance of all observations from a central value (mean, median, or any chosen value). It provides a simple understanding of data spread in statistics class 10, 11, or 12.

  • Ungrouped Data: $\text{M.D.}(a) = \frac{1}{n} \sum_{i=1}^n |x_i - a|$

  • Mean Deviation about Mean: $\text{M.D.}(\bar{x}) = \frac{1}{n} \sum_{i=1}^n |x_i - \bar{x}|$

  • Frequency Data: $\text{M.D.}(\bar{x}) = \frac{\sum_{i=1}^n f_i |x_i - \bar{x}|}{\sum_{i=1}^n f_i}$

Variance

Variance is a fundamental concept in statistics class 11 and 12 that measures the average of the squares of deviations from the mean. It tells us how far each observation is from the mean and is a key step in calculating standard deviation.

  • Ungrouped Data: $\sigma^2 = \frac{1}{n} \sum_{i=1}^n (x_i - \bar{x})^2$

  • Frequency Data: $\sigma^2 = \frac{1}{N} \sum_{i=1}^n f_i (x_i - \bar{x})^2$

Standard Deviation

Standard deviation is the positive square root of variance and gives a measure of dispersion in the same units as the data. It is widely used in inferential statistics and helps compare the spread of different datasets.

  • Ungrouped Data: $\sigma = \sqrt{\frac{1}{n} \sum_{i=1}^n (x_i - \bar{x})^2}$

  • Frequency Data: $\sigma = \sqrt{\frac{1}{N} \sum_{i=1}^n f_i (x_i - \bar{x})^2}$

Coefficients of Dispersion

Coefficients of dispersion are unit-free measures of variability, which allow comparison of the spread between datasets of different scales. They are often used in statistics class 10 and 11 for analyzing data consistency.

Coefficient of Range

Shows relative spread using range.

Formula: $⁡\text{C.R.} = \frac{x_{\max} - x_{\min}}{x_{\max} + x_{\min}}$

Coefficient of Mean Deviation

Expresses mean deviation relative to the mean for a normalized measure of spread.

Formula: $\text{C.M.D.} = \frac{\text{M.D.}}{\bar{x}}$

Coefficient of Standard Deviation

Measures the relative standard deviation, giving insight into data variability independent of units.

Formula: $\text{C.S.D.} = \frac{\sigma}{\bar{x}}$

Coefficient of Variation (C.V.)

Expresses standard deviation as a percentage of the mean. It is widely preferred in statistics for comparing datasets.

Formula: $\text{C.V.} = \frac{\sigma}{\bar{x}} \times 100, \quad \bar{x} \neq 0$

Important Statistics Formulae List

This section lists all the key formulas used in statistics, from mean, median, and mode to variance, standard deviation, and coefficients of dispersion. These formulas help summarize, analyze, and interpret data efficiently in both Class 11 and 12.

ConceptDefinitionFormula
Mean (Ungrouped Data)The average of all observations; represents the central value of the dataset.$\bar{x} = \frac{x_1 + x_2 + \cdots + x_n}{n} = \frac{1}{n} \sum_{i=1}^n x_i$
Mean (Ungrouped Frequency Distribution)Weighted average where each observation occurs with a certain frequency.$\bar{x} = \frac{\sum_{i=1}^n f_i x_i}{\sum_{i=1}^n f_i}$
Mean (Grouped Frequency Distribution)Average using class midpoints for grouped data.$\bar{x} = \frac{\sum_{i=1}^n f_i m_i}{\sum_{i=1}^n f_i}, \quad m_i = \frac{\text{Lower boundary} + \text{Upper boundary}}{2}$
Median (Ungrouped Data)The middle value when data is arranged in order; divides data into two equal halves.Odd $n$: $\text{Median} = \text{Value at } \frac{n+1}{2}^{th} \text{ position}$ Even $n$: $\text{Median} = \frac{\text{Value at } \frac{n}{2}^{th} + \text{Value at } (\frac{n}{2}+1)^{th}}{2}$
Median (Grouped Data)Middle value estimated from cumulative frequencies in grouped distribution.$\text{Median} = l + \frac{\frac{N}{2} - cf}{f} \times h$
Mode (Grouped Data)Estimated value within the class having maximum frequency (modal class).$\text{Mode} = l + \frac{f_1 - f_0}{2f_1 - f_0 - f_2} \times h$
RangeDifference between largest and smallest observation; simple measure of spread.$\text{Range} = x_{\max} - x_{\min}$
Mean Deviation about MeanAverage of absolute deviations from the mean; measures spread.$\text{M.D.}(\bar{x}) = \frac{1}{n} \sum_{i=1}^n$
VarianceAverage of squared deviations from the mean; shows how far data spreads.$\sigma^2 = \frac{1}{n} \sum_{i=1}^n (x_i - \bar{x})^2$ For frequency: $\sigma^2 = \frac{1}{N} \sum_{i=1}^n f_i (x_i - \bar{x})^2$
Standard DeviationPositive square root of variance; widely used measure of spread.$\sigma = \sqrt{\frac{1}{n} \sum_{i=1}^n (x_i - \bar{x})^2}$ Frequency: $\sigma = \sqrt{\frac{1}{N} \sum_{i=1}^n f_i (x_i - \bar{x})^2}$
Coefficient of RangeNormalized measure of dispersion independent of units.$\text{C.R.} = \frac{x_{\max} - x_{\min}}{x_{\max} + x_{\min}}$
Coefficient of Mean DeviationMean deviation expressed relative to the mean; unit-free.$\text{C.M.D.} = \frac{\text{M.D.}}{\bar{x}}$
Coefficient of Standard DeviationStandard deviation relative to the mean; useful for comparing datasets.$\text{C.S.D.} = \frac{\sigma}{\bar{x}}$
Coefficient of Variation (C.V.)Standard deviation expressed as percentage of mean; more consistent than other measures.$\text{C.V.} = \frac{\sigma}{\bar{x}} \times 100$

List of Topics related to Statistics

This section covers the important Statistics topics in Class 11, including data collection, presentation, measures of central tendency, and dispersion as per NCERT and JEE Main syllabus.

Important Books and Resources for Statistics

Here you’ll find the best books and resources that explain Statistics concepts clearly, with examples and practice problems to strengthen your basics.

Book TitleAuthor / PublisherDescription
NCERT Mathematics Class 11NCERTOfficial textbook covering Statistics concepts and exercises.
Mathematics for Class 11R.D. SharmaDetailed explanations and solved problems on Statistics.
Objective MathematicsR.S. AggarwalMultiple-choice questions and practice on Class 11 Statistics.
Arihant All-In-One MathematicsArihantComprehensive approach with solved and practice questions.

NCERT Resources

NCERT Class 11 Mathematics book provides detailed explanations, solved examples, and exercises that form the core foundation of Statistics.

NCERT Subjectwise resources

This section provides subject-specific NCERT materials such as notes, exemplar problems, and detailed solutions across different subjects, helping students strengthen their concepts along with related topics.

Frequently Asked Questions (FAQs)

Q: What are types of Statistics?
A:

The two main types are:

Descriptive Statistics: Summarizing and presenting data (mean, median, graphs).

Inferential Statistics: Drawing conclusions and making predictions from data.

Q: What is Statistics in simple terms?
A:

Statistics is the branch of mathematics that deals with collecting, organizing, analyzing, and interpreting data to make meaningful conclusions or predictions.

Q: What are the common measures in Statistics?
A:

The most common measures are Mean, Median, Mode, Variance, Standard Deviation, and Probability. These help in understanding the spread and central tendency of data.

Q: What is the difference between Data and Statistics?
A:

Data is the raw information (like marks of students), while Statistics is the method of organizing and analyzing that data to find patterns or results.

Q: What is the difference between Mean, Median, and Mode?
A:

Mean: Average of all values.

Median: Middle value when data is arranged.

Mode: Value that occurs most often.

Q: What is p value in statistics?
A:

p value in statistics is used in distributions like normal distribution to validate the hypothesis of the analysis.