Descriptive statistics is a branch of statistics that summarizes and describes the main features of a dataset.

It does not involve prediction or inference.
Focuses on understanding data characteristics through numerical summaries and visualizations.

It is a key step in Exploratory Data Analysis (EDA).

EDA = Descriptive Statistics + Data Visualization

Dataset and Data Types

A dataset is a collection of data objects (examples: records, patterns, samples, observations).
Each data object is described by attributes (features or properties).

Example: Sample Patient Table (Table 2.2)

Patient ID	Name	Age	Blood Test	Fever	Disease
1	John	21	Negative	Low	No
2	Andre	36	Positive	High	Yes

Types of Data (Figure 2.1)

Data is broadly classified into:

1. Categorical (Qualitative) Data

Describes qualities or labels, not measurable.
Further classified into:
- Nominal Data: No inherent order.
  e.g., Patient ID, Blood Group, Gender.
  - Only equality comparisons (=, ≠) are valid.
- Ordinal Data: Has a meaningful order.
  e.g., Fever = {Low, Medium, High}
  - Can be ranked, but exact differences are unknown.

2. Numerical (Quantitative) Data

Represents measurable quantities.
Subtypes:
- Interval Data: Numeric values with meaningful differences, but no true zero.
  e.g., Temperature in Celsius or Fahrenheit.
  - Operations allowed: +, –
- Ratio Data: Has a meaningful zero and allows all mathematical operations.
  e.g., Age, Height, Weight.

Based on Values

Type	Description	Example
Discrete	Countable integers	Employee ID, Survey scores
Continuous	Values with decimal precision, measurable	Age (e.g., 12.5), Height, Weight

Based on Number of Variables (Figure 2.2)

Type	Description
Univariate	One variable per record
Bivariate	Two variables
Multivariate	Three or more variables

Dataset and Data Types

Example: Sample Patient Table (Table 2.2)

Types of Data (Figure 2.1)

1. Categorical (Qualitative) Data

2. Numerical (Quantitative) Data

Based on Values

Based on Number of Variables (Figure 2.2)

Related Posts

Continuous and Discrete probability distributions

BCS402 – Advanced Java Solved Previous Year Question Paper

Explain any four string modification methods of String class

Leave a ReplyCancel Reply