UCS654: Predictive Analytics using Statistics
(Jan to June 2026 - EVEN2526)
Join WhatsApp Group | Click Here
Table of Contents
01 - Syllabus
02 - Lecture Resources
03 - Lecture Delivery Plan (LDP)
04 - Lab Experiments
05 - Evaluation Scheme
06 - Kaggle Hack
Recommended Books
- Peter Dalgaard, Introductory Statistics with R, Second Edition, Springer.
- Brett Lantz, Machine Learning with R, Second Edition, Packt Publishing (www.PacktPub.com).
Reference Books
- Online Resource | Click Here
- Introduction to Machine Learning in R | Click Here
01 - Syllabus
UCS654 (Scheme-2023) | Syllabus | Link
Topics
- Probability, conditional probability, random variables, PDF, PMF, joint distributions, statistical independence, variance, covariance, correlation, different distribution functions, Bayes' theorem, central limit theorem. [will be asked in MST]
- Sampling distributions, parameter estimation, hypothesis testing, two-population tests, regression and correlation, univariate analysis, multivariate analysis, ANOVA. [will be asked in MST]
- Mathematical modeling of regression (linear, non-linear, multiple), understanding error in model training (loss, bias, variance, overfitting, underfitting), maximum likelihood estimation to solve regression, transformation of classification to regression, ensembling. [will be asked in MST]
- Basics of neural networks, different loss functions, validation and regularization, multilayer networks, parameter optimization methods. [will be asked in EST]
- Data generation using modeling and simulation, association mining, ECLAT, measuring data similarity and dissimilarity, and TOPSIS. [will be asked in EST]
Course Learning Outcomes (CLOs) / Course Objectives (COs)
CO1: Demonstrate the ability to use basic probability concepts with descriptive statistics. [Covered before MST]
CO2: Visualize the patterns in the data. [Covered before MST]
CO3: Demonstrate the use of statistical methods to estimate characteristics of the data. [Covered before MST]
CO4: Explain and demonstrate the use of predictive analytics in the field of data science. [Covered before EST]
Instructions
- The MST exam will be a blend of programming and theory questions.
- The exam will contain questions with an equal distribution of marks. The maximum marks may be 30.
- There will be negative marking if answers are not attempted in sequence.
- Struck-out answers, and any answer written in pencil, will be awarded zero marks.
- Answers will be evaluated with reference to the ideal solution.
02 - Lecture Resources
Unit 01: - Probability Theory | Link
Unit 02: - Advanced Statistics | Link
Unit 03: - Regression and Statistics in ML | Link
Unit 04: - Neural Network | Link
Unit 05: - Data Generation | Link
03 - Lecture Delivery Plan [till MST]
Week 1: Data Distribution & Identity [CODE Link]
L1: Probability, PMF, and PDF. Application: Using PDFs to detect data drift and to justify normalization of input data.
L2: Random Variables & Different Distributions (Gaussian, Bernoulli). Application: Initializing Neural Network weights.
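A minimal Python sketch tying L1 and L2 together (assuming NumPy and SciPy are available; all numbers are illustrative, and the He-style scale sqrt(2/fan_in) is one common convention, not one prescribed by this course):

    import numpy as np
    from scipy import stats

    rng = np.random.default_rng(0)

    # L1: PDF of a continuous variable vs. PMF of a discrete one
    x = np.linspace(-4, 4, 200)
    gauss_pdf = stats.norm.pdf(x, loc=0, scale=1)   # Gaussian density values
    bern_pmf = stats.bernoulli.pmf([0, 1], p=0.3)   # P(X=0)=0.7, P(X=1)=0.3

    # L2: Gaussian weight initialization for one layer (illustrative scale)
    fan_in, fan_out = 64, 32
    W = rng.normal(loc=0.0, scale=np.sqrt(2.0 / fan_in), size=(fan_in, fan_out))
    print(W.std())  # close to sqrt(2/64) ~= 0.177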
Week 2: Relationships between Features
L3: Joint Distribution & Statistical Independence. Application: Identifying redundant features in a dataset to reduce model complexity.
L4: Variance, Co-variance, and Correlation. Application: Constructing correlation heatmaps to prevent Multi-collinearity in training.
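As a sketch of L4 (synthetic data; feature names f1, f2, f3 are hypothetical, and Matplotlib is assumed), a correlation heatmap that exposes a redundant feature:

    import numpy as np
    import matplotlib.pyplot as plt

    rng = np.random.default_rng(1)
    # Three features; f2 is deliberately redundant (nearly a copy of f1)
    f1 = rng.normal(size=500)
    f2 = f1 + 0.05 * rng.normal(size=500)
    f3 = rng.normal(size=500)
    X = np.column_stack([f1, f2, f3])

    corr = np.corrcoef(X, rowvar=False)   # 3x3 Pearson correlation matrix
    plt.imshow(corr, vmin=-1, vmax=1, cmap="coolwarm")
    plt.colorbar(label="Pearson correlation")
    plt.xticks([0, 1, 2], ["f1", "f2", "f3"])
    plt.yticks([0, 1, 2], ["f1", "f2", "f3"])
    plt.title("Correlation heatmap: f1 and f2 are near-duplicates")
    plt.show()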
Week 3: The Logic of Inference
L5: Conditional Probability & Bayes Theorem. Application: The Naive Bayes classifier and how models update beliefs with new data.
L6: Central Limit Theorem (CLT). Application: Why mini-batch statistics (as in Batch Norm) help stabilize gradients in deep learning.
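A short sketch connecting L5 and L6 (the test characteristics and the skewed population are hypothetical illustrations, not course-mandated numbers):

    import numpy as np
    from scipy import stats

    # L5 (Bayes' theorem): posterior = likelihood * prior / evidence
    # Hypothetical test: 99% sensitivity, 5% false-positive rate, 1% prevalence
    prior, sens, fpr = 0.01, 0.99, 0.05
    evidence = sens * prior + fpr * (1 - prior)
    posterior = sens * prior / evidence
    print(f"P(condition | positive test) = {posterior:.3f}")   # ~0.167

    # L6 (CLT): means of mini-batches from a skewed population are near-Gaussian
    rng = np.random.default_rng(2)
    population = rng.exponential(scale=1.0, size=100_000)
    batch_means = rng.choice(population, size=(2_000, 64)).mean(axis=1)
    print(stats.skew(population), stats.skew(batch_means))  # large skew vs. ~0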
Week 4: Sampling and Estimation
L7: Sampling Distributions & CLT in practice. Application: Why a Validation Set must be a representative sample of the Test Set.
L8: Parameter Estimation (Point vs. Interval). Application: Estimating the confidence of a model’s prediction (Uncertainty).
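A sketch of L8's point vs. interval estimation (the 25 "evaluation scores" are simulated for illustration; in practice they might come from cross-validation runs):

    import numpy as np
    from scipy import stats

    rng = np.random.default_rng(3)
    scores = rng.normal(loc=0.82, scale=0.03, size=25)   # simulated model scores

    point_estimate = scores.mean()
    # 95% t-interval around the mean (population variance unknown)
    low, high = stats.t.interval(0.95, df=len(scores) - 1,
                                 loc=point_estimate,
                                 scale=stats.sem(scores))
    print(f"point = {point_estimate:.3f}, 95% CI = ({low:.3f}, {high:.3f})")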
Week 5: Hypothesis Testing (Model Comparison)
L9: Hypothesis Testing. Application: Determining whether a new DL model is actually better than an old one or just lucky.
L10: Two-population Tests. Application: A/B testing in ML deployments: comparing performance across two different user groups.
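A sketch of L9-L10 using Welch's two-sample t-test (the per-run accuracies of the two models are simulated; real scores would come from repeated evaluation runs or two user groups):

    import numpy as np
    from scipy import stats

    rng = np.random.default_rng(4)
    old_model = rng.normal(loc=0.80, scale=0.02, size=30)  # simulated accuracies
    new_model = rng.normal(loc=0.82, scale=0.02, size=30)

    # Welch's t-test: is the difference in mean performance significant?
    t_stat, p_value = stats.ttest_ind(new_model, old_model, equal_var=False)
    print(f"t = {t_stat:.2f}, p = {p_value:.4f}")
    if p_value < 0.05:
        print("Reject H0: the new model's mean score differs significantly.")
    else:
        print("Fail to reject H0: the difference could be luck.")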
Week 6: Analyzing Complex Data
L11: Univariate vs. Multivariate Analysis. Application: Handling high-dimensional data inputs for Neural Networks.
L12: ANOVA (Analysis of Variance). Application: Feature selection: determining which input features significantly impact the model's output.
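A sketch of one-way ANOVA for feature screening (the three groups are simulated; group_c's shifted mean stands in for a feature level that actually matters):

    import numpy as np
    from scipy import stats

    rng = np.random.default_rng(5)
    # Target values grouped by the three levels of a categorical feature
    group_a = rng.normal(loc=1.0, scale=0.3, size=40)
    group_b = rng.normal(loc=1.0, scale=0.3, size=40)
    group_c = rng.normal(loc=1.4, scale=0.3, size=40)  # shifted level

    # One-way ANOVA: do the group means differ more than chance allows?
    f_stat, p_value = stats.f_oneway(group_a, group_b, group_c)
    print(f"F = {f_stat:.2f}, p = {p_value:.4f}")  # small p: feature matters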
Regression, Errors, and Optimization
Week 7: Mathematical Modeling of Regression
L13: Linear, Non-linear, and Multiple Regression. Application: Building the simplest Neural Network (a single neuron with a linear activation).
L14: Regression & Correlation as a Predictive Tool. Application: Predicting continuous values and measuring Goodness of Fit.
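A sketch of L13-L14 with scikit-learn (assumed available; the data are generated from y = 3x + 2 plus noise): fitting a linear model, recovering its coefficients, and measuring goodness of fit with R^2:

    import numpy as np
    from sklearn.linear_model import LinearRegression
    from sklearn.metrics import r2_score

    rng = np.random.default_rng(6)
    X = rng.uniform(0, 10, size=(100, 1))
    y = 3 * X.ravel() + 2 + rng.normal(scale=2.0, size=100)

    # The simplest "single linear neuron": ordinary least squares
    model = LinearRegression().fit(X, y)
    print(model.coef_[0], model.intercept_)   # close to 3 and 2
    print(r2_score(y, model.predict(X)))      # goodness of fit (R^2)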
Week 8: Error Function
L15: Understanding Error: Loss, Bias, and Variance. Application: The Bias-Variance Tradeoff: why a perfect training score leads to failure.
L16: Overfitting vs. Underfitting. Application: Diagnosing learning curves to decide if you need more data or a smaller model.
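A sketch of the L15-L16 diagnosis (all data simulated): the same noisy data fit with polynomial models of increasing degree; training error keeps falling while test error eventually rises, the signature of overfitting:

    import numpy as np
    from sklearn.model_selection import train_test_split
    from sklearn.preprocessing import PolynomialFeatures
    from sklearn.linear_model import LinearRegression
    from sklearn.pipeline import make_pipeline
    from sklearn.metrics import mean_squared_error

    rng = np.random.default_rng(7)
    X = rng.uniform(-3, 3, size=(60, 1))
    y = np.sin(X).ravel() + rng.normal(scale=0.2, size=60)
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3,
                                              random_state=0)

    for degree in (1, 4, 15):   # underfit, reasonable, overfit
        model = make_pipeline(PolynomialFeatures(degree), LinearRegression())
        model.fit(X_tr, y_tr)
        tr = mean_squared_error(y_tr, model.predict(X_tr))
        te = mean_squared_error(y_te, model.predict(X_te))
        print(f"degree={degree:2d}  train MSE={tr:.3f}  test MSE={te:.3f}")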
04 - Lab Experiments
Experiment-1: Probability Distribution and PDF Fitting [Click Here]
- Objective: To generate data and fit a probability density function in order to understand data assumptions.
- Theory: PDFs describe continuous random variables. Many ML models assume the data follows a Gaussian distribution for stable learning.
- Tasks to be performed:
  - Generate synthetic data
  - Plot a histogram
  - Fit a Gaussian PDF
- Expected Output: Histogram with an overlaid Gaussian PDF (a sketch follows this list).
- Data Visualization and Statistical Analysis using Python [Guided Project]: Select one numerical feature from a real dataset, analyze its distribution, fit a Gaussian PDF, and justify whether normalization is required before applying a Machine Learning model.
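A minimal sketch of the three experiment tasks (illustrative only: the mean, standard deviation, and sample size below are hypothetical, not values required by the lab manual):

    import numpy as np
    import matplotlib.pyplot as plt
    from scipy import stats

    rng = np.random.default_rng(42)
    # Task 1: generate synthetic data (hypothetical parameters)
    data = rng.normal(loc=50, scale=8, size=1_000)

    # Task 2: plot a density-normalized histogram
    plt.hist(data, bins=30, density=True, alpha=0.5, label="data")

    # Task 3: fit a Gaussian PDF (MLE estimates of mean and std) and overlay it
    mu, sigma = stats.norm.fit(data)
    x = np.linspace(data.min(), data.max(), 200)
    plt.plot(x, stats.norm.pdf(x, mu, sigma), "r-",
             label=f"fit: mu={mu:.1f}, sigma={sigma:.1f}")
    plt.legend()
    plt.title("Histogram with fitted Gaussian PDF")
    plt.show()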

Instructions
- Guided projects must be of average to high difficulty level. The project completion certificate must be submitted through the link provided by the instructor.
- Guided projects must be submitted only during the scheduled lab session. Any submission after the scheduled date and time will be awarded zero marks.
- Lab experiment details and the manual will be shared through the course page.
- Each lab experiment must be performed within the first 30 minutes of the lab session. A viva voce may be conducted during the evaluation to assess conceptual understanding.
- Independent work is mandatory. Students must be able to explain their approach, logic, and results. Inability to justify the work may result in reduced or zero marks, even if the output is correct.
- Use of AI tools, code generators, or online assistance during lab evaluation is strictly prohibited, unless explicitly permitted by the instructor.
- Students absent from the lab, irrespective of the reason, will not be awarded marks for that lab experiment.
- Latecomers will not be permitted to attend the lab under any circumstances.
- Any form of unethical practice (plagiarism, copying, proxy attendance, or impersonation) will be dealt with strictly as per institute academic policies.
- Evaluation will emphasize conceptual clarity, methodology, and interpretation, not merely producing correct results.
- Students are advised to come well prepared by reviewing the lab manual and relevant theory in advance to ensure meaningful learning during the lab.
06 - Kaggle Hack
1. Kaggle-Hack-Lab-Exam-1 | Due Date: 20 Jan 2026 07:59:59
2. Kaggle-Hack-Lab-Exam-2 | Due Date: 27 Jan 2026 07:59:59
