Machine Learning GATE DA: Syllabus, Books & PYQs

Q: Which book for GATE DA machine learning?

Aurélien Géron's Hands-On Machine Learning . It covers nearly every algorithm in the syllabus. Pair with mentor notes for theory (SVM, bias-variance, LDA vs PCA).

Machine learning on GATE DA is not a Kaggle competition. It is a pen-and-paper exam where you are expected to reason about model assumptions, compute decision boundaries by hand, trace k-means iterations on five data points, and decompose error into bias² + variance + noise. The syllabus is bounded — every model you need is named — and the biggest mistake aspirants make is over-studying deep learning and ensemble methods that are not on the list.

Below: the official syllabus, a model-comparison decision matrix (print it), the recommended book (Hands-On ML by Géron, supplemented with theory), and a study plan that starts with the prerequisite maths you need before touching any ML topic.

Verify: Confirm the syllabus on the official GATE 2026 syllabus page (IIT Guwahati).

Official Syllabus

Supervised Learning

Regression — simple linear, multiple linear, ridge
Logistic regression
K-nearest neighbour, naive Bayes classifier
Linear discriminant analysis (LDA)
Support vector machine (SVM)
Decision trees
Bias-variance trade-off
Cross-validation — leave-one-out (LOO), k-folds
Multi-layer perceptron (neural networks), feed-forward

Unsupervised Learning

Clustering — k-means, k-medoid, hierarchical (top-down, bottom-up); single and multiple linkage
Dimensionality reduction — PCA

Every model you need is on this list. Random forests, gradient boosting, transformers, reinforcement learning, and generative models are not. The explicit mention of bias-variance trade-off and cross-validation means these are direct question targets, not background reading.

The Model Decision Matrix

Print this. Most GATE DA ML questions test whether you can pick the right model for a described problem and reason about its assumptions.

Model	When to use	Typical PYQ style
Linear regression	Continuous target, linear relationship	Closed-form solution, residuals, OLS assumptions
Ridge regression	Linear regression + L2 regularisation	Effect of λ on coefficients, bias-variance
Logistic regression	Binary classification, linear boundary	Sigmoid, log-loss, boundary computation
KNN	Non-parametric classification/regression	Effect of k, distance metric, dimensionality curse
Naive Bayes	Categorical features, fast probabilistic	Bayes + conditional independence; compute posterior
LDA	Classification under Gaussian-class assumptions; also dimensionality reduction	Within/between-class scatter; LDA vs PCA
SVM	Maximum-margin classifier; kernel for non-linear	Identify support vectors, effect of C
Decision tree	Interpretable classification/regression	Splits via Gini/entropy; pruning
K-means	Unsupervised clustering, k known	Trace iterations; initialisation sensitivity
Hierarchical clustering	Unsupervised; k not pre-specified	Single/complete/average linkage traces
PCA	Unsupervised dimensionality reduction (variance-preserving)	Principal components via covariance; SVD link
Neural network (MLP)	Non-linear function approximation	Forward/backprop trace; activation functions

Book and Resources

Resource	Role	Use for	Skip
Aurélien Géron — Hands-On Machine Learning	Primary book	Practice, intuition, code, model walk-throughs	Deployment, distributed training, RL chapter
Goodfellow et al. — Deep Learning	Supplementary lookup	Perceptrons, backprop, activation functions	Generative models, transformers, RL
The ML Hub mentor notes	Theory layer	SVM derivations, LDA vs PCA, bias-variance algebra	—
Official GATE DA PYQs (2024 onwards)	Practice	Model-identification, conceptual MSQs, trace questions	—

Full book list: GATE DA books and resources guide.

Topic Walkthrough

Regression — Linear, Multiple, Ridge

OLS estimator (closed-form), residuals, R², assumptions. Ridge adds L2 penalty; understand how λ shrinks coefficients and trades bias for variance. Be able to compute the closed-form ridge solution.

Logistic Regression

Sigmoid activation, binary cross-entropy loss, linear decision boundary. Maximum likelihood interpretation. The decision boundary shape is the same hyperplane as a linear SVM — different optimisation objective, different boundary position.

KNN and Naive Bayes

KNN is non-parametric; effect of k and distance metric matter; curse of dimensionality is a common MSQ target. Naive Bayes assumes conditional independence given the class; this is applied Bayes' theorem.

LDA — and Why PCA Is Not the Same Thing

LDA: maximises between-class / within-class scatter. PCA: maximises variance. LDA is supervised; PCA is unsupervised. Both reduce dimensions; different objectives. This distinction is one of the most consistently tested topics on the paper.

SVM

Maximum-margin classifier. Hard-margin vs soft-margin (C controls the trade-off). Support vectors are points on the margin; the boundary depends only on them. Kernel trick (polynomial, RBF) for non-linear boundaries — conceptual exposure, not derivation depth.

Decision Trees

Splits via information gain (entropy reduction) or Gini impurity. Pruning controls overfitting. Be able to compute the next split given a small dataset.

Bias-Variance Trade-off

Error = bias² + variance + irreducible. Underfitting = high bias; overfitting = high variance. Regularisation (ridge), early stopping, and reduced complexity trade variance for bias. PYQs ask you to identify which is responsible for a train/test gap.

Cross-Validation

LOO: uses n − 1 for training, 1 for validation, repeated n times — high variance, expensive. K-fold: splits into k partitions — lower variance. Stratified k-fold preserves class proportions.

Clustering

K-means: minimise within-cluster sum of squares; iterate assignment/centroid update; sensitive to initialisation and k. Hierarchical: agglomerative (bottom-up) or divisive (top-down); linkage choices (single, complete, average).

PCA

Eigen-decomposition of the covariance matrix; principal components = eigenvectors of largest eigenvalues. PCA = SVD applied to centred data — this is where linear algebra meets ML.

Neural Networks

Perceptron, MLP, activation functions (sigmoid, tanh, ReLU), forward pass, backpropagation. Be able to trace a small forward/backprop by hand. Vanishing gradient with sigmoid — conceptual exposure. DL is a small slice of the paper; do not over-invest.

What to Skip

Random forests, gradient boosting, XGBoost — ensemble methods beyond basic intuition
Reinforcement learning — Q-learning, policy gradients
Transformers and attention mechanisms
Generative models — GANs, VAEs, diffusion
Production ML / MLOps
Specific DL frameworks beyond conceptual MLP
Statistical learning theory proofs (VC dimension, PAC bounds) beyond conceptual awareness

The biggest risk: over-studying deep learning. DL is a few marks; in-syllabus models are the rest.

Past-Paper Patterns

ML PYQs since 2024 cluster around six types:

Identify the right model for a described problem
Compute the next iteration of k-means or a decision-tree split
Bias-variance attribution given train/test errors
PCA computation on a small covariance matrix
Bayes-style probabilistic classification
SVM support-vector identification

Solve every ML question from official GATE DA 2024 and 2025 papers. The decision matrix above tells you what to look for; PYQs train the pattern recognition.

Free benchmark

The ML chapter in our free GATE DA demo course includes mentor-led lectures on linear regression, SVM, PCA and neural networks plus a topic-test.

Study Plan

Prerequisites (do first)

Probability and statistics (naive Bayes, bias-variance, evaluation) and linear algebra (PCA, ridge, SVM, neural networks). Do both before ML.

Weeks 1–4: Supervised Learning

Linear, multiple, ridge regression — Géron + theory notes.
Logistic regression — sigmoid, log-loss, boundary.
KNN, naive Bayes.
LDA — and the PCA comparison.
SVM — hard/soft margin, support vectors, kernel intuition.
Decision trees — Gini, entropy, information gain.
Bias-variance + cross-validation.

Weeks 5–6: Unsupervised + Neural Networks

K-means, hierarchical clustering — trace on small datasets.
PCA — covariance eigen-decomposition, SVD connection.
Neural networks — MLP, activations, forward/backprop trace.

Weeks 7–8: PYQs and Revision

Every ML PYQ from GATE DA 2024 and 2025.
2–3 topic-wise tests from The ML Hub GATE DA test series.
Finalise the decision matrix and PCA-vs-LDA reference.

Mistakes That Cost Marks

Over-studying deep learning. DL is a small slice. RL, generative models, and transformers are not in the syllabus.
Confusing PCA and LDA. PCA = unsupervised, maximises variance. LDA = supervised, maximises class separation.
Treating Hands-On ML as complete. Géron is practice-leaning; supplement theory for SVM, bias-variance algebra, LDA vs PCA.
Skipping bias-variance. Explicitly in the syllabus, common PYQ target.
Forgetting PCA = SVD on centred data. Reinforces the LA connection.
Studying ensembles "just in case". Not in the syllabus. Allocate that time to in-syllabus models.

ML in The ML Hub's Course

The ML block in The ML Hub's GATE DA course layers mentor-led theory on top of Géron's practical material. Every in-syllabus model is covered, plus the bias-variance decomposition and cross-validation methods the syllabus names explicitly. Topic-wise tests include dedicated ML packs aligned to PYQ patterns. See ranker journeys for how AIR 9 and AIR 6 candidates used this material.

The subject that matters beyond GATE

ML is the subject most directly tied to the careers GATE DA leads into — and the one where structured prep beats scattered tutorials most clearly.

Mentor-led lectures on every in-syllabus model, with the theory Géron underplays
Model decision matrix and PCA-vs-LDA disambiguation from GATE DA rankers
Topic-wise tests on regression, classification, clustering, PCA in the test series

Explore the Course · Test Series · Free Demo

FAQs

Which book for GATE DA machine learning?

Aurélien Géron's Hands-On Machine Learning. It covers nearly every algorithm in the syllabus. Pair with mentor notes for theory (SVM, bias-variance, LDA vs PCA).

Is deep learning in GATE DA?

Yes — MLP, feed-forward, activation functions, and backpropagation are in the syllabus. But DL is a small slice. Generative models, transformers, and RL are not.

Is SVM in the GATE DA syllabus?

Yes. Cover the maximum-margin formulation, hard/soft margin (C), support vectors, and kernel trick at conceptual level.

What is bias-variance trade-off in GATE DA?

Explicitly in the syllabus. Error = bias² + variance + irreducible. Underfitting = high bias; overfitting = high variance.

Related Guides

ML's natural neighbours: Artificial Intelligence (overlaps on probabilistic reasoning), Probability & Statistics and Linear Algebra (both prerequisites). Full subject map: GATE DA books and resources · GATE DA syllabus 2027 guide.