Deep Learning¶
Status: π§ Coming soon β chapters are being written.
Deep learning uses neural networks with many layers to learn representations from data. It's behind modern computer vision, NLP, speech, and the LLMs you've been studying in the GEN AI section.
What this section will cover¶
- Neural network fundamentals β neurons, layers, activations, backprop
- Frameworks β PyTorch (primary), TensorFlow/Keras for comparison
- Optimization β SGD, momentum, Adam, learning-rate schedules
- Regularization β dropout, batch norm, weight decay, data augmentation
- Architectures β
- CNNs for vision
- RNNs / LSTMs / GRUs for sequences
- Transformers (the architecture behind LLMs)
- Transfer learning, fine-tuning, parameter-efficient fine-tuning (LoRA / QLoRA)
- Training tricks: mixed precision, gradient accumulation, distributed training
Currently available β related material¶
Deep-learning-adjacent context lives in these existing sections:
- Machine Learning β Linear Regression (gradient descent foundation)
- Machine Learning β Linear Regression β types of gradient descent
- LangChain β Models
A dedicated deep-learning track (PyTorch + transformers) is next.