- Language modeling
- Feed-forward neural networks
- Recurrent neural networks
A Language Model is a statistical tool that uses algorithms to predict the next word or sequence of words in a sentence. These models are fundamental in natural language processing (NLP) and are used to understand and generate human language.
Key Points:
Types of Models:
A machine learning subfield of learning representations of data. Exceptional effective at learning patterns.
Deep learning algorithms attempt to learn (multiple levels of) representation by using a hierarchy of multiple layers.
\[h = \sigma(W_1x + b_1)\] \[y = \sigma(W_2h + b_2)\]
Optimize
objective/cost function \(J\)\((\theta)\)
Generate
error signal that measures difference between predictions and target values