How do Convolutional Neural Networks (CNNs) process text data?
CNNs slide a filter over the input in a window-by-window fashion, merging the values in each filter region into a single output value to detect local patterns.
Key Features:
Filter parameters are learned from the data during training.
Multiple filters are applied in parallel.
1D CNNs are used for text processing (see the sketch below).
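A minimal sketch of such a text CNN (PyTorch is assumed here; vocabulary size, dimensions, filter widths, and class count are illustrative, not from the card):

```python
import torch
import torch.nn as nn

class TextCNN(nn.Module):
    def __init__(self, vocab_size=10000, embed_dim=128,
                 num_filters=100, filter_sizes=(3, 4, 5), num_classes=2):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embed_dim)
        # One Conv1d per filter width: each slides over the token
        # dimension and merges a window of embeddings into one value
        # per filter; the filters run in parallel.
        self.convs = nn.ModuleList(
            nn.Conv1d(embed_dim, num_filters, kernel_size=k)
            for k in filter_sizes
        )
        self.fc = nn.Linear(num_filters * len(filter_sizes), num_classes)

    def forward(self, token_ids):                 # (batch, seq_len)
        x = self.embedding(token_ids)             # (batch, seq_len, embed_dim)
        x = x.transpose(1, 2)                     # Conv1d expects (batch, channels, seq_len)
        # Max-pooling over time keeps the strongest local match per filter.
        pooled = [conv(x).relu().max(dim=2).values for conv in self.convs]
        return self.fc(torch.cat(pooled, dim=1))  # (batch, num_classes)

logits = TextCNN()(torch.randint(0, 10000, (4, 20)))  # 4 sentences, 20 tokens each
```

Each filter width detects n-gram-like patterns of that length; max-pooling keeps only the strongest match, which is what makes the model sensitive to local patterns regardless of where they occur.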
What is the main difference between CNNs and RNNs in NLP?
CNNs focus on local patterns with filters and work well for text classification.
RNNs process sequences recursively, handling context-dependent tasks like translation.
How does a 1D CNN operate in Natural Language Processing (NLP)? Draw a picture
Draw an RNN as Encoder
What NLP tasks are commonly solved using RNNs?
Translation
Question answering
Summarization
Chatbots
Email auto-response
How does an RNN function as an encoder in NLP tasks?
In sequence modeling, an RNN encodes an input sequence into a single vector (its final hidden state) that represents the meaning of the input.
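A minimal sketch of an RNN used as an encoder (PyTorch assumed; a GRU stands in for any RNN cell, and all dimensions are illustrative):

```python
import torch
import torch.nn as nn

embed = nn.Embedding(10000, 64)    # illustrative vocab size / dims
rnn = nn.GRU(input_size=64, hidden_size=128, batch_first=True)

tokens = torch.randint(0, 10000, (1, 7))   # one sentence, 7 tokens
outputs, h_n = rnn(embed(tokens))

# h_n is the final hidden state: a single vector per sequence that
# summarizes ("encodes") the whole input.
print(h_n.shape)   # torch.Size([1, 1, 128]) -> (num_layers, batch, hidden)
```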
What are the main challenges in training RNNs?
Vanishing gradients → Hard to learn long-range dependencies.
Exploding gradients → Large weight updates can destabilize training (gradient clipping, sketched after this list, is the usual remedy).
Slow training compared to CNNs and Transformers.
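A minimal sketch of gradient clipping against exploding gradients (PyTorch assumed; the toy model and loss stand in for a real training setup):

```python
import torch
import torch.nn as nn

# Tiny illustrative RNN and dummy batch (shapes are arbitrary).
model = nn.RNN(input_size=8, hidden_size=16, batch_first=True)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

x = torch.randn(4, 30, 8)        # (batch, seq_len, features)
outputs, _ = model(x)
loss = outputs.pow(2).mean()     # stand-in loss

loss.backward()
# Rescale all gradients so their combined norm is at most 1.0;
# this prevents a single huge update from destabilizing training.
torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm=1.0)
optimizer.step()
```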
3 Advantages of RNNs
Allow for arbitrarily sized inputs (model global patterns in sequence data)
Recursion gets unrolled to form a standard computation graph → trainable with backpropagation & gradient descent (see the sketch after this list)
Allow for conditional generation models (ability to condition the next output on an entire sentence history)
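A minimal sketch of the unrolling idea (PyTorch assumed; the cell, loop length, and dimensions are illustrative): the same cell is applied step by step, producing an ordinary computation graph that backpropagation can traverse.

```python
import torch
import torch.nn as nn

cell = nn.RNNCell(input_size=8, hidden_size=16)
inputs = torch.randn(5, 1, 8)      # 5 time steps, batch of 1
h = torch.zeros(1, 16)             # initial hidden state

for x_t in inputs:                 # the unrolled loop
    h = cell(x_t, h)               # same weights reused at every step

h.sum().backward()                 # gradients flow back through all 5 steps
```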
What is the Encoder-Decoder architecture, and how is it used?
The Encoder-Decoder model maps an input sequence to an output sequence, making it suitable for tasks such as translation and summarization.
Features:
Encoder processes the input into a vector.
Decoder generates the output based on the encoded information (see the sketch below).
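A minimal sketch of the whole Encoder-Decoder pipeline (PyTorch assumed; greedy decoding, no attention, and all names and dimensions are illustrative):

```python
import torch
import torch.nn as nn

class Seq2Seq(nn.Module):
    def __init__(self, vocab_size=1000, embed_dim=32, hidden=64):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.encoder = nn.GRU(embed_dim, hidden, batch_first=True)
        self.decoder = nn.GRU(embed_dim, hidden, batch_first=True)
        self.out = nn.Linear(hidden, vocab_size)

    def forward(self, src, max_len=10, bos_id=1):
        _, h = self.encoder(self.embed(src))   # encode input into vector h
        token = torch.full((src.size(0), 1), bos_id, dtype=torch.long)
        generated = []
        for _ in range(max_len):               # decode one token at a time
            dec_out, h = self.decoder(self.embed(token), h)
            token = self.out(dec_out).argmax(dim=-1)
            generated.append(token)
        return torch.cat(generated, dim=1)

print(Seq2Seq()(torch.randint(0, 1000, (2, 6))).shape)  # (2, 10)
```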
Why is attention important in NLP models?
Attention helps the model focus on relevant parts of the input while making predictions.
Benefits:
Improves accuracy in long-sequence tasks.
Provides some interpretability by highlighting important words.
How does attention work in sequence-to-sequence models?
Computes a weighted average of the input states.
Uses softmax to assign importance scores.
Attention is parameterized and trained end-to-end with the model (see the sketch below).
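A minimal sketch of this weighted average (PyTorch assumed; simple dot-product scoring is used here, whereas the parameterized variants compute the scores with a small trained network):

```python
import torch
import torch.nn.functional as F

encoder_states = torch.randn(1, 7, 128)   # (batch, seq_len, hidden)
decoder_state = torch.randn(1, 128)       # current decoder hidden state

# Score each input position against the decoder state (dot product).
scores = torch.bmm(encoder_states, decoder_state.unsqueeze(2)).squeeze(2)  # (1, 7)
weights = F.softmax(scores, dim=1)        # importance scores summing to 1

# Context vector = weighted average of the input states.
context = torch.bmm(weights.unsqueeze(1), encoder_states).squeeze(1)       # (1, 128)
```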
What role does the fully connected layer play in attention-based models?
The fully connected layer:
Contains trainable parameters.
Applies non-linear activation.
Helps combine weighted input features for the final output (see the sketch below).
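A minimal sketch of such a layer (PyTorch assumed; the "concatenate context and decoder state, then tanh" pattern shown here is one common design, not necessarily the one this card has in mind):

```python
import torch
import torch.nn as nn

hidden = 128
fc = nn.Linear(2 * hidden, hidden)     # trainable parameters

context = torch.randn(1, hidden)       # weighted average of the inputs
decoder_state = torch.randn(1, hidden)

# Combine the attention context with the decoder state through
# trainable weights and a non-linear activation.
combined = torch.tanh(fc(torch.cat([context, decoder_state], dim=1)))
# `combined` then feeds the output layer that predicts the next token.
```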
Draw Attention mechanism