Hi everyone! I am new to NLP and I just saw people using an attention mechanism in a model with a bidirectional LSTM. What does it mean, and what are the pros and cons of using attention?
I have read an interesting article about the attention mechanism. I think this is what you need: https://medium.com/syncedreview/a-brief-overview-of-attention-mechanism-13c578ba9129
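In case a concrete picture helps alongside the article, here is a minimal sketch of the core idea in plain Python. It assumes simple dot-product scoring, and the names `attention_pool`, `hidden_states`, and `query` are made up for illustration. The toy numbers stand in for the per-token outputs of a bidirectional LSTM; in a real model the query vector would be learned, and other scoring functions are common too.

```python
import math


def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(hidden_states, query):
    """Score each time step against the query, then average the states
    using the softmax of those scores as weights."""
    # One dot-product score per time step (assumed scoring function).
    scores = [sum(h_d * q_d for h_d, q_d in zip(h, query)) for h in hidden_states]
    weights = softmax(scores)
    dim = len(hidden_states[0])
    # Context vector: weighted sum of the hidden states.
    context = [
        sum(w * h[d] for w, h in zip(weights, hidden_states)) for d in range(dim)
    ]
    return weights, context


# Toy stand-in for BiLSTM outputs: 3 tokens, 4 features each
# (forward and backward states concatenated). Values are made up.
hidden_states = [
    [0.1, 0.3, -0.2, 0.0],
    [0.9, 0.8, 0.7, 0.6],
    [0.0, -0.1, 0.2, 0.1],
]
query = [1.0, 1.0, 1.0, 1.0]  # hypothetical; learned during training in practice

weights, context = attention_pool(hidden_states, query)
print("weights:", weights)
print("context:", context)
```

So instead of keeping only the last LSTM state, the model builds a weighted average over all time steps, and the weights show which tokens it relied on most.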