Wird geladen...
An Effective Dense Co-Attention Networks for Visual Question Answering
At present, the state-of-the-art approaches of Visual Question Answering (VQA) mainly use the co-attention model to relate each visual object with text objects, which can achieve the coarse interactions between multimodalities. However, they ignore the dense self-attention within question modality....
Gespeichert in:
| Veröffentlicht in: | Sensors (Basel) |
|---|---|
| Hauptverfasser: | , |
| Format: | Artigo |
| Sprache: | Inglês |
| Veröffentlicht: |
MDPI
2020
|
| Schlagworte: | |
| Online Zugang: | https://ncbi.nlm.nih.gov/pmc/articles/PMC7506747/ https://ncbi.nlm.nih.gov/pubmed/32872620 https://ncbi.nlm.nih.govhttp://dx.doi.org/10.3390/s20174897 |
| Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|