Đang chuẩn bị liên kết để tải về tài liệu:
ViCAN: Co-attention network for Vietnamese visual question answering
Đang chuẩn bị nút TẢI XUỐNG, xin hãy chờ
Tải xuống
In recent years, the task of Visual Question Answering (VQA) has evolved into a very attractive research field. Normally, this task requires a simultaneous understanding of both the visual content of the image and the textual content of the question. |