tailieunhanh - Báo cáo hóa học: "Joint modality fusion and temporal context exploitation for semantic video analysis"