Multimodal Architecture for Emotion Recognition