multi_modal_transformer library