CLIP: Learning Transferable Visual Models From Natural Language Supervision RWTH Computer Vision Seminar “Current Topics in Computer Vision and Machine Learning”.