Curated learning path for Multimodal AI & Vision-Language. Build practical skills through expert-selected courses.
Varies by topic; basics usually sufficient
Some programming experience helpful
Follow these courses in order to complete the learning path. Click on any course to enroll.
This course explores baseline vision transformer models and their performance on remote sensing image classification. You will gain a good understanding of vision transformers and how to deploy them for remote sensing image classification using PyTorch.
This course teaches how to build multimodal search and RAG systems. It covers implementing contrastive learning for modality-independent embeddings, building multimodal RAG systems that reason over multimodal context, and implementing industry applications like multi-vector recommender systems.
April 2024 Update: Two new sections have been added recently. New Section 5: learn to edit the clothes of a person in a picture by programming a combination of a segmentation model with the Stable Diffusion generative model. New bonus section 6: Journey to the latent space of a neural network - dive deep into the latent space of the neural networks that power Generative AI in order to understand in depth how they learn their mappings. Generative A.I. is the present and future of A.I. and deep learning, and it will touch every part of our lives. It is the part of A.I that is closer to our unique human capability of creating, imagining and inventing. By doing this course, you gain advanced knowledge and practical experience in the most promising part of A.I., deep learning, data science and advanced technology.The course takes you on a fascinating journey in which you learn gradually, step by step, as we code together a range of generative architectures, from basic to advanced, until we reach multimodal A.I, where text and images are connected in incredible ways to produce amazing results.At the beginning of each section, I explain the key concepts in great depth and then we code together, you and me, line by line, understanding everything, conquering together the challenge of building the most promising A.I architectures of today and tomorrow. After you complete the course, you will have a deep understanding of both the key concepts and the fine details of the coding process.What a time to be alive! We are able to code and understand architectures that bring us home, home to our own human nature, capable of creating and imagining. Together, we will make it happen. Let's do it!
Explore related content to expand your skills beyond this learning path.
Enroll in this path to track your progress and stay motivated.