I built a simple model to generate image embeddings. This video will help you understand embeddings from first principles. I don’t use transformers or anything fancy. Instead, I build a simple Siamese Network step by step, and train it using contrastive loss.