Fine-tuning Multimodal Embedding Models | by Shaw Talebi
The first (and most important) step of any fine-tuning process is data collection. Here, I extracted title-thumbnail pairs from my channel in a 2-step process. First, I used YouTube’s search API to extract the video IDs for all the videos on my channel. Second, I used YouTube’s video API to extract the title and thumbnail …
Fine-tuning Multimodal Embedding Models | by Shaw Talebi Read More »










