Senior Research Engineer - Multimodal & Video Foundation Model
13 hours ago
Experience working with large-scale text data or, as a bonus, interleaved data spanning audio, video, image, and/or text. Direct hands-on experience developing or benchmarking at least one of the following: LLMs, Vision Language Models, Audio Language Models, generative video models. Demonstra