Senior Research Engineer Multimodal Video Foundation Model 100% remote Worldwide
hace 1 día
Experience working with large‑scale text data, or (bonus) interleaved data spanning audio, video, image, and/or text. Direct hands‑on experience in developing or benchmarking at least one of the following topics: LLMs, Vision Language Models, Audio Language Models, generative video models. Demon