Machine Learning Research Scientist / Research Engineer, Post-Training
8 days ago
San Francisco
Research and develop novel post-training techniques, including SFT, RLHF, and reward modeling, to enhance core LLM capabilities in instruction following, factuality, coding, and multilingual and multimodal understanding. Experience with post-training techniques such as RLHF, preference model...