Research Scientist - Agency and Reasoning
hace 9 días
Palo Alto
Job DescriptionZyphra is an artificial intelligence company based in Palo Alto, California. The Role: As a Research Scientist, you will be a core contributor to Zyphra’s Agency and Reasoning Team. You will be involved with performing novel research in reinforcement learning, post-training, and human preference learning, and applying your ideas at scale to our next generation of language models. What We’re Looking For: * Strong research taste and intuition * The ability to work through a research project from conception to execution to write-up * Strong implementation and prototyping skillset * A researcher who can take an idea from conception to experimentation extremely quickly * The ability to work well and cooperate with others in a high-paced research setting * Curiosity, interest, and joy in understanding intelligence. Qualifications: * Experience and aptitude with reinforcement learning, either in the context of language model reasoning or more classical RL tasks * Experience with language model supervised finetuning and preference learning methods such as DPO, simPO, etc. * Experience with context-length extension methods * A good intuitive ability to understand model behaviors and correct them through iterative fine-tuning * Interest in grappling in detail with data and spending significant time involved in data engineering and synthetic data generation * Postgraduate degree in a scientific subject (Computer Science, EE/EECS, Mathematics, Physics) * Previously published machine learning research in well-respected venues * Highly proficient with PyTorch and Python * We are excited and able to rapidly learn new fields and implement new ideas * Excellent communication and collaboration skills, and can work effectively on both research and engineering implementation at scale Why Work at Zyphra: * We strongly value new and crazy ideas and are very willing to bet big on new ideas * We move as quickly as we can; we aim to minimize the bar to impact as low as possible * We all enjoy what we do and love discussing AI Benefits and Perks: * Comprehensive medical, dental, vision, and FSA plans * Competitive compensation and 401(k) * Relocation and immigration support on a case-by-case basis * On-site meals prepared by a dedicated culinary team; Thursday Happy Hours * In-person team in Palo Alto, CA, with a collaborative, high-energy environment