Machine Learning Engineer, Safeguards
hace 2 días
New York
This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences. Neurodivergence, for example, attention-defi...