Facility Security Officer (FSO)
2 days ago
San Francisco
Care about AI safety risk scenarios. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences. Anthrop...