You want to build large scale ML systems from the ground up.
You care about making safe, steerable, trustworthy systems.
As a Senior Software Engineer, you’ll touch all parts of our code and infrastructure, whether that’s making the cluster more reliable for our big jobs, improving throughput and efficiency, running and designing scientific experiments, or improving our dev tooling.
You’re excited to write code when you understand the research context and more broadly why it’s important.

You might be a good fit if you:
- Have significant software engineering experience
- Are results-oriented, with a bias towards flexibility and impact
- Pick up slack, even if it goes outside your job description
- Enjoy pair programming (we love to pair)
- Want to learn more about machine learning research
- Care about the societal impacts of your work

Strong candidates may also have experience with some of the following:
- High performance, large-scale ML systems
- GPUs, Kubernetes, PyTorch, OS internals
- Language modeling with transformers
- Reinforcement learning
- Large-scale ETL

Representative projects:
- Making a Wikipedia dataset in a format models can easily consume
- Scaling a distributed training job to thousands of GPUs
- Writing a design doc for fault tolerance strategies
- Creating an interactive visualization of attention between tokens in a language model
- Optimizing the throughput of a new attention mechanism
- Comparing the compute efficiency of two Transformer variants

How we’re different

We believe that the highest-impact AI research will be big science.
At Anthropic we work as a single cohesive team on just a few large-scale research efforts.
And we value impact, meaning advancing our long-term goals of steerable, trustworthy AI, rather than work on smaller and more specific puzzles.

We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science.
We’re most excited to hire researchers from diverse backgrounds who share this perspective.

We’re an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time.
As such, we greatly value communication skills.

We’re trying to build a core of knowledge and intuition about the most robustly effective innovations in AI, and so thoroughly-documented null results are almost as valuable as positive discoveries.

We do not have boundaries between engineering and research, and we expect all of our technical staff to contribute to both as needed.

The easiest way to understand our research directions is to read some of our team’s previous work, such as: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.

Come work with us

Anthropic is a public benefit corporation based in San Francisco.
We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues.