Software Engineer, GenAI Model Evaluation Job at Tbwa Chiat/Day Inc, San Francisco, CA

  • Tbwa Chiat/Day Inc
  • San Francisco, CA

Job Description

Software Engineer, GenAI Model Evaluation

About Job

Software is eating the world, but AI is eating software. We live in unprecedented times: AI has the potential to exponentially augment human intelligence. As the world adjusts to this new reality, leading tech companies are racing to build LLMs at billion-dollar scale, while large enterprises figure out how to add AI to their products. To ensure that these models are safe, aligned, and highly useful, they require extremely high-quality human-generated data and evaluation.

Since before the launch of ChatGPT, through to the latest generation of frontier models coming out today, Scale has been at the forefront of providing the post-training, fine-tuning, and human preference alignment (RLHF) data needed to ensure these models are capable, aligned, and useful via our Generative AI Data Engine. As customers train their models on this data, and constantly aim to improve them, a critical need is having trustworthy evaluations of model performance, and an ability to identify weaknesses and potential vulnerabilities. Conducting these evaluations with our human experts constitutes a significant and growing portion of Scale’s work, helping model developers iteratively understand where to focus their technical investments.

The GenAI Safety & Evaluation product team at Scale is at the heart of this work, building a world-class customer-facing model evaluation platform. This platform enables customers to easily launch new evaluation workflows, deep dive into evaluation results down to the test case level to understand weaknesses and benchmark performance, and use these insights to drive model development roadmaps. In building this product, you will have a chance to shape the way that models across the industry are evaluated, impacting billions of people around the world.
As part of the Safety & Evaluation product team, you will partner closely with researchers from Scale’s Safety, Evaluations, and Alignment Lab (SEAL) on productization of novel research, as well as Scale’s expert red team, which supports AI safety via rigorous model testing trusted by major enterprises and leading model developers.

We’re looking for entrepreneurial Software Engineers to join our team. In this role, you'll be given the opportunity to build these products and drive millions of dollars in revenue. You’ll also get widespread exposure to the forefront of the AI race as Scale sees it in enterprises, startups, governments, and large tech companies.

You will:
  • Own large new areas within our product
  • Work across the backend, the frontend, and interactions with LLMs and/or other ML models
  • Deliver experiments at a high velocity and level of quality to engage our customers
  • Work across the entire product lifecycle from conceptualization through production
  • Be able, and willing, to multi-task and learn new technologies quickly
  • Collaborate with cross-functional teams to define, design, and ship new product features and experiences

Must be able to commute to the San Francisco Office 1-2x weekly.

Ideally you’d have:
  • 5+ years of full-time engineering experience, post-graduation
  • Proficiency in one or more of Python, Node, React, Next.js, and MongoDB
  • Solid background in algorithms, data structures, and object-oriented programming
  • Experience scaling products at hyper-growth startups
  • Excitement to work with AI technologies
  • Strong written and verbal communication skills
  • Strong problem-solving skills and the ability to work independently or as part of a team

Nice to haves:
  • Strong knowledge of software engineering best practices
  • Experience with AI platforms and technologies, including generative models and LLMs
  • Experience building ML infrastructure and AI-powered solutions

Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position, determined by work location and additional factors, including job-related skills, experience, interview performance, and relevant education or training. The base salary range for this full-time position in the location of San Francisco is:

$160,000 - $192,000 USD

PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role.

About Us: At Scale, we believe that the transition from traditional software to AI is one of the most important shifts of our time. Our mission is to make that happen faster across every industry, and our team is transforming how organizations build and deploy AI. Our products power the world's most advanced LLMs, generative models, and computer vision models. We are trusted by generative AI companies such as OpenAI, Meta, and Microsoft, government agencies like the U.S. Army and U.S. Air Force, and enterprises including GM and Accenture. We are expanding our team to accelerate the development of AI applications.

We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an affirmative action employer and inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status.

Job Tags

Full time, Shift work
