Software Engineer, GenAI Model Evaluation Job at Tbwa Chiat/Day Inc, San Francisco, CA

  • Tbwa Chiat/Day Inc
  • San Francisco, CA

Job Description

Software Engineer, GenAI Model Evaluation

About the Job

Software is eating the world, but AI is eating software. We live in unprecedented times: AI has the potential to exponentially augment human intelligence. As the world adjusts to this new reality, leading tech companies are racing to build LLMs at billion-dollar scale, while large enterprises figure out how to add them to their products. To ensure that these models are safe, aligned, and highly useful, they require extremely high-quality human-generated data and evaluation.

Since before the launch of ChatGPT, through to the latest generation of frontier models coming out today, Scale has been at the forefront of providing the post-training, fine-tuning, and human preference alignment (RLHF) data needed to ensure these models are capable, aligned, and useful via our Generative AI Data Engine. As customers train their models on this data, and constantly aim to improve them, a critical need is having trustworthy evaluations of model performance, and an ability to identify weaknesses and potential vulnerabilities. Conducting these evaluations with our human experts constitutes a significant and growing portion of Scale’s work, assisting model developers in iteratively understanding where to focus their technical investments.

The GenAI Safety & Evaluation product team at Scale is at the heart of this work, building a world-class customer-facing model evaluation platform. This platform enables customers to easily launch new evaluation workflows, deep dive into evaluation results down to the test case level to understand weaknesses and benchmark performance, and use these insights to drive model development roadmaps. In building this product, you will have a chance to shape the way that models across the industry are evaluated, impacting billions of people around the world.

As part of the Safety & Evaluation product team, you will partner closely with researchers from Scale’s Safety, Evaluations, and Alignment Lab (SEAL) on productization of novel research, as well as Scale’s expert red team, which supports AI safety via rigorous model testing trusted by major enterprises and leading model developers.

We’re looking for entrepreneurial Software Engineers to join our team. In this role, you'll be given the opportunity to build these products and drive millions of dollars in revenue. You’ll also get widespread exposure to the forefront of the AI race as Scale sees it in enterprises, startups, governments, and large tech companies.

You will:
  • Own large new areas within our product
  • Work across backend, frontend, and interacting with LLMs and/or other ML models
  • Deliver experiments at a high velocity and level of quality to engage our customers
  • Work across the entire product lifecycle from conceptualization through production
  • Be able, and willing, to multi-task and learn new technologies quickly
  • Collaborate with cross-functional teams to define, design, and ship new product features and experiences

Must be able to commute to the San Francisco Office 1-2x weekly.

Ideally you’d have:
  • 5+ years of full-time engineering experience, post-graduation
  • Proficiencies in one or more of Python, Node, React, Next.js and MongoDB
  • Solid background in algorithms, data structures, and object-oriented programming
  • Experience scaling products at hyper-growth startups
  • Excitement to work with AI technologies
  • Strong written and verbal communication skills
  • Strong problem-solving skills, and be able to work independently or as part of a team

Nice to haves:
  • Strong knowledge of software engineering best practices
  • Experience with AI platforms and technologies, including generative models and LLMs
  • Experience building ML infrastructure and AI-powered solutions

Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position, determined by work location and additional factors, including job-related skills, experience, interview performance, and relevant education or training. The base salary range for this full-time position in the location of San Francisco is:

$160,000 - $192,000 USD

PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role.

About Us

At Scale, we believe that the transition from traditional software to AI is one of the most important shifts of our time. Our mission is to make that happen faster across every industry, and our team is transforming how organizations build and deploy AI. Our products power the world's most advanced LLMs, generative models, and computer vision models. We are trusted by generative AI companies such as OpenAI, Meta, and Microsoft, government agencies like the U.S. Army and U.S. Air Force, and enterprises including GM and Accenture. We are expanding our team to accelerate the development of AI applications.

We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an affirmative action employer and inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status.

Job Tags

Full time, Shift work
