Software Engineering Evaluator
Turing
About the role
About Us:
Turing is the world's leading research accelerator for frontier AI labs and a trusted partner for global enterprises looking to deploy advanced AI systems. We focus on two main offerings: accelerating pioneering research through high-quality data, advanced training pipelines, and top-notch AI researchers, and assisting enterprises in transforming AI from concept into reliable, powerful intelligence that delivers measurable business results.
Ideal Background:
We are seeking experienced engineers who have operated at the forefront of AI innovation, ideally within organizations such as OpenAI, NVIDIA, Databricks, Palantir, Snowflake, or others pushing the boundaries of intelligent systems. While degrees from prestigious computer science programs like Stanford, MIT, Carnegie Mellon, and UC Berkeley are valued, we believe that exceptional experience and skill are paramount.
Project Overview:
In the role of Software Engineering Evaluator, you will play a pivotal part in creating innovative datasets crucial for training, benchmarking, and advancing large language models. Your work will involve curating code examples, crafting precise solutions, and ensuring code quality across diverse tech stacks, including Python for backend and ML workflows and JavaScript (React, Node.js) for frontend applications. Additionally, you will actively evaluate and enhance AI-generated code for efficiency, scalability, and reliability while collaborating with cross-functional teams to elevate AI-driven coding solutions.
What Does a Typical Day Look Like?
- Engage in AI model training initiatives by curating code examples and correcting code in both Python and JavaScript, with contributions in C/C++, Java, Rust, and Go.
- Assess and enhance AI-generated code, ensuring efficiency and reliability across backend and frontend applications.
- Collaborate with diverse teams to improve AI-driven coding solutions based on performance benchmarks.
- Develop agents that can verify code quality and identify error patterns throughout full-stack applications.
- Form hypotheses about the software engineering lifecycle and evaluate model capabilities against them.
- Design automatic verification mechanisms for software engineering tasks.
Required Skills:
- A minimum of 3 years of software engineering experience.
- Proficiency in full-stack application development, with expertise in Python and JavaScript (React, Node.js) and the ability to navigate both backend and frontend codebases.
- Experience in deploying scalable, production-grade software using modern programming languages and tools.
- Strong grasp of software architecture, design, development, debugging, and code quality assessment.
- Excellent oral and written communication skills for conveying evaluation findings clearly.
Engagement Details:
- Flexible commitment: 10-40 hours per week.
- Contractor position (no medical/paid leave).
- Initial duration of 1 month with potential for extension based on performance.
- Must be based in the United States.
Evaluation Process:
- Application process takes 15-30 minutes.
- Completion of an AI video interview is required as part of the assessment.
After applying, a confirmation email will provide you with a link to complete your profile.
Know someone amazing? Refer them and earn rewards!