Software Engineer (LLM Evaluation)

Turing (Turing Enterprises, Inc.) verified

Location

-

Work Model

Remote

Job Type

Full-time

Salary

Undisclosed

About the role

We are building LLM evaluation and training datasets to train LLMs to work on realistic software engineering problems. One of our approaches in this project is to build verifiable SWE tasks from public repository histories, using a synthetic approach with a human in the loop, while expanding dataset coverage to include different types of tasks across programming languages, difficulty levels, etc.

We are looking for experienced software engineers (tech lead level) who are familiar with high-quality public GitHub repositories and can contribute to this project. This role involves hands-on software engineering work, including development environment automation, issue triaging, and evaluating test coverage and quality.

Why Join Us?

Turing is one of the world’s fastest-growing AI companies, accelerating the advancement and deployment of powerful AI systems. You’ll be at the forefront of evaluating how LLMs interact with real code, influencing the future of AI-assisted software development. This is a unique opportunity to blend practical software engineering with AI research.

What does day-to-day look like:

Analyze and triage GitHub issues across trending open-source libraries.
Set up and configure code repositories, including Dockerization and environment setup.
Evaluate unit test coverage and quality.
Modify and run codebases locally to assess LLM performance in bug-fixing scenarios.
Collaborate with researchers to design and identify repositories and issues that are challenging for LLMs.
Opportunities to lead a team of junior engineers to collaborate on projects.

Required Skills:

3+ years of overall experience.
Strong experience with at least one of the following languages: Ruby.
Proficiency with Git, Docker, and basic software pipeline setup.
Ability to understand and navigate complex codebases.
Comfortable running, modifying, and testing real-world projects locally.
Experience contributing to or evaluating open-source projects is a plus.

Nice to Have:

Previous participation in LLM research or evaluation projects.
Experience building or testing developer tools or automation agents.

Perks of Freelancing With Turing:

Work in a fully remote environment.
Opportunity to work on cutting-edge AI projects with leading LLM companies.

Offer Details:

Commitments Required: At least 4 hours per day and a minimum of 20 hours per week, with 4 hours of overlap with PST. (There are 3 time-commitment options: 20 hrs/week, 30 hrs/week, or 40 hrs/week.)
Employment type: Contractor assignment (no medical/paid leave).

Evaluation Process (approximately 75 mins):

Required Skills

Ruby on Rails, Agile Methodologies

Preferred Skills

Docker, CI/CD, Team Leadership, Mentoring, Remote Collaboration

Turing (Turing Enterprises, Inc.)

Software Development • 501-1000

Headquarters -, Nigeria

Hiring Activity ACTIVE

Company Health Unverified

"Join our team to build the future of technology."

Expires in

7 days

Posted

3 weeks ago

Job Application Safety Disclaimer

Your security and privacy are our top priorities. Please be aware that InStreamIQ will never ask you to pay any fees for job applications, placements, or training as a condition of employment.

Furthermore, legitimate employers will not ask for sensitive personal identification such as your Bank Verification Number (BVN), National Identification Number (NIN), or Passport details during the initial application phase. Do not share financial information or make any payments to individuals or organizations claiming to represent an employer. If you encounter any suspicious requests, please report the listing immediately via our support channels.
