Weekday (YC W21)

We are a Y-Combinator-backed startup building your AI-powered Recruiter Agent

Generalist Evaluator Expert

Full-stack EngineerSoftware EngineerPart TimeRemoteTeam 11-50Since 2021H1B No SponsorCompany SiteLinkedIn

Location

United States

Posted

20 days ago

Salary

Not specified

English

Job Description

This role is for one of our clients

Compensation: $35-$40 per hour

We are seeking detail-oriented writing professionals to contribute to a high-impact AI research initiative in collaboration with a leading research lab. In this role, you will develop high-quality prompt–golden answer pairs used to train and evaluate advanced language models.

This is a short-term, flexible opportunity ideal for individuals with strong academic foundations and exceptional clarity in written communication. The role is well-suited for professionals who enjoy translating complex ideas into structured, precise, and easy-to-understand content.

Job Requirements

  • Key Responsibilities
  • Design and Optimize Prompts: Develop detailed, constraint-rich prompts with clear instructions and multiple requirements
  • Define Evaluation Standards: Establish expectations for high-quality responses in general consumer contexts and create comprehensive grading rubrics
  • Model Testing and Assessment: Execute prompts using AI systems and evaluate outputs against defined standards
  • Benchmarking & Quality Assurance: Collaborate in QA processes to ensure prompt tasks and rubrics meet high standards of rigor, clarity, and consistency before inclusion in benchmarking workflows
  • Maintain structured documentation and adhere to project guidelines
  • Minimum Qualifications
  • Bachelor’s degree (BS or BA) from a reputable institution (completed or in progress)
  • Strong writing, analytical, and critical thinking skills
  • Ability to work independently and meet structured deadlines
  • Meaningful familiarity with ChatGPT or similar AI tools for personal, academic, or professional use
  • Must be based in the United States or Canada
  • Preferred Qualifications
  • Experience in teaching, curriculum design, academic research, or structured evaluation
  • Experience developing grading rubrics or assessment frameworks
  • Project Details
  • Start: Immediate
  • Duration: Approximately 2 months
  • Commitment: Minimum 20 hours per week
  • Fully remote with flexible scheduling
  • Structured project environment with defined goals, workflows, and tools
  • Application & Onboarding Process
  • Complete a short AI-led interview (approximately 15 minutes)
  • Complete a 45-minute written assessment focused on rubric development
  • Selected candidates will receive project onboarding instructions
  • Contract & Payment Terms
  • Engagement will be structured as an independent contractor agreement
  • Work can be completed remotely on your own schedule
  • Projects may be extended, shortened, or concluded early based on performance and evolving project needs
  • Assignments will not require access to confidential or proprietary information from any employer, client, or institution
  • Payments are processed weekly via Stripe or Wise based on services rendered
  • Visa sponsorship is not available; H1-B and STEM OPT candidates cannot be supported at this time

Related Job Pages

More Full-stack Engineer Jobs

Software Engineering Expert

Weekday (YC W21)

We are a Y-Combinator-backed startup building your AI-powered Recruiter Agent

Full-stack Engineer20 days ago
Part TimeRemoteTeam 11-50Since 2021H1B No Sponsor

This role is for one of our clientsCompensation: $50-$150 per hourWe are seeking experienced Software Engineering professionals to contribute to high-impact research collaborations with leading AI laboratories. In this role, you will help enhance AI sy...

United States

Special Projects Software Engineers

Weekday (YC W21)

We are a Y-Combinator-backed startup building your AI-powered Recruiter Agent

Full-stack Engineer20 days ago
Part TimeRemoteTeam 11-50Since 2021H1B No Sponsor

This role is for one of our clientsCompensation: $100-$200 per hourWe are inviting select, highly capable software engineers to participate in specialized project-based engagements. This opportunity is designed for engineers who enjoy tackling unique, ...

United States

Software Developer

Wix

Wix is the comprehensive platform that gives you total creative freedom online.

Full-stack Engineer20 days ago
Full TimeRemoteTeam 1,001-5,000Since 2006H1B No Sponsor

Software Developer training the team on Splunk applications and solutions

BootstrapEntity FrameworkJavaScriptjQueryJUnitMS SQL ServerSOAPSplunkSQL.NET
Minnesota
Full-stack Engineer20 days ago
Full TimeRemoteTeam 10,001+Since 1916H1B Sponsor

Lead development of illumination and solar panel/array products: translate requirements into designs, perform mechanical/electrical/reliability/EMC analyses, generate CAD and manufacturing documentation, qualify products, support suppliers and manufacturing, and drive corrective actions and continuous improvement in a fast-paced aerospace manufacturing environment.

SolidworksAutocadInventorSolidworks PdmAutodesk VaultLabviewVbaSQLC/C+/C++Pcb DesignGd&TFmeaEmcDfmat2D Cad3D CadMultijunction Solar Cells
California
$98.6K - $162.2K / year