
Mercor

170 active gigs · referral program

AI talent marketplace. 143+ roles for domain experts. $12-$3000/hr. Referral bonuses on every listing.

Languages Supported

🇺🇸 EN, 🇮🇳 HI, 🇸🇦 AR, 🇨🇳 ZH, 🇪🇸 ES, 🇧🇷 PT, 🇫🇷 FR, 🇩🇪 DE, 🇯🇵 JA, 🇰🇷 KO

Countries Available

US, GB, CA, AU, NZ, DE, FR, AT, BE, LU, +3 more

Current Opportunities (170)

$500 - $1,000 one-time

## **About the Role**

The platform is seeking experienced **News Analysts, Reporters, and Journalists** to support a leading AI lab in advancing research and infrastructure for next-generation machine learning systems. This engagement focuses on diagnosing and solving real issues in your domain. It's an opportunity to contribute your expertise to cutting-edge AI research while working independently and remotely on your own schedule.

## **Key Responsibilities**

- You'll be asked to create deliverables regarding common requests within your professional domain
- You'll be asked to review peer-developed deliverables to improve AI research

## **Ideal Qualifications**

- 4+ years of professional experience in your respective field
- Excellent written communication with strong grammar and spelling skills

## Project Timeline

- **Start Date:** Immediate
- **Duration:** ~2 weeks (with the potential for project expansion)
- **Commitment:** ~15 hours/week required

* * *

## Compensation & Contract

- **Task Completion Pay:** Competitive and based on task quality (~$500 - $1000 per completed task, subject to change as the project evolves)
- **Performance Bonus:** Top performers receive a weekly bonus incentive on top of their per-task rate!

Remote · 4/2/2026
Apply →
$100 - $130 per hour

**Investment Banking Expert**

## You bring

You are a good fit if you:

- Have at least 2 years of experience working at top firms in investment banking and experience in at least one of the following:
  - Financial Modeling
  - Pitch Decks
  - Investment/Analysis Summaries and Memos
  - Company/Industry Analysis

Here are more details:

## Pay

- Between $100-$130/hour, with potential for increases for top performers

Remote · 4/2/2026
Apply →
$120 per hour

**Equity Research Expert**

## You bring

You are a good fit if you:

- Have at least 2 years of experience working at top firms in equity research and hands-on experience analyzing and extracting information from public SEC filings.

Here are more details

Remote · 4/2/2026
Apply →

The platform is seeking **Physics PhDs (Graduated)** **specializing in Statistical mechanics, condensed matter physics, or atomic, molecular and optical physics** for a premier project with one of the world's top AI labs. In this role, you will contribute your subject matter expertise to a cutting-edge project involving state-of-the-art large language models. Specifically, you will help create high-quality data that will inform the future of AI innovation by coming up with difficult problems in your domain.

You're a good fit if you:

- Hold a **PhD in Physics** with specialization in Statistical mechanics, condensed matter physics, or atomic, molecular and optical physics
- Received your graduate degree in **US/UK/Canada/Western Europe**
- Have high **attention to detail**
- Have exceptional **written and verbal communication skills**
- Have excellent **proficiency in English**

Here are more details about the role:

- The role is ongoing starting in February and continuing with rolling applications
- Experts are expected to contribute 4-6 tasks per week, each taking several hours to complete
- The work will require rigorous physics expertise and ability to follow complex instructions

Screening Process:

- You will need to complete a short AI interview and form; the whole application process should last 20-40 minutes

* * *

Apply today and leverage your leadership and technical expertise to advance cutting-edge AI models!

Remote · 4/2/2026
Apply →

The platform is seeking **Physics PhDs (Graduated)** **specializing in String theory, Quantum Field theory, particle physics, or nuclear physics** for a premier project with one of the world's top AI labs. In this role, you will contribute your subject matter expertise to a cutting-edge project involving state-of-the-art large language models. Specifically, you will help create high-quality data that will inform the future of AI innovation by coming up with difficult problems in your domain.

You're a good fit if you:

- Hold a **PhD in Physics** with specialization in String theory, Quantum Field theory, particle physics, or nuclear physics
- Received your graduate degree in **US/UK/Canada/Western Europe**
- Have high **attention to detail**
- Have exceptional **written and verbal communication skills**
- Have excellent **proficiency in English**

Here are more details about the role:

- The role is ongoing starting in February and continuing with rolling applications
- Experts are expected to contribute 4-6 tasks per week, each taking several hours to complete
- The work will require rigorous physics expertise and ability to follow complex instructions

Screening Process:

- You will need to complete a short AI interview and form; the whole application process should last 20-40 minutes

* * *

Apply today and leverage your leadership and technical expertise to advance cutting-edge AI models!

Remote · 4/2/2026
Apply →

The platform is seeking **Physics PhDs (Graduated)** **specializing in Advanced Quantum Mechanics, Advanced Electrodynamics, or Advanced classical mechanics** for a premier project with one of the world's top AI labs. In this role, you will contribute your subject matter expertise to a cutting-edge project involving state-of-the-art large language models. Specifically, you will help create high-quality data that will inform the future of AI innovation by coming up with difficult problems in your domain.

You're a good fit if you:

- Hold a **PhD in Physics** with specialization in Advanced Quantum Mechanics, Advanced Electrodynamics, or Advanced classical mechanics
- Received your graduate degree in **US/UK/Canada/Western Europe**
- Have high **attention to detail**
- Have exceptional **written and verbal communication skills**
- Have excellent **proficiency in English**

Here are more details about the role:

- The role is ongoing starting in February and continuing with rolling applications
- Experts are expected to contribute 4-6 tasks per week, each taking several hours to complete
- The work will require rigorous physics expertise and ability to follow complex instructions

Screening Process:

- You will need to complete a short AI interview and form; the whole application process should last 20-40 minutes

* * *

Apply today and leverage your leadership and technical expertise to advance cutting-edge AI models!

Remote · 4/2/2026
Apply →

**Physics PhD Experts (General Relativity, Astrophysics & Cosmology)**

In this role, you will contribute your subject matter expertise to a cutting-edge project involving state-of-the-art large language models.

## You bring

- Hold a PhD in Physics with specialization in General Relativity, Astrophysics, or Cosmology
- Received your graduate degree in US/UK/Canada/Western Europe
- Have high attention to detail
- Have exceptional written and verbal communication skills
- Have excellent proficiency in English

Here are more details

Remote · 4/2/2026
Apply →

Professional Domain Expert (Finance/Accounting): hourly contract role on the platform.

Remote · 4/2/2026
Apply →
$60 - $80 per hour

Accounting Expert: hourly contract role on the platform.

Remote · 4/2/2026
Apply →
$100 - $130 per hour

The platform is hiring **Primary Care Physicians (PCPs)** on behalf of a leading healthcare AI partner building **next-generation clinical intelligence systems and AI agents**. These systems are designed to automate complex clinical workflows, enhance decision-making, and improve care delivery at scale.

In this role, you will work closely with AI researchers and engineers to ensure that medical AI systems are **clinically accurate, safe, and aligned with real-world practice**. Your expertise will directly contribute to building AI that can interpret, reason over, and act on clinical data across healthcare workflows.

* * *

### **Key Responsibilities**

**Clinical Data Annotation & Structuring**

- Review and annotate clinical text, EHR data, and patient case notes
- Identify diagnoses, co-morbidities, treatment plans, and outcomes
- Structure complex medical information for AI training and reasoning

**Quality Assurance & Clinical Validation**

- Audit annotated datasets to ensure high clinical accuracy and consistency
- Validate AI-generated outputs, including summaries, recommendations, and care pathways
- Ensure outputs align with real-world standards of care and clinical guidelines

**Clinical Knowledge Integration**

- Contribute to the development of annotation guidelines, taxonomies, and decision frameworks
- Define edge cases and nuanced clinical scenarios
- Collaborate with cross-functional teams to improve model understanding of medical context

**AI Model Evaluation & Feedback**

- Evaluate AI systems designed to support clinical workflows and decision-making
- Flag inaccuracies, hallucinations, or unsafe outputs
- Provide structured feedback to improve model performance iteratively

**Workflow & Systems Thinking**

- Help map real-world clinical workflows (documentation, diagnosis, treatment planning) into AI-compatible processes
- Support development of AI systems that integrate seamlessly into healthcare operations

**Documentation & Training Support**

- Contribute to clinical documentation standards for AI systems
- Assist in onboarding and training new clinical annotators

* * *

### **Requirements**

- MD or DO
- Board-certified or board-eligible
- Active medical license in good standing
- **2+ years of clinical experience**, preferably in inpatient or hospitalist settings
- Academic hospital experience preferred

* * *

### **Why Join**

- Work on cutting-edge **AI systems transforming healthcare workflows**
- Directly shape how AI understands and supports real-world clinical decision-making
- Collaborate with top AI researchers and engineers

Remote · 4/2/2026
Apply →
$100 - $135 per hour

ASIC Power Engineer to perform power analysis and optimizations in ASIC for a prestigious tech company's AR/VR products. Areas of interest include Machine Learning. Primary languages used are Python, Tcl, and SystemVerilog.

## Responsibilities

- Perform PPA optimization with Fusion Compiler.
- Perform RTL and netlist-level power analysis.
- Perform post-processing and scripting on report log files for format conversion, data analysis, and information extraction.
- Set up, run, debug, and analyze reports of ASIC flows (Synthesis, PD, Power, Timing).
- Implement some blocks at RTL and UPF.
- Ability to document and communicate clearly.

## Qualifications

- 10+ years of experience as an ASIC Power engineer, or CAD Engineer/Physical Design engineer.
- Experience with power estimation tools and synthesis, some physical design.
- Knowledge of power trade-offs in design and back-end implementation.
- Hands-on experience in scripting, data analysis.
- BS in Electrical Engineering/Computer Science or equivalent experience.

Preferred qualifications:

- Experience with Synopsys (DC, ICC, PTPX/PrimePower, VCS, Verdi) and/or Cadence (Joules).
- Python, Perl (or similar) scripting and data post-processing tools.
- Excel (or Matlab) for model fitting, data visualization, and analysis.
- Experience in low power design, tools, and methodologies including power intent UPF specifications.
- Silicon Power Characterization.
- Some power profiling experience at IP/SoC level.

## Work Authorization

Applicants must be located in the United States. Cincinnatus does not sponsor visas and will not provide visa sponsorship for this role.

## About Cincinnatus

Cincinnatus is an enterprise staffing company that partners with leading technology companies to source and employ highly skilled professionals for full-time and long-term contingent roles. Cincinnatus serves as the employer of record for these engagements, providing W-2 employment, payroll, benefits, and compliance, while placing employees directly within client teams to work on high-impact initiatives.

Roles hired through Cincinnatus are not project-based or freelance engagements. They are structured, role-based positions that typically involve full-time or fixed-term commitments, close collaboration with a client's internal teams, and integration into standard enterprise workflows.

Cincinnatus is a legal entity separate from the platform. While opportunities may be discovered through the platform, employment, onboarding, payroll, and benefits for these roles are administered by Cincinnatus.

## Equal Employment Opportunity

Cincinnatus is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or any other legally protected characteristic.

Cincinnatus is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans throughout the job application process.

Remote · 4/2/2026
Apply →

Generalists

evaluation
$50 per hour

The platform is hiring highly detail-oriented Generalists on behalf of an AI lab to support **human data pipelines used to train and evaluate advanced AI systems**. This role is ideal for people who are extremely quality-focused, methodical, and take pride in producing consistently high-accuracy work.

You'll work on structured evaluation, labelling, and review tasks where precision matters more than speed. Small mistakes compound here; we're looking for people who notice edge cases, inconsistencies, and ambiguity instinctively.

### **What You'll Do**

- Review, evaluate, and annotate AI outputs with extreme attention to detail
- Follow complex guidelines precisely and apply them consistently across tasks
- Identify subtle errors, edge cases, and quality issues others might miss
- Provide clear, structured feedback to improve downstream AI training quality
- Maintain high accuracy across repetitive, high-volume workflows
- Flag unclear instructions or ambiguous cases proactively

### **What We're Looking For**

- Exceptionally **detail-oriented** and quality-obsessed mindset
- Prior experience as a **CB / reviewer / rater** in human data or AI training pipelines
- Proven ability to follow nuanced instructions without deviation
- Strong written communication and reasoning skills
- High reliability, consistency, and comfort with repetitive precision work
- Ability to work independently and maintain focus over long task batches

### **Nice to Have**

- Experience with large-scale AI data annotation or evaluation programs
- Background in QA, operations, trust & safety, or structured review work
- Familiarity with multi-step rubric-based evaluations
- Comfort giving calibrated feedback aligned with strict quality bars

### **Why Join**

- Work directly on training next-generation AI systems
- Clear expectations and quality-first culture
- Long-term opportunity for consistent, high-quality contributors

Remote · 4/2/2026
Apply →
$85 per hour

The platform is seeking **Physics PhDs (Graduated)** **specializing in one of the following sub-domains:**

- **Statistical Mechanics, Condensed Matter & AMO Physics**
- **Advanced Quantum, Electrodynamics & Classical Mechanics**
- **String theory, QFT, Particle physics & Nuclear physics**
- **General Relativity, Astrophysics & Cosmology**

In this role, you will contribute your subject matter expertise to a cutting-edge project involving state-of-the-art large language models. Specifically, you will help create high-quality data that will inform the future of AI innovation by coming up with difficult problems in your domain.

You're a good fit if you:

- Hold a **PhD in Physics**
- Received your graduate degree in **US/UK/Canada/Western Europe**
- Have high **attention to detail**
- Have exceptional **written and verbal communication skills**
- Have excellent **proficiency in English**

Here are more details about the role:

- The role is ongoing starting in February and continuing with rolling applications
- Experts are expected to contribute 4-6 tasks per week, each taking several hours to complete
- The work will require rigorous physics expertise and ability to follow complex instructions

Screening Process:

- You will need to complete a short AI interview and form; the whole application process should last 20-40 minutes

Remote · 4/2/2026
Apply →

Sourcing Manager, Data Center Equipment

A prestigious tech company is seeking a highly skilled and experienced Sourcing Manager to develop and manage strategic vendor partnerships for data center equipment, with a focus on robotics and automation technology. This role requires a strong background in supply chain management, complex negotiations, and a deep understanding of technology trends within the data center and automation space.

## Responsibilities

- Develop and nurture strategic vendor relationships through effective relationship management, including implementing performance metrics (e.g., scorecards), conducting regular business reviews, and fostering collaboration to meet the company's commercial and technical requirements.
- Maintain current knowledge of technology and industry trends for data center equipment, performing relevant market analysis.
- Drive supplier proposals (RFI/RFQ) and business awards.
- Develop and execute negotiation strategies, acceptable terms and conditions, and lead the negotiation team through to contract closure.
- Negotiate complex contracts, including Master Purchase Agreements (MPA), Master Service Agreements (MSA), and Statements of Work (SOW), demonstrating a strong grasp of key terms.
- Own market and actual cost data, including developing should-cost models for data center equipment.
- Resolve all commercial and technical queries arising from the quote process.
- Ensure business metrics are achieved across the end-to-end process, from manufacturing to data center operation.
- Collaborate extensively with cross-functional teams, including Manufacturing Quality, Strategic Engineering and Design, Global Site Sourcing, Data Center Facility Operations, Data Center Construction, and Connectivity.
- Manage operations and define and drive core business metrics and risk indicators for continuous improvement.
- Provide metrics for quarterly business reviews to assess supply chain health and forward-looking risk.
- Drive critical operations related to assurance of supply and capacity through partnership with the Demand/Supply team for accurate forecasting, materials management, qualification, reliability management, and supply allocation.

## Qualifications

- BA/BS or equivalent with 3-5 years of supply chain experience.
- A minimum of 5 years of Sourcing Management or Supply Chain Management experience in commercial sourcing roles.
- Proven analytical experience and demonstrated ability to guide teams to deliver results.
- Experience in improving supply chain and supplier performance, including the use of cost models and objective-based performance measures, particularly in the semiconductor/memory commodity.
- Extensive experience in contracts and contract negotiations, including contract strategy development and execution, with a focus on agreement closure.
- Proficiency in robotics and automation, with a strong understanding of supply chain processes and their integration with robotics technology.
- Requires a Master's or Bachelor's degree in Software Engineering, Computer Engineering, Computer Science, Analytics, Robotics and Automation, or a related field, and 5 years of experience in the job offered or a related occupation, including:
  - 5 years of Material or production planning experience.
  - 5 years experience with broad enterprise systems work, including SAP, Oracle, and custom applications.
  - 5 years of working knowledge of SQL and/or BI tools (Tableau/Power BI/Looker) to build and maintain supplier scorecards (quality, OTIF, lead time, defect rates, downtime impact, cost).
  - 2 years experience negotiating software licensing models for robotics/AI (per-robot, per-site, usage-based, support/maintenance terms).
  - 2 years experience defining contract and supplier requirements for AI systems, specifically addressing:
    - Model performance metrics, test/validation methodology, update/rollback processes.
    - Telemetry for model monitoring, and incident response protocols for safety-critical failures.
    - Data governance requirements (data retention, access controls, auditability).

Preferred Qualifications:

- MS/MBA.
- Strong understanding of complex supply chain operations and processes, including forecasting, supplier qualification, allocation management, material management, and quality assurance.
- Specific experience in automation/robotics and warehouse/logistics.

Pursuant to the California Fair Chance Act, Los Angeles County Fair Chance Ordinance for Employers, Los Angeles Fair Chance Initiative for Hiring Ordinance, and San Francisco Fair Chance Ordinance, qualified applicants will be considered for assignment with arrest and conviction records. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness, meet client expectations, standards, and accompanying requirements, and safeguard business operations and company reputation.

## Work Authorization

Applicants must be located in the United States. Cincinnatus does not sponsor visas and will not provide visa sponsorship for this role.

## About Cincinnatus

Cincinnatus is an enterprise staffing company that partners with leading technology companies to source and employ highly skilled professionals for full-time and long-term contingent roles. Cincinnatus serves as the employer of record for these engagements, providing W-2 employment, payroll, benefits, and compliance, while placing employees directly within client teams to work on high-impact initiatives.

Roles hired through Cincinnatus are not project-based or freelance engagements. They are structured, role-based positions that typically involve full-time or fixed-term commitments, close collaboration with a client's internal teams, and integration into standard enterprise workflows.

Cincinnatus is a legal entity separate from the platform. While opportunities may be discovered through the platform, employment, onboarding, payroll, and benefits for these roles are administered by Cincinnatus.

## Equal Employment Opportunity

Cincinnatus is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or any other legally protected characteristic.

Cincinnatus is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans throughout the job application process.

Remote · 4/2/2026
Apply →
$75 - $150 per hour

**Role Overview**

The platform is collaborating with leading AI labs to engage highly accomplished creative writers (playwrights, novelists, and short story authors) for advanced AI training projects. Contributors will apply their literary expertise to improve AI systems' ability to generate nuanced, high-quality narrative content. This work emphasizes strong storytelling, stylistic precision, and editorial judgment informed by published and awarded experience. This is a project-based opportunity with flexible participation.

**Key Responsibilities**

- Review and refine AI-generated content across plays, novels, and short stories
- Evaluate outputs for literary quality, structure, tone, and thematic depth
- Provide detailed editorial feedback to improve coherence and originality

**Ideal Qualifications**

- 10+ years of experience in creative writing, including playwriting, novel writing, and/or short fiction
- A substantial body of published work (e.g., novels, staged plays, or multiple short stories in recognized outlets)
- Work featured in prestigious literary publications, journals, anthologies, or professional productions
- Recipient of recognized literary awards, fellowships, or honors across any of the three disciplines
- Exceptional command of narrative craft, including structure, voice, dialogue, and character development across formats
- Strong ability to critique and refine written work with attention to style, coherence, and thematic depth

Remote · 4/2/2026
Apply →

**NYC / London Marketing Agency Professionals (5+ YOE)**

Sample companies include:

- Ogilvy (+ parent company WPP): this is our top preference
- Dentsu
- Saatchi & Saatchi

## You bring

- Work experience in London HQ / NY
- Undergraduate degree in related major
- Must be able to commit to short-term, intensive engagement; ideal for contributors with strong near-term availability

Remote · 4/2/2026
Apply →
$150 - $200 per hour

## About the Role

The platform is hiring **Gastroenterologists** on behalf of a healthcare AI partner developing advanced clinical decision-support systems focused on digestive health, hepatology, and endoscopic diagnostics. In this role, you will apply deep clinical expertise to review, annotate, and validate gastroenterology-related medical data, directly shaping safe and reliable medical AI.

This role bridges real-world gastroenterology practice with applied AI, ensuring that complex diagnostic reasoning, guideline adherence, and procedural insights are accurately represented in model training.

## Key Responsibilities

### Clinical Data Annotation

- Review and label clinical narratives, EHR notes, endoscopy reports, imaging findings, and case data related to gastrointestinal and hepatobiliary conditions
- Identify and validate diagnoses, diagnostic workups, procedural findings, treatment plans, and outcomes (e.g., IBD, GI bleeding, liver disease, malignancies)

### Quality Review & Validation

- Audit annotated datasets for clinical accuracy and guideline alignment (AGA, ACG, AASLD)
- Evaluate AI-generated recommendations for diagnostic evaluation, medical management, and procedural interventions

### Knowledge Contribution

- Define annotation guidelines for complex GI conditions (e.g., inflammatory bowel disease, cirrhosis complications, pancreatic disorders)
- Advise on taxonomy and edge cases such as obscure GI bleeding, functional disorders, and post-procedural complications

### Model Evaluation & Feedback

- Review AI-generated clinical summaries, endoscopy interpretations, and treatment suggestions
- Provide structured feedback to improve diagnostic reasoning, procedural accuracy, and patient safety

### Documentation & Training Support

- Contribute to specialty-specific annotation standards and onboarding materials

## Requirements

- MD or DO with Gastroenterology specialization
- Board-certified or board-eligible in Gastroenterology
- Active U.S. medical license in good standing
- 5+ years of post-fellowship clinical experience
- Experience with endoscopy, inpatient consults, and complex GI/hepatology cases preferred
- Strong familiarity with EHRs, endoscopy reporting systems, and clinical terminologies
- Interest in medical AI or informatics preferred

## Work Arrangement

- Preferred in person (SF), part-time (up to 10 hours/week)
- Flexible schedule, designed for actively practicing clinicians

Remote · 4/2/2026
Apply →

**The platform is seeking a Bahasa Indonesian Audio Generalist Evaluator Expert** to contribute to a high-impact audio AI research project with a leading research lab. In this role, you will work on transcription, annotation, and evaluation tasks that help train and benchmark advanced language models.

This is a short-term, structured engagement ideal for candidates with strong academic or analytical backgrounds who are fluent in Bahasa Indonesian and English and enjoy translating complex audio and visual information into precise, well-structured text.

* * *

### **Job Responsibilities**

**Transcribe and Optimize Audio & Video**

- Listen to, analyze, and transcribe audio and video content in Bahasa Indonesian, following detailed constraints and instructions.
- Produce high-quality written outputs in Bahasa Indonesian, with supporting work in English when required.
- Ensure clarity, accuracy, and strict adherence to formatting and stylistic guidelines.
- Capture nuances such as tone, intent, formal vs. informal register, and regional variations where relevant.

* * *

**Define and Document Evaluation Standards**

- Establish clear expectations for correct and high-quality responses in general consumer audio contexts.
- Develop detailed evaluation rubrics and grading guidelines in Bahasa Indonesian and English.
- Document standards to ensure consistency across reviewers and model evaluations.
- Identify linguistic nuances, grammatical complexities, and edge cases specific to Bahasa Indonesian.

* * *

**Conduct Model Testing and Grading**

- Run prompts through language models and assess generated outputs.
- Evaluate responses against predefined criteria for accuracy, completeness, fluency, and instructional clarity.
- Provide structured feedback to improve model performance in Bahasa Indonesian audio tasks.

* * *

**Support Benchmarking and Quality Assurance**

- Participate in QA and review cycles to ensure tasks, rubrics, and outputs meet the platform's quality bar.
- Maintain consistency and reliability before datasets are integrated into official benchmarks.
- Collaborate with project leads to resolve ambiguities and improve task design.

* * *

### **Minimum Qualifications**

- Strong writing, editing, and critical thinking skills.
- Ability to work independently, manage time effectively, and meet deadlines.
- Native or near-native fluency in Bahasa Indonesian (spoken and written) and professional fluency in English.
- Ability to accurately transcribe and analyze Bahasa Indonesian audio content across general consumer contexts.
- Available to commit 10–20 hours per week.

* * *

### **Preferred Qualifications**

- College students or recent graduates.
- Background in linguistics, humanities, social sciences, journalism, or technical disciplines.
- Prior experience with transcription, annotation, localization, evaluation, or research workflows in Bahasa Indonesian.
- Familiarity with dialectal variations and contemporary usage.
- Interest in AI, language models, or applied research environments.

* * *

### **Application & Onboarding Process**

- Complete a short AI-led interview (approximately 15 minutes).
- If selected, you will be onboarded and invited to begin project work.

Remote · 4/2/2026
Apply →
$32 - $40 per hour

**Hindi Content Annotation Specialist**

Contract Type: Remote
Language Requirement: Native/Fluent Hindi and English

**About the Role**

We're hiring Hindi-speaking annotators for a content evaluation + annotation project. You'll review short-form video content in Hindi and write structured assessments across multiple dimensions. Your work feeds directly into AI systems used for content integrity and trust and safety.

**What You'll Do**

- Review Hindi-language video content and write detailed evaluations in English
- Identify and describe problematic content, including implied meaning, coded language, cultural subtext, and suggestive framing
- Assess viewer impact across emotional and sensitivity dimensions
- Apply structured rating frameworks to categorize content appropriateness
- Research unfamiliar references (Hindi slang, trending memes, regional context) to make sure your evaluations are accurate
- Triage content that doesn't meet review criteria, such as wrong language or broken media

**Content Warning**

**This role involves regular exposure to harmful and disturbing content, including violence, explicit language, and mature themes. A signed content release and waiver is required before onboarding. Please consider this carefully before applying.**

**Requirements**

- Native or fluent Hindi speaker with a deep understanding of Hindi internet culture, slang, memes, and regional context
- Professional writing ability in both Hindi and English. You need to articulate nuance, subtext, and cultural context clearly in both languages.
- Strong analytical and interpretive skills. You need to capture what's implied, not just what's shown.
- Ability to evaluate sensitive and disturbing content objectively and consistently
- Familiarity with short-form social media video formats
- Reliable internet connection for streaming video

**Ideal Candidate**

- Experience in content annotation, content moderation, or trust and safety
- Comfortable working with mature, graphic, or sensitive content on a daily basis
- Detail-oriented. You focus on both the literal content and the underlying intent.
- Able to recognize when Hindi-specific cultural context, slang, or coded language changes the meaning of content that looks harmless on the surface
- Strong written communication. Your evaluations need to stand on their own without the reviewer seeing the source material.

Remote · 4/2/2026
Apply →
$70 - $90 per hour

**Role Overview** The platform is partnering with a leading AI lab to engage experienced cybersecurity and low-level programming experts for a short-term, high-impact project. This engagement focuses on analyzing and reviewing content for security vulnerabilities, with an emphasis on pattern recognition and classification in an AI context. Contributors will apply expertise in systems programming and security concepts to improve how AI models detect and reason about potential threats. This opportunity begins with a work trial and may extend into a ~2-month project based on performance. **Ideal Qualifications** - 2+ years of experience in programming, preferably with low-level languages such as C, C++, or Java - Familiarity with security vulnerability classification (e.g., OWASP, CVEs, or similar frameworks) - Understanding of core cybersecurity concepts, including web security and common attack vectors - Strong attention to detail and pattern recognition skills - Clear written and verbal communication in English - Currently based in the **U.S., Canada, UK, Australia, or New Zealand** - Ability to pass an enhanced background check **Key Responsibilities:** - You will work asynchronously with a team of highly qualified experts across your domain. - You will craft, solve, and review challenging problems with real-world applicability. **Timeline** - The work trial will take place in mid-April, with potential to expand based on results. - Exact dates will be confirmed closer to the start date. **Interview Process** - You will complete a short interview and questionnaire to assess your domain expertise. - If you are hired, you will be paid for up to 1 hour of onboarding time, including the screening process and a few onboarding videos.

Remote · 4/2/2026
Apply →
$90 - $130 per hour

The platform is seeking **lawyers with litigation or corporate/transactional experience** to support a HK dataset project. These roles involve compiling publicly available legal documents that will contribute to internal testing and external demonstrations for advanced AI systems. ### Key Responsibilities: - Collect and organize publicly available legal documents based on project guidelines. - Ensure accuracy, completeness, and clarity of all compiled materials. - Follow structured documentation processes to meet defined quality standards. - Verify sources and ensure all data is obtained from public, verifiable platforms. - Collaborate asynchronously with the project manager and meet all deadlines. **You’re a strong fit if you have:** - Prior experience in litigation or in corporate/transactional work in HK. - Familiarity with HK legal and regulatory document sources. - Excellent attention to detail and ability to follow structured documentation requirements. - Strong organizational skills and ability to manage time to meet deadlines. - Ability to work independently and deliver high-quality work on short timelines.

Remote · 4/2/2026
Apply →
$80 - $110 per hour

In a prestigious tech company's AR/VR division, a practical neural interface is being developed that leverages rich neuromotor signals measurable non-invasively with single motor neuron resolution. This technology is a key pillar for interaction with virtual and augmented worlds. The team is seeking developers experienced in user interfaces, infrastructure, and tools supporting applications across various platforms, including desktop and Android. The role involves collaborating with researchers and product partners on concept creation, ensuring proper backend integration, and producing reusable, well-tested code. ## Responsibilities - Present designs, prototypes, and concepts to cross-functional partners and stakeholders - Work collaboratively with Research, Engineering, and other partners to execute and complete experiences - Work on a variety of coding languages and technologies - Implement custom user interfaces using the latest programming techniques and technologies - Develop reusable software components for interfacing with back-end platforms ## Qualifications - Experience building maintainable and testable code bases, including API design and unit testing techniques - Exposure to architectural patterns of large-scale software applications - Experience with scripting languages such as Python, JavaScript, or Hack - Experience building Android applications in Java or Kotlin using Android SDK - Experience as an owner of a particular component, feature, or system Preferred: - Experience building complex applications for iPhone or iPad using Objective-C/C++/Swift with the iOS SDK and other frameworks - Experience with multithreaded programming and mobile memory management ## About Cincinnatus Cincinnatus is an enterprise staffing company that partners with leading technology companies to source and employ highly skilled professionals for full-time and long-term contingent roles. 
Cincinnatus serves as the employer of record for these engagements, providing W-2 employment, payroll, benefits, and compliance, while placing employees directly within client teams to work on high-impact initiatives. Roles hired through Cincinnatus are not project-based or freelance engagements. They are structured, role-based positions that typically involve full-time or fixed-term commitments, close collaboration with a client's internal teams, and integration into standard enterprise workflows. Cincinnatus is a legal entity separate from the platform. While opportunities may be discovered through the platform, employment, onboarding, payroll, and benefits for these roles are administered by Cincinnatus. ## Equal Employment Opportunity Cincinnatus is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or any other legally protected characteristic. Cincinnatus is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans throughout the job application process.

Remote · 4/2/2026
Apply →
$110 - $135 per hour

**Role Overview** We are seeking PhD-level consultants with deep expertise spanning preclinical development through early clinical stages. The ideal candidate has led or meaningfully contributed to programs navigating the path from target validation through first-in-human studies, and can independently drive strategic decisions at key inflection points. Consultants will support a range of high-impact deliverables - from preclinical study strategy and data interpretation through clinical program design and quantitative analysis. * * * **Key Areas of Expertise** We are looking for depth in one or more of the following areas. Candidates with breadth across multiple domains are especially valued. **1\. Preclinical Study Design & Execution** - Designing and executing in vivo studies that link molecular mechanism to disease-relevant phenotypes - Selecting appropriate preclinical systems (in vitro, ex vivo, animal models) with a clear rationale for human translatability - Developing biomarker strategies that span target engagement through clinical response, including practical considerations around sample collection and assay performance - Evaluating formulation and delivery approaches for tissue access across different modalities - Troubleshooting inconclusive or negative preclinical results and recommending next steps **2\. Preclinical Data Interpretation & Decision-Making** - Building exposure-activity relationships from in vivo datasets to inform clinical predictions - Evaluating whether preclinical evidence supports drug activity at the intended site of action - Updating mechanistic hypotheses as new data emerges and designing experiments to resolve ambiguity - Assessing early safety observations and developing hypotheses for their biological basis - Evaluating immunogenicity risk and its potential downstream consequences - Supporting portfolio-level decisions (advance, pivot, terminate) grounded in data quality and residual uncertainty **3\. 
Early Clinical Program Design** - Determining safe and pharmacologically relevant starting doses for human studies, including cross-species scaling and its limitations - Designing dose escalation schemes informed by expected pharmacodynamic timecourses and safety margins - Powering early-phase studies appropriately given biological variability and expected effect sizes - Defining patient selection and enrichment strategies using available biomarker and epidemiological data - Selecting endpoints - including when surrogate measures are sufficient vs. when clinical endpoints are required - Planning interim analyses, safety monitoring, and adaptive decision rules **4\. Quantitative Pharmacology & Clinical Modeling** - Exposure-response analysis and model-informed dose optimization - Population PK and PK/PD modeling, including covariate identification and impact assessment - Model-based support for dose escalation decisions using accumulating trial data - Longitudinal efficacy modeling, including time-to-effect and trajectory-based analyses - Sensitivity analyses addressing missing data, protocol deviations, and intercurrent events **5\. 
Clinical Biostatistics** - Statistical analysis planning across endpoint types (binary, continuous, time-to-event) - Multiplicity-adjusted hypothesis testing and sample size determination - Subgroup and heterogeneous treatment effect analyses with appropriate false discovery controls - Handling of estimand-related considerations, including missing data frameworks and dropout patterns - Adaptive and interim monitoring design, including futility boundaries and alpha-spending functions * * * **Ideal Candidate Profile** - PhD, MD, and/or PharmD in pharmacology, pharmaceutical sciences, biostatistics, quantitative biology, or a related field - 5+ years of industry experience in pharma, biotech, or CRO environments - Based in the United States or United Kingdom - Direct experience supporting at least one program from late preclinical stages through IND or into early clinical development - Ability to independently evaluate complex data packages and deliver clear, actionable recommendations - Strong communication skills for technical and non-technical audiences

Remote · 4/2/2026
Apply →
$23 - $30 per hour

**Generalist Expert** Contractors will support the development of AI systems by categorizing and labeling diverse datasets using predefined taxonomies. The project offers an opportunity to directly contribute to the accuracy, reliability, and performance of next-generation AI models. ## Day-to-day - Synthesize information from large volumes of data - Annotate and categorize text, images, and other data according to detailed guidelines - Apply predefined rubrics and taxonomies to produce structured, high-quality outputs - Flag inconsistencies, ambiguities, or errors in datasets - Contribute to the improvement of AI systems through consistent annotation work ## You bring - Ability to synthesize complex or high-volume information into structured formats - Strong critical reasoning, reading comprehension, and written communication skills - Prior experience applying rubrics, taxonomies, or standardized guidelines ## Bonus - A college degree and experience with data annotation projects ## More About the Opportunity - Expected commitment: ~20 hours/week

Remote · 4/2/2026
Apply →
$15 - $25 per hour

**LMIC Primary Care Physician** We are seeking licensed physicians with direct experience in Rwanda primary-care settings to support the development and validation of clinical AI systems aligned with the Rwanda Standard Treatment Guidelines (STG). ## You bring - Strong working knowledge of the Rwanda Standard Treatment Guidelines (STG), WHO IMCI protocols, and the Rwanda National Formulary - Experience managing high-burden primary-care conditions (malaria, pneumonia, diarrheal disease, maternal health, chronic disease) - Familiarity with real-world supply constraints and referral thresholds in Rwanda’s public health system - Prior experience reviewing clinical documentation, guidelines, or case audits ## Compensation Note This project will initially be task-based, with the potential to transition to an hourly compensation model ($15–$25/hour) based on performance. Important - While the listing may show an hourly

Remote · 4/2/2026
Apply →

The platform is seeking a **Bengali Audio Generalist Evaluator Expert** to contribute to a high-impact audio AI research project with a leading research lab. In this role, you will work on transcription, annotation, and evaluation tasks that help train and benchmark advanced language models. This is a short-term, structured engagement ideal for candidates with strong academic or analytical backgrounds who are fluent in **Bengali and English** and enjoy translating complex audio and visual information into precise, well-structured text. * * * ## Job Responsibilities ### Transcribe and Optimize Audio & Video - Listen to, analyze, and transcribe audio and video content in **Bengali**, following detailed constraints and instructions. - Produce high-quality written outputs in Bengali, with supporting work in English when required. - Ensure clarity, accuracy, and strict adherence to formatting and stylistic guidelines. - Capture nuances such as tone, intent, and regional language variations where relevant. * * * ### Define and Document Evaluation Standards - Establish clear expectations for correct and high-quality responses in general consumer audio contexts. - Develop detailed evaluation rubrics and grading guidelines in **Bengali and English**. - Document standards to ensure consistency across reviewers and model evaluations. - Identify edge cases, ambiguities, and linguistic complexities specific to Bengali. * * * ### Conduct Model Testing and Grading - Run prompts through language models and assess generated outputs. - Evaluate responses against predefined criteria for accuracy, completeness, fluency, and instructional clarity. - Provide structured feedback to improve model performance in Bengali audio tasks. * * * ### Support Benchmarking and Quality Assurance - Participate in QA and review cycles to ensure tasks, rubrics, and outputs meet the platform’s quality bar. - Maintain consistency and reliability before datasets are integrated into official benchmarks. 
- Collaborate with project leads to resolve ambiguities and improve task design. * * * ## Minimum Qualifications - Strong writing, editing, and critical thinking skills. - Ability to work independently, manage time effectively, and meet deadlines. - **Native or near-native fluency in Bengali (spoken and written) and professional fluency in English.** - Ability to accurately transcribe and analyze Bengali audio content across general consumer contexts. - Available to commit **10–20 hours per week**. * * * ## Preferred Qualifications - College students or recent graduates. - Background in linguistics, humanities, social sciences, journalism, or technical disciplines. - Prior experience with transcription, annotation, localization, evaluation, or research workflows in Bengali. - Familiarity with dialectal variations, regional speech patterns, and contemporary usage in Bengali. - Interest in AI, language models, or applied research environments. * * * ## Application & Onboarding Process - Complete a short AI-led interview (approximately 15 minutes). - If selected, you will be onboarded and invited to begin project work. * * * ## Additional Role Details - Work in a structured, goal-oriented project environment with clear tooling, guidelines, and support. - Gain hands-on exposure to real-world AI research and evaluation workflows. - Contribute directly to benchmarking and improving advanced multilingual language models.

Remote · 4/2/2026
Apply →

**About the platform’s talent network** Join our Compliance & Risk Specialist Expert Network to connect with leading AI labs and companies seeking your expertise. **This is an open application for future contract opportunities that match your background and interests.** Once you complete your profile and pass our AI interview, you'll be eligible for relevant projects as they become available. We match experts to opportunities on a rolling basis. **About the platform projects** Experts in our network contribute to: - Training and evaluating AI models in Compliance & Risk - Creating tasks and deliverables based on real-world scenarios - Providing domain-specific feedback to advance frontier AI research Projects vary in scope and duration, with typical commitments ranging from 15-30 hours per week. **What we're looking for:** - Professional experience in regulatory compliance frameworks, risk assessment & mitigation, internal controls & audit - Strong communication skills - Ability to work independently in a remote environment **How it works:** This is not a specific job posting. By applying, you're joining our Compliance & Risk Specialist talent network. - Apply once: Complete your profile by uploading your resume and confirming your work location. - Get verified: We review your credentials and approve qualified experts. - Stay connected: When projects need Compliance & Risk expertise, we match you to opportunities based on your expertise and project needs. - Start working: Be invited to interview for opportunities that align with your schedule and interests. You will earn competitive rates while working remotely on your own schedule.

Remote · 4/2/2026
Apply →
$60 - $100 per hour

**About the platform’s talent network** Join our Management Consultant Expert Network to connect with leading AI labs and companies seeking your expertise. **This is an open application for future contract opportunities that match your background and interests.** Once you complete your profile and pass our AI interview, you'll be eligible for relevant projects as they become available. We match experts to opportunities on a rolling basis. **About the platform projects** Experts in our network contribute to: - Training and evaluating AI models in Management Consulting - Creating tasks and deliverables based on real-world scenarios - Providing domain-specific feedback to advance frontier AI research Projects vary in scope and duration, with typical commitments ranging from 15-30 hours per week. **What we're looking for:** - Professional experience in strategic analysis & problem solving, financial & operational modeling, stakeholder communication - Strong communication skills - Ability to work independently in a remote environment **How it works:** This is not a specific job posting. By applying, you're joining our Management Consultant talent network. - Apply once: Complete your profile by uploading your resume and confirming your work location. - Get verified: We review your credentials and approve qualified experts. - Stay connected: When projects need Management Consulting expertise, we match you to opportunities based on your expertise and project needs. - Start working: Be invited to interview for opportunities that align with your schedule and interests. You will earn competitive rates while working remotely on your own schedule.

Remote · 4/2/2026
Apply →
$60 - $80 per hour

**About the platform’s talent network** Join our Physical Scientist Expert Network to connect with leading AI labs and companies seeking your expertise. **This is an open application for future contract opportunities that match your background and interests.** Once you complete your profile and pass our AI interview, you'll be eligible for relevant projects as they become available. We match experts to opportunities on a rolling basis. **About the platform projects** Experts in our network contribute to: - Training and evaluating AI models in Physical Sciences - Creating tasks and deliverables based on real-world scenarios - Providing domain-specific feedback to advance frontier AI research Projects vary in scope and duration, with typical commitments ranging from 15-30 hours per week. **What we're looking for:** - Professional experience in experimental research methods, quantitative data analysis, scientific reporting - Strong communication skills - Ability to work independently in a remote environment **How it works:** This is not a specific job posting. By applying, you're joining our Physical Scientist talent network. - Apply once: Complete your profile by uploading your resume and confirming your work location. - Get verified: We review your credentials and approve qualified experts. - Stay connected: When projects need Physical Sciences expertise, we match you to opportunities based on your expertise and project needs. - Start working: Be invited to interview for opportunities that align with your schedule and interests. You will earn competitive rates while working remotely on your own schedule.

Remote · 4/2/2026
Apply →

**About the platform’s talent network** Join our Graphic & UX/UI Designer Expert Network to connect with leading AI labs and companies seeking your expertise. **This is an open application for future contract opportunities that match your background and interests.** Once you complete your profile and pass our AI interview, you'll be eligible for relevant projects as they become available. We match experts to opportunities on a rolling basis. **About the platform projects** Experts in our network contribute to: - Training and evaluating AI models in Graphic & UX/UI Design - Creating tasks and deliverables based on real-world scenarios - Providing domain-specific feedback to advance frontier AI research Projects vary in scope and duration, with typical commitments ranging from 15-30 hours per week. **What we're looking for:** - Professional experience in user experience research, interface design (Figma/Adobe XD), visual design & prototyping - Strong communication skills - Ability to work independently in a remote environment **How it works:** This is not a specific job posting. By applying, you're joining our Graphic & UX/UI Designer talent network. - Apply once: Complete your profile by uploading your resume and confirming your work location. - Get verified: We review your credentials and approve qualified experts. - Stay connected: When projects need Graphic & UX/UI Design expertise, we match you to opportunities based on your expertise and project needs. - Start working: Be invited to interview for opportunities that align with your schedule and interests. You will earn competitive rates while working remotely on your own schedule.

Remote · 4/2/2026
Apply →
$60 - $80 per hour

**About the platform’s talent network** Join our Marketing Specialist Expert Network to connect with leading AI labs and companies seeking your expertise. **This is an open application for future contract opportunities that match your background and interests.** Once you complete your profile and pass our AI interview, you'll be eligible for relevant projects as they become available. We match experts to opportunities on a rolling basis. **About the platform projects** Experts in our network contribute to: - Training and evaluating AI models in Marketing - Creating tasks and deliverables based on real-world scenarios - Providing domain-specific feedback to advance frontier AI research Projects vary in scope and duration, with typical commitments ranging from 15-30 hours per week. **What we're looking for:** - Professional experience in digital marketing & campaign management, market research & analytics, content strategy & branding - Strong communication skills - Ability to work independently in a remote environment **How it works:** This is not a specific job posting. By applying, you're joining our Marketing Specialist talent network. - Apply once: Complete your profile by uploading your resume and confirming your work location. - Get verified: We review your credentials and approve qualified experts. - Stay connected: When projects need Marketing expertise, we match you to opportunities based on your expertise and project needs. - Start working: Be invited to interview for opportunities that align with your schedule and interests. You will earn competitive rates while working remotely on your own schedule.

Remote · 4/2/2026
Apply →

**About the platform’s talent network** Join our HR & Administration Specialist Expert Network to connect with leading AI labs and companies seeking your expertise. **This is an open application for future contract opportunities that match your background and interests.** Once you complete your profile and pass our AI interview, you'll be eligible for relevant projects as they become available. We match experts to opportunities on a rolling basis. **About the platform projects** Experts in our network contribute to: - Training and evaluating AI models in HR & Administration - Creating tasks and deliverables based on real-world scenarios - Providing domain-specific feedback to advance frontier AI research Projects vary in scope and duration, with typical commitments ranging from 15-30 hours per week. **What we're looking for:** - Professional experience in talent acquisition & onboarding, payroll & benefits administration, HR policy & compliance - Strong communication skills - Ability to work independently in a remote environment **How it works:** This is not a specific job posting. By applying, you're joining our HR & Administration Specialist talent network. - Apply once: Complete your profile by uploading your resume and confirming your work location. - Get verified: We review your credentials and approve qualified experts. - Stay connected: When projects need HR & Administration expertise, we match you to opportunities based on your expertise and project needs. - Start working: Be invited to interview for opportunities that align with your schedule and interests. You will earn competitive rates while working remotely on your own schedule.

Remote · 4/2/2026
Apply →
$60 - $180 per hour

**About the platform’s talent network** Join our Financial Analyst Expert Network to connect with leading AI labs and companies seeking your expertise. **This is an open application for future contract opportunities that match your background and interests.** Once you complete your profile and pass our AI interview, you'll be eligible for relevant projects as they become available. We match experts to opportunities on a rolling basis. **About the platform projects** Experts in our network contribute to: - Training and evaluating AI models in Finance - Creating tasks and deliverables based on real-world scenarios - Providing domain-specific feedback to advance frontier AI research Projects vary in scope and duration, with typical commitments ranging from 15-30 hours per week. **What we're looking for:** - Professional experience in financial modeling & forecasting, financial statement analysis, budgeting & variance analysis - Strong communication skills - Ability to work independently in a remote environment **How it works:** This is not a specific job posting. By applying, you're joining our Financial Analyst talent network. - Apply once: Complete your profile by uploading your resume and confirming your work location. - Get verified: We review your credentials and approve qualified experts. - Stay connected: When projects need Finance expertise, we match you to opportunities based on your expertise and project needs. - Start working: Be invited to interview for opportunities that align with your schedule and interests. You will earn competitive rates while working remotely on your own schedule.

Remote · 4/2/2026
Apply →
$60 - $80 per hour

**About the platform’s talent network** Join our Chemist Expert Network to connect with leading AI labs and companies seeking your expertise. **This is an open application for future contract opportunities that match your background and interests.** Once you complete your profile and pass our AI interview, you'll be eligible for relevant projects as they become available. We match experts to opportunities on a rolling basis. **About the platform projects** Experts in our network contribute to: - Training and evaluating AI models in Chemistry - Creating tasks and deliverables based on real-world scenarios - Providing domain-specific feedback to advance frontier AI research Projects vary in scope and duration, with typical commitments ranging from 15-30 hours per week. **What we're looking for:** - Professional experience in analytical chemistry techniques, laboratory experimentation, chemical data analysis - Strong communication skills - Ability to work independently in a remote environment **How it works:** This is not a specific job posting. By applying, you're joining our Chemist talent network. - Apply once: Complete your profile by uploading your resume and confirming your work location. - Get verified: We review your credentials and approve qualified experts. - Stay connected: When projects need Chemistry expertise, we match you to opportunities based on your expertise and project needs. - Start working: Be invited to interview for opportunities that align with your schedule and interests. You will earn competitive rates while working remotely on your own schedule.

Remote · 4/2/2026
Apply →
$60 - $80 per hour

**About the platform’s talent network** Join our Biologist Expert Network to connect with leading AI labs and companies seeking your expertise. **This is an open application for future contract opportunities that match your background and interests.** Once you complete your profile and pass our AI interview, you'll be eligible for relevant projects as they become available. We match experts to opportunities on a rolling basis. **About the platform projects** Experts in our network contribute to: - Training and evaluating AI models in Biology - Creating tasks and deliverables based on real-world scenarios - Providing domain-specific feedback to advance frontier AI research Projects vary in scope and duration, with typical commitments ranging from 15-30 hours per week. **What we're looking for:** - Professional experience in molecular & cellular biology techniques, experimental design & lab research, data analysis in life sciences - Strong communication skills - Ability to work independently in a remote environment **How it works:** This is not a specific job posting. By applying, you're joining our Biologist talent network. - Apply once: Complete your profile by uploading your resume and confirming your work location. - Get verified: We review your credentials and approve qualified experts. - Stay connected: When projects need Biology expertise, we match you to opportunities based on your expertise and project needs. - Start working: Be invited to interview for opportunities that align with your schedule and interests. You will earn competitive rates while working remotely on your own schedule.

Remote 4/2/2026
Apply →
$60 - $80 per hour

**About the platform’s talent network**

Join our Mathematician Expert Network to connect with leading AI labs and companies seeking your expertise. **This is an open application for future contract opportunities that match your background and interests.** Once you complete your profile and pass our AI interview, you'll be eligible for relevant projects as they become available. We match experts to opportunities on a rolling basis.

**About the platform projects**

Experts in our network contribute to:

- Training and evaluating AI models in Mathematics
- Creating tasks and deliverables based on real-world scenarios
- Providing domain-specific feedback to advance frontier AI research

Projects vary in scope and duration, with typical commitments ranging from 15-30 hours per week.

**What we're looking for:**

- Professional experience in statistical analysis & modeling, mathematical proof & theory, computational mathematics
- Strong communication skills
- Ability to work independently in a remote environment

**How it works:**

This is not a specific job posting. By applying, you're joining our Mathematician talent network.

- Apply once: Complete your profile by uploading your resume and confirming your work location.
- Get verified: We review your credentials and approve qualified experts.
- Stay connected: When projects need Mathematics expertise, we match you to opportunities based on your expertise and project needs.
- Start working: Be invited to interview for opportunities that align with your schedule and interests.

You will earn competitive rates while working remotely on your own schedule.

Remote 4/2/2026
Apply →
$60 - $150 per hour

**About the platform’s talent network**

Join our Law Expert Network to connect with leading AI labs and companies seeking your expertise. **This is an open application for future contract opportunities that match your background and interests.** Once you complete your profile and pass our AI interview, you'll be eligible for relevant projects as they become available. We match experts to opportunities on a rolling basis.

**About the platform projects**

Experts in our network contribute to:

- Training and evaluating AI models in Law
- Creating tasks and deliverables based on real-world scenarios
- Providing domain-specific feedback to advance frontier AI research

Projects vary in scope and duration, with typical commitments ranging from 15-30 hours per week.

**What we're looking for:**

- Professional experience in legal research & writing, contract drafting & review, litigation & dispute resolution
- Strong communication skills
- Ability to work independently in a remote environment

**How it works:**

This is not a specific job posting. By applying, you're joining our Lawyer talent network.

- Apply once: Complete your profile by uploading your resume and confirming your work location.
- Get verified: We review your credentials and approve qualified experts.
- Stay connected: When projects need Law expertise, we match you to opportunities based on your expertise and project needs.
- Start working: Be invited to interview for opportunities that align with your schedule and interests.

You will earn competitive rates while working remotely on your own schedule.

Remote 4/2/2026
Apply →
$60 - $80 per hour

**About the platform’s talent network**

Join our Physicist Expert Network to connect with leading AI labs and companies seeking your expertise. **This is an open application for future contract opportunities that match your background and interests.** Once you complete your profile and pass our AI interview, you'll be eligible for relevant projects as they become available. We match experts to opportunities on a rolling basis.

**About the platform projects**

Experts in our network contribute to:

- Training and evaluating AI models in Physics
- Creating tasks and deliverables based on real-world scenarios
- Providing domain-specific feedback to advance frontier AI research

Projects vary in scope and duration, with typical commitments ranging from 15-30 hours per week.

**What we're looking for:**

- Professional experience in theoretical & computational modeling, experimental design & data analysis, scientific programming
- Strong communication skills
- Ability to work independently in a remote environment

**How it works:**

This is not a specific job posting. By applying, you're joining our Physicist talent network.

- Apply once: Complete your profile by uploading your resume and confirming your work location.
- Get verified: We review your credentials and approve qualified experts.
- Stay connected: When projects need Physics expertise, we match you to opportunities based on your expertise and project needs.
- Start working: Be invited to interview for opportunities that align with your schedule and interests.

You will earn competitive rates while working remotely on your own schedule.

Remote 4/2/2026
Apply →
$60 - $120 per hour

**About the platform’s talent network**

Join our Nursing Expert Network to connect with leading AI labs and companies seeking your expertise. **This is an open application for future contract opportunities that match your background and interests.** Once you complete your profile and pass our AI interview, you'll be eligible for relevant projects as they become available. We match experts to opportunities on a rolling basis.

**About the platform projects**

Experts in our network contribute to:

- Training and evaluating AI models in Nursing
- Creating tasks and deliverables based on real-world scenarios
- Providing domain-specific feedback to advance frontier AI research

Projects vary in scope and duration, with typical commitments ranging from 15-30 hours per week.

**What we're looking for:**

- Professional experience in patient care & monitoring, medication administration, clinical documentation
- Strong communication skills
- Ability to work independently in a remote environment

**How it works:**

This is not a specific job posting. By applying, you're joining our Nursing talent network.

- Apply once: Complete your profile by uploading your resume and confirming your work location.
- Get verified: We review your credentials and approve qualified experts.
- Stay connected: When projects need Nursing expertise, we match you to opportunities based on your expertise and project needs.
- Start working: Be invited to interview for opportunities that align with your schedule and interests.

You will earn competitive rates while working remotely on your own schedule.

Remote 4/2/2026
Apply →
$110 - $250 per hour

**About the platform’s talent network**

Join our Physician Expert Network to connect with leading AI labs and companies seeking your expertise. **This is an open application for future contract opportunities that match your background and interests.** Once you complete your profile and pass our AI interview, you'll be eligible for relevant projects as they become available. We match experts to opportunities on a rolling basis.

**About the platform projects**

Experts in our network contribute to:

- Training and evaluating AI models in Medicine
- Creating tasks and deliverables based on real-world scenarios
- Providing domain-specific feedback to advance frontier AI research

Projects vary in scope and duration, with typical commitments ranging from 15-30 hours per week.

**What we're looking for:**

- Professional experience in clinical diagnosis & treatment, patient care management, medical documentation & compliance
- Strong communication skills
- Ability to work independently in a remote environment

**How it works:**

This is not a specific job posting. By applying, you're joining our Physician talent network.

- Apply once: Complete your profile by uploading your resume and confirming your work location.
- Get verified: We review your credentials and approve qualified experts.
- Stay connected: When projects need Physician expertise, we match you to opportunities based on your expertise and project needs.
- Start working: Be invited to interview for opportunities that align with your schedule and interests.

You will earn competitive rates while working remotely on your own schedule.

Remote 4/2/2026
Apply →

**About the platform’s talent network**

Join our Machine Learning Engineer Expert Network to connect with leading AI labs and companies seeking your expertise. **This is an open application for future contract opportunities that match your background and interests.** Once you complete your profile and pass our AI interview, you'll be eligible for relevant projects as they become available. We match experts to opportunities on a rolling basis.

**About the platform projects**

Experts in our network contribute to:

- Training and evaluating AI models in Machine Learning Engineering
- Creating tasks and deliverables based on real-world scenarios
- Providing domain-specific feedback to advance frontier AI research

Projects vary in scope and duration, with typical commitments ranging from 15-30 hours per week.

**What we're looking for:**

- Professional experience in machine learning model development, Python & ML frameworks (PyTorch/TensorFlow), model deployment & MLOps
- Strong communication skills
- Ability to work independently in a remote environment

**How it works:**

This is not a specific job posting. By applying, you're joining our Machine Learning Engineer talent network.

- Apply once: Complete your profile by uploading your resume and confirming your work location.
- Get verified: We review your credentials and approve qualified experts.
- Stay connected: When projects need Machine Learning Engineering expertise, we match you to opportunities based on your expertise and project needs.
- Start working: Be invited to interview for opportunities that align with your schedule and interests.

You will earn competitive rates while working remotely on your own schedule.

Remote 4/2/2026
Apply →

**About the platform’s talent network**

Join our Business Intelligence Analyst Expert Network to connect with leading AI labs and companies seeking your expertise. **This is an open application for future contract opportunities that match your background and interests.** Once you complete your profile and pass our AI interview, you'll be eligible for relevant projects as they become available. We match experts to opportunities on a rolling basis.

**About the platform projects**

Experts in our network contribute to:

- Training and evaluating AI models in Business Intelligence Analytics
- Creating tasks and deliverables based on real-world scenarios
- Providing domain-specific feedback to advance frontier AI research

Projects vary in scope and duration, with typical commitments ranging from 15-30 hours per week.

**What we're looking for:**

- Professional experience in SQL & data querying, data visualization (Tableau/Power BI), data modeling & warehousing
- Strong communication skills
- Ability to work independently in a remote environment

**How it works:**

This is not a specific job posting. By applying, you're joining our Business Intelligence Analyst talent network.

- Apply once: Complete your profile by uploading your resume and confirming your work location.
- Get verified: We review your credentials and approve qualified experts.
- Stay connected: When projects need Business Intelligence Analyst expertise, we match you to opportunities based on your expertise and project needs.
- Start working: Be invited to interview for opportunities that align with your schedule and interests.

You will earn competitive rates while working remotely on your own schedule.

Remote 4/2/2026
Apply →

**About the platform’s talent network**

**This is an open application for future contract opportunities that match your background and interests.** Once you complete your profile and pass our AI interview, you'll be eligible for relevant projects as they become available. We match experts to opportunities on a rolling basis.

**Role Overview**

We’re building an **AI-native platform** that replaces spreadsheet-driven operations with **real-time dashboards** and **agentic workflows**. This engineer will own core product delivery end-to-end: platform foundations, integrations, analytics surfaces, and the pilot launch.

**What you’ll build**

- **AI-native platform** (core entities, workflows, permissions, multi-tenant foundations)
- **Omni-tool integration layer** (connectors to common tools; event ingestion; sync; actions)
- **Real-time analytics dashboard** (live metrics, drill-downs, exports)
- **MCP-compatible agentic integration architecture** (tool registry, permissions, runtime, observability)
- **Pilot launch support** with hands-on technical execution (shipping, debugging, iteration with real users)

### **Responsibilities**

- Ship production features across **backend + frontend**.
- Design and implement a **scalable integrations framework**.
- Build **real-time analytics** experiences.
- Architect an **agentic tool-use layer** compatible with MCP-style patterns.
- Set up strong engineering hygiene: testing, observability, on-call readiness for the pilot.
- Iterate quickly and pragmatically.

### **What we’re looking for**

- Strong, practical software engineering skills; you can **ship end-to-end** and unblock yourself.
- Experience building **SaaS products** with APIs, data models, and production constraints.
- Deep comfort with **integrations** (OAuth2, webhooks, API pagination, rate limiting, sync semantics).
- “**Vibe coding**” mindset: you use AI tools effectively (Copilot/ChatGPT/etc.) to move fast **without** sacrificing correctness, and you have good taste around tests, interfaces, and guardrails.
- Familiarity with **MCP-style** tool interfaces and/or building tool servers.
- Clear communicator who can explain tradeoffs and keep stakeholders aligned.

## About Cincinnatus

Cincinnatus is an enterprise staffing company that partners with leading technology companies to source and employ highly skilled professionals for full-time and long-term contingent roles. Cincinnatus serves as the employer of record for these engagements, providing W-2 employment, payroll, benefits, and compliance, while placing employees directly within client teams to work on high-impact initiatives.

Roles hired through Cincinnatus are not project-based or freelance engagements. They are structured, role-based positions that typically involve full-time or fixed-term commitments, close collaboration with a client's internal teams, and integration into standard enterprise workflows.

Cincinnatus is a legal entity separate from the platform. While opportunities may be discovered through the platform, employment, onboarding, payroll, and benefits for these roles are administered by Cincinnatus.

## Equal Employment Opportunity

Cincinnatus is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or any other legally protected characteristic.

Cincinnatus is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans throughout the job application process.

Remote 4/2/2026
Apply →
$70 - $150 per hour

**About the platform’s talent network**

Join our Frontend Engineer Expert Network to connect with leading AI labs and companies seeking your expertise. **This is an open application for future contract opportunities that match your background and interests.** Once you complete your profile and pass our AI interview, you'll be eligible for relevant projects as they become available. We match experts to opportunities on a rolling basis.

**About the platform projects**

Experts in our network contribute to:

- Training and evaluating AI models in Frontend Engineering
- Creating tasks and deliverables based on real-world scenarios
- Providing domain-specific feedback to advance frontier AI research

Projects vary in scope and duration, with typical commitments ranging from 15-30 hours per week.

**What we're looking for:**

- Professional experience in modern JavaScript frameworks (React/Vue/Angular), responsive UI development (HTML/CSS), state management & performance optimization
- Strong communication skills
- Ability to work independently in a remote environment

**How it works:**

This is not a specific job posting. By applying, you're joining our Frontend Engineer talent network.

- Apply once: Complete your profile by uploading your resume and confirming your work location.
- Get verified: We review your credentials and approve qualified experts.
- Stay connected: When projects need Frontend Engineering expertise, we match you to opportunities based on your expertise and project needs.
- Start working: Be invited to interview for opportunities that align with your schedule and interests.

You will earn competitive rates while working remotely on your own schedule.

Remote 4/2/2026
Apply →

**About the platform’s talent network**

Join our DevOps / Platform Engineer Expert Network to connect with leading AI labs and companies seeking your expertise. **This is an open application for future contract opportunities that match your background and interests.** Once you complete your profile and pass our AI interview, you'll be eligible for relevant projects as they become available. We match experts to opportunities on a rolling basis.

**About the platform projects**

Experts in our network contribute to:

- Training and evaluating AI models in DevOps / Platform Engineering
- Creating tasks and deliverables based on real-world scenarios
- Providing domain-specific feedback to advance frontier AI research

Projects vary in scope and duration, with typical commitments ranging from 15-30 hours per week.

**What we're looking for:**

- Professional experience in CI/CD pipelines, cloud infrastructure (AWS/GCP/Azure), containerization & orchestration (Docker/Kubernetes)
- Strong communication skills
- Ability to work independently in a remote environment

**How it works:**

This is not a specific job posting. By applying, you're joining our DevOps / Platform Engineer talent network.

- Apply once: Complete your profile by uploading your resume and confirming your work location.
- Get verified: We review your credentials and approve qualified experts.
- Stay connected: When projects need DevOps / Platform Engineering expertise, we match you to opportunities based on your expertise and project needs.
- Start working: Be invited to interview for opportunities that align with your schedule and interests.

You will earn competitive rates while working remotely on your own schedule.

Remote 4/2/2026
Apply →
$70 - $150 per hour

**About the platform’s talent network**

Join our Backend Engineer Expert Network to connect with leading AI labs and companies seeking your expertise. **This is an open application for future contract opportunities that match your background and interests.** Once you complete your profile and pass our AI interview, you'll be eligible for relevant projects as they become available. We match experts to opportunities on a rolling basis.

**About the platform projects**

Experts in our network contribute to:

- Training and evaluating AI models in Backend Engineering
- Creating tasks and deliverables based on real-world scenarios
- Providing domain-specific feedback to advance frontier AI research

Projects vary in scope and duration, with typical commitments ranging from 15-30 hours per week.

**What we're looking for:**

- Professional experience in API development & microservices, database architecture & optimization, distributed systems
- Strong communication skills
- Ability to work independently in a remote environment

**How it works:**

This is not a specific job posting. By applying, you're joining our Backend Engineer talent network.

- Apply once: Complete your profile by uploading your resume and confirming your work location.
- Get verified: We review your credentials and approve qualified experts.
- Stay connected: When projects need Backend Engineering expertise, we match you to opportunities based on your expertise and project needs.
- Start working: Be invited to interview for opportunities that align with your schedule and interests.

You will earn competitive rates while working remotely on your own schedule.

Remote 4/2/2026
Apply →
$70 - $150 per hour

**About the platform’s talent network**

Join our Full-Stack Engineer Expert Network to connect with leading AI labs and companies seeking Full-Stack Engineering expertise. **This is an open application for future contract opportunities that match your background and interests.** Once you complete your profile and pass our AI interview, you'll be eligible for relevant projects as they become available. We match experts to opportunities on a rolling basis.

**About the platform projects**

Experts in our network contribute to:

- Training and evaluating AI models in Full-Stack Engineering
- Creating tasks and deliverables based on real-world scenarios
- Providing domain-specific feedback to advance frontier AI research

Projects vary in scope and duration, with typical commitments ranging from 15-30 hours per week.

**What we're looking for:**

- Professional experience in frontend development (React/Angular/Vue), backend development (Node.js/Django/Spring), relational & NoSQL databases
- Strong communication skills
- Ability to work independently in a remote environment

**How it works:**

This is not a specific job posting. By applying, you're joining our Full-Stack Engineer talent network.

- Apply once: Complete your profile by uploading your resume and confirming your work location.
- Get verified: We review your credentials and approve qualified experts.
- Stay connected: When projects need Full-Stack Engineering expertise, we match you to opportunities based on your expertise and project needs.
- Start working: Be invited to interview for opportunities that align with your schedule and interests.

You will earn competitive rates while working remotely on your own schedule.

Remote 4/2/2026
Apply →

The platform is seeking a **Bilingual Traditional Chinese STEM Expert who is native to Hong Kong or Taiwan**, with experience in biology / physics / chemistry theory and problem-solving, to develop and refine high-quality reasoning questions that evaluate AI models. Your expertise in biology / physics / chemistry will ensure the scientific accuracy, rigor, and instructional quality of these assessment items.

You will create and evaluate Traditional Chinese/English prompts and responses, ensuring scientific precision while maintaining clear, natural Traditional Chinese phrasing consistent with academic conventions in Hong Kong or Taiwan, and alignment with English where needed.

* * *

## **Job Details**

### **Design and Optimize STEM-Based Prompts (Bilingual)**

Create detailed prompts in Traditional Chinese and/or English with multiple constraints and scientifically accurate instructions.

### **Define and Document Evaluation Standards**

Establish high-level expectations for correct responses in STEM contexts, and develop comprehensive rubrics that account for scientific rigor and, when written in Traditional Chinese, linguistic clarity, regionally appropriate terminology (Hong Kong or Taiwan usage), and academic tone.

### **Conduct Model Testing and Grading (Bilingual)**

Run prompts through AI models and assess preliminary outputs against expectations for scientific accuracy, reasoning quality, and clarity. Compare Traditional Chinese and English responses where needed.

### **Support Benchmarking and Quality Assurance**

Collaborate in QA review processes to ensure prompt tasks and rubrics meet scientific standards. Maintain consistency and reliability across Traditional Chinese-language benchmarks prior to integration into official evaluation sets.

* * *

## **Minimum Qualifications**

- Native-level fluency in Traditional Chinese (written), specifically **native to Hong Kong or Taiwan**
- Strong reading and writing proficiency in English
- BS or BA in Biology / Physics / Chemistry
- Familiarity with undergraduate-level biology / physics / chemistry topics
- Strong writing and critical thinking skills
- Ability to work independently and meet deadlines

* * *

## **Preferred Qualifications**

- 2+ years of experience in teaching, research, or an applied STEM discipline
- Experience working with academic terminology specific to Hong Kong or Taiwan educational standards

* * *

## **Application & Onboarding Process**

- Complete a short AI-led interview (approximately 15 minutes)
- If approved, complete a paid assessment focused on writing and rubric creation
- Then, if selected, you will be invited to work on the project.

* * *

## **More Details About This Role**

- Expected commitment: **Minimum 20 hours per week**
- Project duration: Approximately **3 - 6 months**
- Work environment: Structured project setting with defined goals, tools, and review processes

Remote 4/2/2026
Apply →

The platform is looking for a **Bilingual Simplified Chinese STEM Expert** with experience in **biology / physics / chemistry** theory and problem-solving to develop and refine high-quality reasoning questions that evaluate AI models. Your knowledge in **biology / physics / chemistry** will ensure the accuracy, rigor, and instructional quality of these test items.

You will create and evaluate **Simplified Chinese/English** prompts and responses, ensuring scientific precision while maintaining clear, natural **Simplified Chinese** phrasing and alignment with English where needed.

* * *

## **Job Details**

- **Design and Optimize STEM-Based Prompts (Bilingual):** Create detailed prompts in **Simplified Chinese** and/or English with multiple constraints and scientific instructions.
- **Define and Document Evaluation Standards:** Establish high-level expectations for correct responses in STEM contexts, and develop comprehensive rubrics that account for scientific rigor and, when in **Simplified Chinese**, linguistic clarity and appropriate academic tone.
- **Conduct Model Testing and Grading (Bilingual):** Run prompts through models and assess preliminary outputs against expectations for scientific accuracy, reasoning quality, and clarity, comparing **Simplified Chinese vs. English** where needed.
- **Support Benchmarking and Quality Assurance:** Collaborate in QA review processes to ensure prompt tasks and rubrics meet scientific rigor, maintaining consistency and reliability across **Simplified Chinese-language** benchmarks before integration into official benchmarks.

* * *

## **Minimum Qualifications**

- Native-level fluency in **Simplified Chinese (written)** with strong reading/writing ability in English
- BS or BA in **Biology / Physics / Chemistry**
- Familiarity with undergraduate-level **biology / physics / chemistry** topics
- Strong writing and critical thinking skills
- Ability to work independently and meet deadlines

* * *

## **Preferred Qualifications**

- **2+ years** of experience in teaching, research, or an applied STEM discipline

* * *

## **Application & Onboarding Process**

- Complete an AI-led interview (around **15 minutes**)
- If approved, complete a paid assessment focused on writing and rubric creation
- Then, if selected, you will be invited to work on the project.

* * *

## **More Details About This Role**

- Expect to contribute at least **20 hours per week**
- Expect a commitment of around **1 month**
- You’ll be working in a structured project environment with clear goals and tools

Remote 4/2/2026
Apply →
$38 - $43 per hour

The platform is looking for a **Bilingual Korean STEM Expert** with experience in **biology / physics / chemistry** theory and problem-solving to develop and refine high-quality reasoning questions that evaluate AI models. Your knowledge in **biology / physics / chemistry** will ensure the accuracy, rigor, and instructional quality of these test items.

You will create and evaluate **Korean/English** prompts and responses, ensuring scientific precision while maintaining clear, natural **Korean** phrasing and alignment with English where needed.

* * *

## **Job Details**

- **Design and Optimize STEM-Based Prompts (Bilingual):** Create detailed prompts in **Korean** and/or English with multiple constraints and scientific instructions.
- **Define and Document Evaluation Standards:** Establish high-level expectations for correct responses in STEM contexts, and develop comprehensive rubrics that account for scientific rigor and, when in **Korean**, linguistic clarity and appropriate academic tone.
- **Conduct Model Testing and Grading (Bilingual):** Run prompts through models and assess preliminary outputs against expectations for scientific accuracy, reasoning quality, and clarity, comparing **Korean vs. English** where needed.
- **Support Benchmarking and Quality Assurance:** Collaborate in QA review processes to ensure prompt tasks and rubrics meet scientific rigor, maintaining consistency and reliability across **Korean-language** benchmarks before integration into official benchmarks.

* * *

## **Minimum Qualifications**

- Native-level fluency in **Korean (written)** with strong reading/writing ability in English
- BS or BA in **Biology / Physics / Chemistry**
- Familiarity with undergraduate-level **biology / physics / chemistry** topics
- Strong writing and critical thinking skills
- Ability to work independently and meet deadlines

* * *

## **Preferred Qualifications**

- **2+ years** of experience in teaching, research, or an applied STEM discipline

* * *

## **Application & Onboarding Process**

- Complete an AI-led interview (around **15 minutes**)
- If approved, complete a paid assessment focused on writing and rubric creation
- Then, if selected, you will be invited to work on the project.

* * *

## **More Details About This Role**

- Expect to contribute at least **20 hours per week**
- Expect a commitment of around **1 month**
- You’ll be working in a structured project environment with clear goals and tools

Remote 4/2/2026
Apply →
$38 - $43 per hour

the platform is looking for a **Bilingual Japanese STEM Expert** with experience in **biology / physics / chemistry** theory and problem-solving to develop and refine high-quality reasoning questions that evaluate AI models. Your knowledge in **biology / physics / chemistry** will ensure the accuracy, rigor, and instructional quality of these test items. You will create and evaluate **Japanese/English** prompts and responses, ensuring scientific precision while maintaining clear, natural **Japanese** phrasing and alignment with English where needed. * * * ## **Job Details** - **Design and Optimize STEM-Based Prompts (Bilingual):** Create detailed prompts in **Japanese** and/or English with multiple constraints and scientific instructions. - **Define and Document Evaluation Standards:** Establish high-level expectations for correct responses in STEM contexts, and develop comprehensive rubrics that account for scientific rigor andโ€”when in **Japanese**โ€”linguistic clarity and appropriate academic tone. - **Conduct Model Testing and Grading (Bilingual):** Run prompts through models and assess preliminary outputs against expectations for scientific accuracy, reasoning quality, and clarity, comparing **Japanese vs. English** where needed. - **Support Benchmarking and Quality Assurance:** Collaborate in QA review processes to ensure prompt tasks and rubrics meet scientific rigor, maintaining consistency and reliability across **Japanese-language** benchmarks before integration into official benchmarks. 
* * *

## **Minimum Qualifications**

- Native-level fluency in **Japanese (written)** with strong reading/writing ability in English
- BS or BA in **Biology / Physics / Chemistry**
- Familiarity with undergraduate-level **biology / physics / chemistry** topics
- Strong writing and critical thinking skills
- Ability to work independently and meet deadlines

* * *

## **Preferred Qualifications**

- **2+ years** of experience in teaching, research, or an applied STEM discipline field

* * *

## **Application & Onboarding Process**

- Complete an AI-led interview (around **15 minutes**)
- If approved, complete a paid assessment focused on writing and rubric creation
- Then, if selected, you will be invited to work on the project.

* * *

## **More Details About This Role**

- Expect to contribute at least **20 hours per week**
- Expect a commitment of around **1 month**
- You’ll be working in a structured project environment with clear goals and tools

Remote · 4/2/2026
Apply →
$18 - $42 per hour

The platform is seeking a **Bilingual Portuguese STEM Expert who is native to Brazil or Portugal**, with experience in biology / physics / chemistry theory and problem-solving, to develop and refine high-quality reasoning questions that evaluate AI models. Your expertise in biology / physics / chemistry will ensure the scientific accuracy, rigor, and instructional quality of these assessment items. You will create and evaluate Portuguese/English prompts and responses, ensuring scientific precision while maintaining clear, natural Portuguese phrasing consistent with academic conventions in Brazil or Portugal, and alignment with English where needed.

* * *

## **Job Details**

### **Design and Optimize STEM-Based Prompts (Bilingual)**

Create detailed prompts in Portuguese and/or English with multiple constraints and scientifically accurate instructions.

### **Define and Document Evaluation Standards**

Establish high-level expectations for correct responses in STEM contexts, and develop comprehensive rubrics that account for scientific rigor and, when written in Portuguese, linguistic clarity, regionally appropriate terminology (Brazil or Portugal usage), and academic tone.

### **Conduct Model Testing and Grading (Bilingual)**

Run prompts through AI models and assess preliminary outputs against expectations for scientific accuracy, reasoning quality, and clarity. Compare Portuguese and English responses where needed.

### **Support Benchmarking and Quality Assurance**

Collaborate in QA review processes to ensure prompt tasks and rubrics meet scientific standards. Maintain consistency and reliability across Portuguese-language benchmarks prior to integration into official evaluation sets.
* * *

## **Minimum Qualifications**

- Native-level fluency in Portuguese (written), specifically native to Brazil or Portugal
- Strong reading and writing proficiency in English
- BS or BA in Biology / Physics / Chemistry
- Familiarity with undergraduate-level biology / physics / chemistry topics
- Strong writing and critical thinking skills
- Ability to work independently and meet deadlines

* * *

## **Preferred Qualifications**

- 2+ years of experience in teaching, research, or an applied STEM discipline field
- Experience working with academic terminology and STEM educational standards specific to Brazil or Portugal

* * *

## **Application & Onboarding Process**

- Complete a short AI-led interview (approximately 15 minutes)
- If approved, complete a paid assessment focused on writing and rubric creation
- Then, if selected, you will be invited to work on the project.

* * *

## **More Details About This Role**

- Expected commitment: Minimum 20 hours per week
- Project duration: 3–6 months
- Work environment: Structured project setting with defined goals and tools

Remote · 4/2/2026
Apply →
$38 - $43 per hour

The platform is seeking a **Bilingual Italian STEM Expert who is native to Italy or Italian-speaking Switzerland**, with experience in biology / physics / chemistry theory and problem-solving, to develop and refine high-quality reasoning questions that evaluate AI models. Your expertise in biology / physics / chemistry will ensure the scientific accuracy, rigor, and instructional quality of these assessment items. You will create and evaluate Italian/English prompts and responses, ensuring scientific precision while maintaining clear, natural Italian phrasing consistent with academic conventions in Italy or Italian-speaking Switzerland, and alignment with English where needed.

* * *

## **Job Details**

### **Design and Optimize STEM-Based Prompts (Bilingual)**

Create detailed prompts in Italian and/or English with multiple constraints and scientifically accurate instructions.

### **Define and Document Evaluation Standards**

Establish high-level expectations for correct responses in STEM contexts, and develop comprehensive rubrics that account for scientific rigor and, when written in Italian, linguistic clarity, regionally appropriate terminology (Italy or Italian-speaking Switzerland usage), and academic tone.

### **Conduct Model Testing and Grading (Bilingual)**

Run prompts through AI models and assess preliminary outputs against expectations for scientific accuracy, reasoning quality, and clarity. Compare Italian and English responses where needed.

### **Support Benchmarking and Quality Assurance**

Collaborate in QA review processes to ensure prompt tasks and rubrics meet scientific standards. Maintain consistency and reliability across Italian-language benchmarks prior to integration into official evaluation sets.
* * *

## **Minimum Qualifications**

- Native-level fluency in Italian (written), specifically native to Italy or Italian-speaking Switzerland
- Strong reading and writing proficiency in English
- BS or BA in Biology / Physics / Chemistry
- Familiarity with undergraduate-level biology / physics / chemistry topics
- Strong writing and critical thinking skills
- Ability to work independently and meet deadlines

* * *

## **Preferred Qualifications**

- 2+ years of experience in teaching, research, or an applied STEM discipline
- Experience working with Italian or Swiss secondary/university STEM curricula

* * *

## **Application & Onboarding Process**

- Complete a short AI-led interview (approximately 15 minutes)
- If approved, complete a paid assessment focused on writing and rubric creation
- Then, if selected, you will be invited to work on the project.

* * *

## **More Details About This Role**

- Expected commitment: Minimum 20 hours per week
- Project duration: Approximately 3–6 months
- Work environment: Structured project setting with defined goals, tools, and review processes

Remote · 4/2/2026
Apply →
$38 - $53 per hour

The platform is seeking a **Bilingual French STEM Expert who is native to Canada (French-speaking), Belgium (French-speaking), Switzerland (French-speaking), or France**, with experience in biology / physics / chemistry theory and problem-solving, to develop and refine high-quality reasoning questions that evaluate AI models. Your expertise in biology / physics / chemistry will ensure the scientific accuracy, rigor, and instructional quality of these assessment items. You will create and evaluate French/English prompts and responses, ensuring scientific precision while maintaining clear, natural French phrasing consistent with academic conventions in Canada (French-speaking), Belgium (French-speaking), Switzerland (French-speaking), or France, and alignment with English where needed.

* * *

## **Job Details**

### **Design and Optimize STEM-Based Prompts (Bilingual)**

Create detailed prompts in French and/or English with multiple constraints and scientifically accurate instructions.

### **Define and Document Evaluation Standards**

Establish high-level expectations for correct responses in STEM contexts, and develop comprehensive rubrics that account for scientific rigor and, when written in French, linguistic clarity, regionally appropriate terminology (Canada \[French-speaking\], Belgium \[French-speaking\], Switzerland \[French-speaking\], or France usage), and academic tone.

### **Conduct Model Testing and Grading (Bilingual)**

Run prompts through AI models and assess preliminary outputs against expectations for scientific accuracy, reasoning quality, and clarity. Compare French and English responses where needed.

### **Support Benchmarking and Quality Assurance**

Collaborate in QA review processes to ensure prompt tasks and rubrics meet scientific standards. Maintain consistency and reliability across French-language benchmarks prior to integration into official evaluation sets.
* * *

## **Minimum Qualifications**

- Native-level fluency in French (written), specifically native to Canada (French-speaking), Belgium (French-speaking), Switzerland (French-speaking), or France
- Strong reading and writing proficiency in English
- BS or BA in Biology / Physics / Chemistry
- Familiarity with undergraduate-level biology / physics / chemistry topics
- Strong writing and critical thinking skills
- Ability to work independently and meet deadlines

* * *

## **Preferred Qualifications**

- 2+ years of experience in teaching, research, or an applied STEM discipline field
- Experience working with French-language STEM curricula or academic standards in Canada (French-speaking), Belgium (French-speaking), Switzerland (French-speaking), or France

* * *

## **Application & Onboarding Process**

- Complete a short AI-led interview (approximately 15 minutes)
- If approved, complete a paid assessment focused on writing and rubric creation
- Then, if selected, you will be invited to work on the project.

* * *

## **More Details About This Role**

- Expected commitment: Minimum 20 hours per week
- **Expected duration: 3–6 months**
- Work environment: Structured project setting with clear goals and tools

Remote · 4/2/2026
Apply →

The platform is seeking a **Bilingual Spanish STEM Expert who is native to the United States (Spanish-speaking), Spain, Chile, or Mexico**, with experience in biology / physics / chemistry theory and problem-solving, to develop and refine high-quality reasoning questions that evaluate AI models. Your expertise in biology / physics / chemistry will ensure the scientific accuracy, rigor, and instructional quality of these assessment items. You will create and evaluate Spanish/English prompts and responses, ensuring scientific precision while maintaining clear, natural Spanish phrasing consistent with academic conventions in the United States (Spanish-speaking), Spain, Chile, or Mexico, and alignment with English where needed.

* * *

## **Job Details**

### **Design and Optimize STEM-Based Prompts (Bilingual)**

Create detailed prompts in Spanish and/or English with multiple constraints and scientifically accurate instructions.

### **Define and Document Evaluation Standards**

Establish high-level expectations for correct responses in STEM contexts, and develop comprehensive rubrics that account for scientific rigor and, when written in Spanish, linguistic clarity, regionally appropriate terminology (United States \[Spanish-speaking\], Spain, Chile, or Mexico usage), and academic tone.

### **Conduct Model Testing and Grading (Bilingual)**

Run prompts through AI models and assess preliminary outputs against expectations for scientific accuracy, reasoning quality, and clarity. Compare Spanish and English responses where needed.

### **Support Benchmarking and Quality Assurance**

Collaborate in QA review processes to ensure prompt tasks and rubrics meet scientific standards. Maintain consistency and reliability across Spanish-language benchmarks prior to integration into official evaluation sets.
* * *

## **Minimum Qualifications**

- Native-level fluency in Spanish (written), specifically native to the United States (Spanish-speaking), Spain, Chile, or Mexico
- Strong reading and writing proficiency in English
- BS or BA in Biology / Physics / Chemistry
- Familiarity with undergraduate-level biology / physics / chemistry topics
- Strong writing and critical thinking skills
- Ability to work independently and meet deadlines

* * *

## **Preferred Qualifications**

- 2+ years of experience in teaching, research, or an applied STEM discipline field
- Experience working with Spanish-language STEM curricula or academic standards in the United States, Spain, Chile, or Mexico

* * *

## **Application & Onboarding Process**

- Complete a short AI-led interview (approximately 15 minutes)
- If approved, complete a paid assessment focused on writing and rubric creation
- Then, if selected, you will be invited to work on the project.

* * *

## **More Details About This Role**

- Expected commitment: Minimum 20 hours per week
- **Expected duration: Approximately 3–6 months**
- Work environment: Structured project setting with clear goals and tools

Remote · 4/2/2026
Apply →
$38 - $43 per hour

The platform is seeking a **Bilingual German STEM Expert who is native to Switzerland (German-speaking), Germany, or Austria**, with experience in biology / physics / chemistry theory and problem-solving, to develop and refine high-quality reasoning questions that evaluate AI models. Your expertise in biology / physics / chemistry will ensure the scientific accuracy, rigor, and instructional quality of these assessment items. You will create and evaluate German/English prompts and responses, ensuring scientific precision while maintaining clear, natural German phrasing consistent with academic conventions in Switzerland (German-speaking), Germany, or Austria, and alignment with English where needed.

* * *

## **Job Details**

### **Design and Optimize STEM-Based Prompts (Bilingual)**

Create detailed prompts in German and/or English with multiple constraints and scientifically accurate instructions.

### **Define and Document Evaluation Standards**

Establish high-level expectations for correct responses in STEM contexts, and develop comprehensive rubrics that account for scientific rigor and, when written in German, linguistic clarity, regionally appropriate terminology (Switzerland \[German-speaking\], Germany, or Austria usage), and academic tone.

### **Conduct Model Testing and Grading (Bilingual)**

Run prompts through AI models and assess preliminary outputs against expectations for scientific accuracy, reasoning quality, and clarity. Compare German and English responses where needed.

### **Support Benchmarking and Quality Assurance**

Collaborate in QA review processes to ensure prompt tasks and rubrics meet scientific standards. Maintain consistency and reliability across German-language benchmarks prior to integration into official evaluation sets.
* * *

## **Minimum Qualifications**

- Native-level fluency in German (written), specifically native to Switzerland (German-speaking), Germany, or Austria
- Strong reading and writing proficiency in English
- BS or BA in Biology / Physics / Chemistry
- Familiarity with undergraduate-level biology / physics / chemistry topics
- Strong writing and critical thinking skills
- Ability to work independently and meet deadlines

* * *

## **Preferred Qualifications**

- 2+ years of experience in teaching, research, or an applied STEM discipline field
- Experience working with German-language STEM curricula or academic standards in Switzerland, Germany, or Austria

* * *

## **Application & Onboarding Process**

- Complete a short AI-led interview (approximately 15 minutes)
- If approved, complete a paid assessment focused on writing and rubric creation
- Then, if selected, you will be invited to work on the project.

* * *

## **More Details About This Role**

- Expected commitment: Minimum 20 hours per week
- Expected duration: Approximately 3–6 months
- Work environment: Structured project environment with clear goals and tools

Remote · 4/2/2026
Apply →

The platform is seeking **native Traditional Chinese speakers from Hong Kong or Taiwan** with exceptional writing skills to contribute to a high-impact AI research project with a leading lab. Freelancers will author Traditional Chinese/English prompt–golden answer pairs that train and evaluate advanced language models.

**This role is strictly limited to candidates who are native to Hong Kong or Taiwan and have lived in or spent significant time in the country, with deep familiarity with local language usage, tone, and cultural context.**

This is a short-term, flexible opportunity for professionals who combine language mastery, strong critical thinking, and a knack for instructional clarity. Ideal for those who enjoy distilling complex concepts into well-crafted, culturally grounded Traditional Chinese text while maintaining technical precision in English.

* * *

## **Job Details**

### **Multilingual Prompt Design & Optimization:**

Create detailed prompts in Traditional Chinese and/or English with multiple constraints and instructions, ensuring natural phrasing and real-world relevance for Traditional Chinese–speaking users in **Hong Kong and Taiwan** contexts.

### **Define and Document Evaluation Standards:**

Establish high-level expectations for correct responses in Hong Kong and Taiwan consumer contexts, and develop comprehensive rubrics that account for linguistic nuance, tone, and cultural conventions specific to these regions.

### **Model Testing and Grading (Bilingual):**

Run prompts through models and assess preliminary outputs for accuracy, fluency, and cultural fit in Traditional Chinese, comparing results against English where needed.

### **Benchmarking & Quality Assurance:**

Collaborate in QA review processes to ensure prompt tasks and rubrics meet the required rigor, maintaining consistency and reliability across Traditional Chinese–language benchmarks before integration into official evaluations.
* * *

## **Minimum Qualifications**

- **Native-level fluency in Traditional Chinese (written), specific to Hong Kong or Taiwan usage, with strong reading/writing ability in English.**
- Must be **native to Hong Kong or Taiwan and have lived in or spent significant time in-country**, with deep cultural and linguistic familiarity.
- BS or BA from a reputable institution (completed or in progress).
- Strong writing and critical thinking skills.
- Ability to work independently and meet deadlines.
- Significant familiarity with ChatGPT or similar tools for personal decision-making, hobbies, or general interests.
- Based in Hong Kong or Taiwan (or able to reliably produce Hong Kong- or Taiwan-specific, culturally accurate Traditional Chinese).

* * *

## **Preferred Qualifications**

- Experience in teaching, research, editing, or academic writing.
- Experience creating evaluation criteria, rubrics, or grading guidelines.
- Familiarity with LLMs, prompting, or model evaluation (helpful but not required).

* * *

## **Application & Onboarding Process**

- Complete an AI-led interview (about 15 minutes).
- If approved, complete a paid assessment focused on writing and rubric creation
- Then, if selected, you will be invited to work on the project.

* * *

## **More Details About This Role**

- Expect to contribute at least 20 hours per week.
- Expect a commitment of approximately **2–4 months**.
- You’ll be working in a structured project environment with clear goals and tools.

Remote · 4/2/2026
Apply →

The platform is seeking native **Simplified Chinese** speakers with exceptional writing skills to contribute to a high-impact AI research project with a leading lab. Freelancers will author **Simplified Chinese/English** prompt–golden answer pairs that train and evaluate advanced language models.

This is a short-term, flexible opportunity for professionals who combine language mastery, strong critical thinking, and a knack for instructional clarity. Ideal for those who enjoy distilling complex concepts into well-crafted, culturally grounded **Simplified Chinese** text while maintaining technical precision in English.

* * *

## **Job Details**

### **Multilingual Prompt Design & Optimization:**

Create detailed prompts in Simplified Chinese and/or English with multiple constraints and instructions, ensuring natural phrasing and real-world relevance for Simplified Chinese–speaking users.

### **Define and Document Evaluation Standards:**

Establish high-level expectations for correct responses in Simplified Chinese consumer contexts, and develop comprehensive rubrics that account for linguistic nuance, tone, and cultural conventions.

### **Model Testing and Grading (Bilingual):**

Run prompts through models and assess preliminary outputs for accuracy, fluency, and cultural fit in Simplified Chinese, comparing results against English where needed.

### **Benchmarking & Quality Assurance:**

Collaborate in QA review processes to ensure prompt tasks and rubrics meet the required rigor, maintaining consistency and reliability across Simplified Chinese–language benchmarks before integration into official evaluations.

* * *

## **Minimum Qualifications**

- Native-level fluency in Simplified Chinese (written) with strong reading/writing ability in English.
- BS or BA from a reputable institution (completed or in progress).
- Strong writing and critical thinking skills.
- Ability to work independently and meet deadlines.
- Significant familiarity with ChatGPT or similar tools for personal decision-making, hobbies, or general interests.
- Able to reliably produce **China-specific**, culturally accurate Simplified Chinese.

* * *

## **Preferred Qualifications**

- Experience in teaching, research, editing, or academic writing.
- Experience creating evaluation criteria, rubrics, or grading guidelines.
- Familiarity with LLMs, prompting, or model evaluation (helpful but not required).

* * *

## **Application & Onboarding Process**

- Complete an AI-led interview (about 15 minutes).
- If approved, complete a paid assessment focused on writing and rubric creation
- Then, if selected, you will be invited to work on the project.

* * *

## **More Details About This Role**

- Expect to contribute at least 20 hours per week.
- Expect a commitment of around 2+ months.
- You’ll be working in a structured project environment with clear goals and tools.
- We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.

Remote · 4/2/2026
Apply →

The platform is seeking native **Korean** speakers with exceptional writing skills to contribute to a high-impact AI research project with a leading lab. Freelancers will author **Korean/English** prompt–golden answer pairs that train and evaluate advanced language models.

This is a short-term, flexible opportunity for professionals who combine language mastery, strong critical thinking, and a knack for instructional clarity. Ideal for those who enjoy distilling complex concepts into well-crafted, culturally grounded **Korean** text while maintaining technical precision in English.

* * *

## **Job Details**

### **Multilingual Prompt Design & Optimization:**

Create detailed prompts in Korean and/or English with multiple constraints and instructions, ensuring natural phrasing and real-world relevance for Korean-speaking users.

### **Define and Document Evaluation Standards:**

Establish high-level expectations for correct responses in Korean consumer contexts, and develop comprehensive rubrics that account for linguistic nuance, tone, and cultural conventions.

### **Model Testing and Grading (Bilingual):**

Run prompts through models and assess preliminary outputs for accuracy, fluency, and cultural fit in Korean, comparing results against English where needed.

### **Benchmarking & Quality Assurance:**

Collaborate in QA review processes to ensure prompt tasks and rubrics meet the required rigor, maintaining consistency and reliability across Korean-language benchmarks before integration into official evaluations.

* * *

## **Minimum Qualifications**

- Native-level fluency in Korean (written) with strong reading/writing ability in English.
- BS or BA from a reputable institution (completed or in progress).
- Strong writing and critical thinking skills.
- Ability to work independently and meet deadlines.
- Significant familiarity with ChatGPT or similar tools for personal decision-making, hobbies, or general interests.
- Based in **South Korea** (or able to reliably produce **South Korea-specific**, culturally accurate Korean).

* * *

## **Preferred Qualifications**

- Experience in teaching, research, editing, or academic writing.
- Experience creating evaluation criteria, rubrics, or grading guidelines.
- Familiarity with LLMs, prompting, or model evaluation (helpful but not required).

* * *

## **Application & Onboarding Process**

- Complete an AI-led interview (about 15 minutes).
- If approved, complete a paid assessment focused on writing and rubric creation
- Then, if selected, you will be invited to work on the project.

* * *

## **More Details About This Role**

- Expect to contribute at least 20 hours per week.
- Expect a commitment of around 2+ months.
- You’ll be working in a structured project environment with clear goals and tools.
- We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.

Remote · 4/2/2026
Apply →

The platform is seeking native **Japanese** speakers with exceptional writing skills to contribute to a high-impact AI research project with a leading lab. Freelancers will author **Japanese/English** prompt–golden answer pairs that train and evaluate advanced language models.

This is a short-term, flexible opportunity for professionals who combine language mastery, strong critical thinking, and a knack for instructional clarity. Ideal for those who enjoy distilling complex concepts into well-crafted, culturally grounded **Japanese** text while maintaining technical precision in English.

* * *

## **Job Details**

### **Multilingual Prompt Design & Optimization:**

Create detailed prompts in Japanese and/or English with multiple constraints and instructions, ensuring natural phrasing and real-world relevance for Japanese-speaking users.

### **Define and Document Evaluation Standards:**

Establish high-level expectations for correct responses in Japanese consumer contexts, and develop comprehensive rubrics that account for linguistic nuance, tone, and cultural conventions.

### **Model Testing and Grading (Bilingual):**

Run prompts through models and assess preliminary outputs for accuracy, fluency, and cultural fit in Japanese, comparing results against English where needed.

### **Benchmarking & Quality Assurance:**

Collaborate in QA review processes to ensure prompt tasks and rubrics meet the required rigor, maintaining consistency and reliability across Japanese-language benchmarks before integration into official evaluations.

* * *

## **Minimum Qualifications**

- Native-level fluency in Japanese (written) with strong reading/writing ability in English.
- BS or BA from a reputable institution (completed or in progress).
- Strong writing and critical thinking skills.
- Ability to work independently and meet deadlines.
- Significant familiarity with ChatGPT or similar tools for personal decision-making, hobbies, or general interests.
- Based in **Japan** (or able to reliably produce **Japan-specific**, culturally accurate Japanese).

* * *

## **Preferred Qualifications**

- Experience in teaching, research, editing, or academic writing.
- Experience creating evaluation criteria, rubrics, or grading guidelines.
- Familiarity with LLMs, prompting, or model evaluation (helpful but not required).

* * *

## **Application & Onboarding Process**

- Complete an AI-led interview (about 15 minutes).
- If approved, complete a paid assessment focused on writing and rubric creation
- Then, if selected, you will be invited to work on the project.

* * *

## **More Details About This Role**

- Expect to contribute at least 20 hours per week.
- Expect a commitment of around 2+ months.
- You’ll be working in a structured project environment with clear goals and tools.
- We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.

Remote · 4/2/2026
Apply →

The platform is seeking **native Portuguese speakers from Portugal or Brazil** with exceptional writing skills to contribute to a high-impact AI research project with a leading lab. Freelancers will author Portuguese/English prompt–golden answer pairs that train and evaluate advanced language models.

**This role is strictly limited to candidates who are native to Portugal or Brazil and have lived in or spent significant time in the country, with deep familiarity with local language usage, tone, and cultural context.**

This is a short-term, flexible opportunity for professionals who combine language mastery, strong critical thinking, and a knack for instructional clarity. Ideal for those who enjoy distilling complex concepts into well-crafted, culturally grounded Portuguese text while maintaining technical precision in English.

* * *

## **Job Details**

### **Multilingual Prompt Design & Optimization:**

Create detailed prompts in Portuguese and/or English with multiple constraints and instructions, ensuring natural phrasing and real-world relevance for Portuguese-speaking users in **Portugal and Brazil** contexts.

### **Define and Document Evaluation Standards:**

Establish high-level expectations for correct responses in Portugal and Brazil consumer contexts, and develop comprehensive rubrics that account for linguistic nuance, tone, and cultural conventions specific to these regions.

### **Model Testing and Grading (Bilingual):**

Run prompts through models and assess preliminary outputs for accuracy, fluency, and cultural fit in Portuguese, comparing results against English where needed.

### **Benchmarking & Quality Assurance:**

Collaborate in QA review processes to ensure prompt tasks and rubrics meet the required rigor, maintaining consistency and reliability across Portuguese-language benchmarks before integration into official evaluations.
* * *

## **Minimum Qualifications**

- **Native-level fluency in Portuguese (written), specific to Portugal or Brazil usage, with strong reading/writing ability in English.**
- Must be **native to Portugal or Brazil and have lived in or spent significant time in-country**, with deep cultural and linguistic familiarity.
- BS or BA from a reputable institution (completed or in progress).
- Strong writing and critical thinking skills.
- Ability to work independently and meet deadlines.
- Significant familiarity with ChatGPT or similar tools for personal decision-making, hobbies, or general interests.
- Based in Portugal or Brazil (or able to reliably produce Portugal- or Brazil-specific, culturally accurate Portuguese).

* * *

## **Preferred Qualifications**

- Experience in teaching, research, editing, or academic writing.
- Experience creating evaluation criteria, rubrics, or grading guidelines.
- Familiarity with LLMs, prompting, or model evaluation (helpful but not required).

* * *

## **Application & Onboarding Process**

- Complete an AI-led interview (about 15 minutes).
- If approved, complete a paid assessment focused on writing and rubric creation
- Then, if selected, you will be invited to work on the project.

* * *

## **More Details About This Role**

- Expect to contribute at least 20 hours per week.
- Expect a commitment of approximately **2–4 months**.
- You’ll be working in a structured project environment with clear goals and tools.

Remote · 4/2/2026
Apply →

The platform is seeking **native Italian speakers from Switzerland or Italy** with exceptional writing skills to contribute to a high-impact AI research project with a leading lab. Freelancers will author Italian/English prompt–golden answer pairs that train and evaluate advanced language models.

**This role is strictly limited to candidates who are native to Switzerland or Italy and have lived in or spent significant time in the country, with deep familiarity with local language usage, tone, and cultural context (including regional distinctions such as Swiss Italian and Italy-specific conventions).**

This is a short-term, flexible opportunity for professionals who combine language mastery, strong critical thinking, and a knack for instructional clarity. Ideal for those who enjoy distilling complex concepts into well-crafted, culturally grounded Italian text while maintaining technical precision in English.

* * *

## **Job Details**

### **Multilingual Prompt Design & Optimization:**

Create detailed prompts in Italian and/or English with multiple constraints and instructions, ensuring natural phrasing and real-world relevance for Italian-speaking users in **Switzerland and Italy** contexts.

### **Define and Document Evaluation Standards:**

Establish high-level expectations for correct responses in Switzerland and Italy consumer contexts, and develop comprehensive rubrics that account for linguistic nuance, tone, and cultural conventions specific to these regions.

### **Model Testing and Grading (Bilingual):**

Run prompts through models and assess preliminary outputs for accuracy, fluency, and cultural fit in Italian, comparing results against English where needed.

### **Benchmarking & Quality Assurance:**

Collaborate in QA review processes to ensure prompt tasks and rubrics meet the required rigor, maintaining consistency and reliability across Italian-language benchmarks before integration into official evaluations.
* * * ## **Minimum Qualifications** - **Native-level fluency in Italian (written), specific to Switzerland or Italy usage, with strong reading/writing ability in English.** - Must be **native to Switzerland or Italy and have lived in or spent significant time in-country**, with deep cultural and linguistic familiarity. - BS or BA from a reputable institution (completed or in progress). - Strong writing and critical thinking skills. - Ability to work independently and meet deadlines. - Significant familiarity with ChatGPT or similar tools for personal decision-making, hobbies, or general interests. - Based in Switzerland or Italy (or able to reliably produce Switzerland- or Italy-specific, culturally accurate Italian). * * * ## **Preferred Qualifications** - Experience in teaching, research, editing, or academic writing. - Experience creating evaluation criteria, rubrics, or grading guidelines. - Familiarity with LLMs, prompting, or model evaluation (helpful but not required). * * * ## **Application & Onboarding Process** - Complete an AI-led interview (about 15 minutes). - If approved, complete a paid assessment focused on writing and rubric creation - Then, if selected, you will be invited to work on the project. * * * ## **More Details About This Role** - Expect to contribute at least 20 hours per week. - Expect a commitment of approximately **2โ€“4 months**. - Youโ€™ll be working in a structured project environment with clear goals and tools. - We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.

Remote · 4/2/2026
Apply →

The platform is seeking **native French speakers from Canada, Belgium, Switzerland, or France** with exceptional writing skills to contribute to a high-impact AI research project with a leading lab. Freelancers will author French/English prompt–golden answer pairs that train and evaluate advanced language models.

**This role is strictly limited to candidates who are native to Canada, Belgium, Switzerland, or France and have lived in or spent significant time in the country, with deep familiarity with local language usage, tone, and cultural context (including regional distinctions such as Canadian French, Belgian French, Swiss French, and France-specific conventions).**

This is a short-term, flexible opportunity for professionals who combine language mastery, strong critical thinking, and a knack for instructional clarity. Ideal for those who enjoy distilling complex concepts into well-crafted, culturally grounded French text while maintaining technical precision in English.

* * *

## **Job Details**

### **Multilingual Prompt Design & Optimization:**

Create detailed prompts in French and/or English with multiple constraints and instructions, ensuring natural phrasing and real-world relevance for French-speaking users in **Canada, Belgium, Switzerland, and France** contexts.

### **Define and Document Evaluation Standards:**

Establish high-level expectations for correct responses in Canada, Belgium, Switzerland, and France consumer contexts, and develop comprehensive rubrics that account for linguistic nuance, tone, and cultural conventions specific to these regions.

### **Model Testing and Grading (Bilingual):**

Run prompts through models and assess preliminary outputs for accuracy, fluency, and cultural fit in French, comparing results against English where needed.

### **Benchmarking & Quality Assurance:**

Collaborate in QA review processes to ensure prompt tasks and rubrics meet the required rigor, maintaining consistency and reliability across French-language benchmarks before integration into official evaluations.

* * *

## **Minimum Qualifications**

- **Native-level fluency in French (written), specific to Canada, Belgium, Switzerland, or France usage, with strong reading/writing ability in English.**
- Must be **native to Canada, Belgium, Switzerland, or France and have lived in or spent significant time in-country**, with deep cultural and linguistic familiarity.
- BS or BA from a reputable institution (completed or in progress).
- Strong writing and critical thinking skills.
- Ability to work independently and meet deadlines.
- Significant familiarity with ChatGPT or similar tools for personal decision-making, hobbies, or general interests.
- Based in Canada, Belgium, Switzerland, or France (or able to reliably produce country-specific, culturally accurate French aligned with one of these regions).

* * *

## **Preferred Qualifications**

- Experience in teaching, research, editing, or academic writing.
- Experience creating evaluation criteria, rubrics, or grading guidelines.
- Familiarity with LLMs, prompting, or model evaluation (helpful but not required).

* * *

## **Application & Onboarding Process**

- Complete an AI-led interview (about 15 minutes).
- If approved, complete a paid assessment focused on writing and rubric creation.
- Then, if selected, you will be invited to work on the project.

* * *

## **More Details About This Role**

- Expect to contribute at least 20 hours per week.
- Expect a commitment of approximately **2–4 months**.
- You’ll be working in a structured project environment with clear goals and tools.
- We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.

Remote · 4/2/2026
Apply →

The platform is seeking **native Spanish speakers from the United States, Spain, Chile, or Mexico** with exceptional writing skills to contribute to a high-impact AI research project with a leading lab. Freelancers will author Spanish/English prompt–golden answer pairs that train and evaluate advanced language models.

**This role is strictly limited to candidates who are native to the United States, Spain, Chile, or Mexico and have lived in or spent significant time in the country, with deep familiarity with local language usage, tone, and cultural context (including distinctions such as U.S. Spanish, Peninsular Spanish, Chilean Spanish, and Mexican Spanish conventions).**

This is a short-term, flexible opportunity for professionals who combine language mastery, strong critical thinking, and a knack for instructional clarity. Ideal for those who enjoy distilling complex concepts into well-crafted, culturally grounded Spanish text while maintaining technical precision in English.

* * *

## **Job Details**

### **Multilingual Prompt Design & Optimization:**

Create detailed prompts in Spanish and/or English with multiple constraints and instructions, ensuring natural phrasing and real-world relevance for Spanish-speaking users in **the United States, Spain, Chile, and Mexico** contexts.

### **Define and Document Evaluation Standards:**

Establish high-level expectations for correct responses in United States, Spain, Chile, and Mexico consumer contexts, and develop comprehensive rubrics that account for linguistic nuance, tone, and cultural conventions specific to these regions.

### **Model Testing and Grading (Bilingual):**

Run prompts through models and assess preliminary outputs for accuracy, fluency, and cultural fit in Spanish, comparing results against English where needed.

### **Benchmarking & Quality Assurance:**

Collaborate in QA review processes to ensure prompt tasks and rubrics meet the required rigor, maintaining consistency and reliability across Spanish-language benchmarks before integration into official evaluations.

* * *

## **Minimum Qualifications**

- **Native-level fluency in Spanish (written), specific to United States, Spain, Chile, or Mexico usage, with strong reading/writing ability in English.**
- Must be **native to the United States, Spain, Chile, or Mexico and have lived in or spent significant time in-country**, with deep cultural and linguistic familiarity.
- BS or BA from a reputable institution (completed or in progress).
- Strong writing and critical thinking skills.
- Ability to work independently and meet deadlines.
- Significant familiarity with ChatGPT or similar tools for personal decision-making, hobbies, or general interests.
- Based in the United States, Spain, Chile, or Mexico (or able to reliably produce country-specific, culturally accurate Spanish aligned with one of these regions).

* * *

## **Preferred Qualifications**

- Experience in teaching, research, editing, or academic writing.
- Experience creating evaluation criteria, rubrics, or grading guidelines.
- Familiarity with LLMs, prompting, or model evaluation (helpful but not required).

* * *

## **Application & Onboarding Process**

- Complete an AI-led interview (about 15 minutes).
- If approved, complete a paid assessment focused on writing and rubric creation.
- Then, if selected, you will be invited to work on the project.

* * *

## **More Details About This Role**

- Expect to contribute at least 20 hours per week.
- Expect a commitment of approximately **2–4 months**.
- You’ll be working in a structured project environment with clear goals and tools.
- We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.

Remote · 4/2/2026
Apply →

The platform is seeking **native German speakers from Austria, Switzerland, or Germany** with exceptional writing skills to contribute to a high-impact AI research project with a leading lab. Freelancers will author German/English prompt–golden answer pairs that train and evaluate advanced language models.

**This role is strictly limited to candidates who are native to Austria, Switzerland, or Germany and have lived in or spent significant time in the country, with deep familiarity with local language usage, tone, and cultural context (including distinctions such as Austrian German, Swiss Standard German, and Germany-specific conventions).**

This is a short-term, flexible opportunity for professionals who combine language mastery, strong critical thinking, and a knack for instructional clarity. Ideal for those who enjoy distilling complex concepts into well-crafted, culturally grounded German text while maintaining technical precision in English.

* * *

## **Job Details**

### **Multilingual Prompt Design & Optimization:**

Create detailed prompts in German and/or English with multiple constraints and instructions, ensuring natural phrasing and real-world relevance for German-speaking users in **Austria, Switzerland, and Germany** contexts.

### **Define and Document Evaluation Standards:**

Establish high-level expectations for correct responses in Austria, Switzerland, and Germany consumer contexts, and develop comprehensive rubrics that account for linguistic nuance, tone, and cultural conventions specific to these regions.

### **Model Testing and Grading (Bilingual):**

Run prompts through models and assess preliminary outputs for accuracy, fluency, and cultural fit in German, comparing results against English where needed.

### **Benchmarking & Quality Assurance:**

Collaborate in QA review processes to ensure prompt tasks and rubrics meet the required rigor, maintaining consistency and reliability across German-language benchmarks before integration into official evaluations.

* * *

## **Minimum Qualifications**

- **Native-level fluency in German (written), specific to Austria, Switzerland, or Germany usage, with strong reading/writing ability in English.**
- Must be **native to Austria, Switzerland, or Germany and have lived in or spent significant time in-country**, with deep cultural and linguistic familiarity.
- BS or BA from a reputable institution (completed or in progress).
- Strong writing and critical thinking skills.
- Ability to work independently and meet deadlines.
- Significant familiarity with ChatGPT or similar tools for personal decision-making, hobbies, or general interests.
- Based in Austria, Switzerland, or Germany (or able to reliably produce country-specific, culturally accurate German aligned with one of these regions).

* * *

## **Preferred Qualifications**

- Experience in teaching, research, editing, or academic writing.
- Experience creating evaluation criteria, rubrics, or grading guidelines.
- Familiarity with LLMs, prompting, or model evaluation (helpful but not required).

* * *

## **Application & Onboarding Process**

- Complete an AI-led interview (about 15 minutes).
- If approved, complete a paid assessment focused on writing and rubric creation.
- Then, if selected, you will be invited to work on the project.

* * *

## **More Details About This Role**

- Expect to contribute at least 20 hours per week.
- Expect a commitment of approximately **2–4 months**.
- You’ll be working in a structured project environment with clear goals and tools.
- We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.

Remote · 4/2/2026
Apply →

The platform is seeking a **Cantonese Audio Generalist Evaluator Expert** to contribute to a high-impact audio AI research project with a leading research lab. In this role, you will work on transcription, annotation, and evaluation tasks that help train and benchmark advanced language models.

This is a short-term, structured engagement ideal for candidates with strong academic or analytical backgrounds who are fluent in **Cantonese and English** and enjoy translating complex audio and visual information into precise, well-structured text.

* * *

## **Job Responsibilities**

### **Transcribe and Optimize Audio & Video**

- Listen to, analyze, and transcribe audio and video content in **Cantonese**, following detailed constraints and instructions.
- Produce high-quality written outputs, with supporting work in **English when required**.
- Ensure clarity, accuracy, and strict adherence to formatting and stylistic guidelines.

### **Define and Document Evaluation Standards**

- Establish clear expectations for correct and high-quality responses in general consumer audio contexts.
- Develop detailed evaluation rubrics and grading guidelines in **Cantonese and English**.
- Document standards to ensure consistency across reviewers and model evaluations.

### **Conduct Model Testing and Grading**

- Run prompts through language models and assess generated outputs.
- Evaluate responses against predefined criteria for **accuracy, completeness, and instructional clarity**.

### **Support Benchmarking and Quality Assurance**

- Participate in QA and review cycles to ensure tasks, rubrics, and outputs meet the platform’s quality bar.
- Maintain consistency and reliability before datasets are integrated into official benchmarks.
- Collaborate with project leads to resolve ambiguities and improve task design.

* * *

## **Minimum Qualifications**

- Strong writing, editing, and critical thinking skills.
- Ability to work independently, manage time effectively, and meet deadlines.
- Fluency in **Cantonese and English** (spoken and written).
- Available to commit **10–20 hours per week**.

* * *

## **Preferred Qualifications**

- College students or recent graduates.
- Background in linguistics, humanities, social sciences, or technical fields.
- Prior experience with transcription, annotation, evaluation, or research workflows.
- Interest in AI, language models, or applied research environments.

* * *

## **Application & Onboarding Process**

- Complete a short AI-led interview (approximately 15 minutes).
- If selected, you will be onboarded and invited to begin project work.

* * *

## **Additional Role Details**

- Work in a structured, goal-oriented project environment with clear tooling, guidelines, and support.
- Gain hands-on exposure to real-world AI research and evaluation workflows.

Remote · 4/2/2026
Apply →

The platform is seeking a **Korean Audio Generalist Evaluator Expert** to contribute to a high-impact audio AI research project with a leading research lab. In this role, you will work on **transcription, annotation, and evaluation tasks** that help train and benchmark advanced language models.

This is a **short-term, structured engagement** ideal for candidates with strong academic or analytical backgrounds who are fluent in **Korean and English** and enjoy translating complex audio and visual information into precise, well-structured text.

* * *

## **Job Responsibilities**

### **Transcribe and Optimize Audio & Video**

- Listen to, analyze, and transcribe audio and video content in **Korean**, following detailed constraints and instructions.
- Produce high-quality written outputs, with supporting work in **English when required**.
- Ensure clarity, accuracy, and strict adherence to formatting and stylistic guidelines.

### **Define and Document Evaluation Standards**

- Establish clear expectations for correct and high-quality responses in general consumer audio contexts.
- Develop detailed **evaluation rubrics and grading guidelines** in Korean and English.
- Document standards to ensure consistency across reviewers and model evaluations.

### **Conduct Model Testing and Grading**

- Run prompts through language models and assess generated outputs.
- Evaluate responses against predefined criteria for **accuracy, completeness, and instructional clarity**.

### **Support Benchmarking and Quality Assurance**

- Participate in QA and review cycles to ensure tasks, rubrics, and outputs meet **the platform’s quality bar**.
- Maintain consistency and reliability before datasets are integrated into official benchmarks.
- Collaborate with project leads to resolve ambiguities and improve task design.

* * *

## **Minimum Qualifications**

- Strong writing, editing, and critical thinking skills.
- Ability to work independently, manage time effectively, and meet deadlines.
- **Fluency in Korean and English** (spoken and written).
- Available to commit **10–20 hours per week**.

* * *

## **Preferred Qualifications**

- College students or recent graduates.
- Background in linguistics, humanities, social sciences, or technical fields.
- Prior experience with **transcription, annotation, evaluation, or research workflows**.
- Interest in AI, language models, or applied research environments.

* * *

## **Application & Onboarding Process**

- Complete a short AI-led interview (approximately **15 minutes**).
- If selected, you will be onboarded and invited to begin project work.

* * *

## **Additional Role Details**

- Work in a **structured, goal-oriented project environment** with clear tooling, guidelines, and support.
- Gain hands-on exposure to **real-world AI research and evaluation workflows**.

Remote · 4/2/2026
Apply →

The platform is seeking a **Tagalog Audio Generalist Evaluator Expert** to contribute to a high-impact audio AI research project with a leading research lab. In this role, you will work on transcription, annotation, and evaluation tasks that help train and benchmark advanced language models.

This is a **short-term, structured engagement** ideal for candidates with strong academic or analytical backgrounds who are fluent in **Tagalog and English** and enjoy translating complex audio and visual information into precise, well-structured text.

### Job Responsibilities

**Transcribe and Optimize Audio & Video**

- Listen to, analyze, and transcribe audio and video content in **Tagalog**, following detailed constraints and instructions.
- Produce high-quality written outputs with supporting work in **English** when required.
- Ensure clarity, accuracy, and adherence to formatting and stylistic guidelines.

**Define and Document Evaluation Standards**

- Establish clear expectations for correct and high-quality responses in general consumer audio contexts.
- Develop detailed evaluation rubrics and grading guidelines in **Tagalog and English**.
- Document standards to ensure consistency across reviewers and model evaluations.

**Conduct Model Testing and Grading**

- Run prompts through language models and assess generated outputs.
- Evaluate responses against predefined criteria for accuracy, completeness, and instructional clarity.

**Support Benchmarking and Quality Assurance**

- Participate in QA and review cycles to ensure tasks, rubrics, and outputs meet the platform’s quality bar.
- Maintain consistency and reliability before datasets are integrated into official benchmarks.
- Collaborate with project leads to resolve ambiguities and improve task design.

* * *

### Minimum Qualifications

- Strong writing, editing, and critical thinking skills.
- Ability to work independently, manage time effectively, and meet deadlines.
- **Fluency in Tagalog and English** (spoken and written).

* * *

### Preferred Qualifications

- College students or recent graduates.
- Background in linguistics, humanities, social sciences, or technical fields.
- Prior experience with transcription, annotation, evaluation, or research workflows.
- Interest in AI, language models, or applied research environments.

* * *

### Application & Onboarding Process

- Complete a short **AI-led interview** (approximately 15 minutes).
- If selected, you will be onboarded and invited to begin project work.

* * *

### Additional Role Details

- You will work in a structured, goal-oriented project environment with clear tooling, guidelines, and support.
- This role provides hands-on exposure to real-world AI research and evaluation workflows.

Remote · 4/2/2026
Apply →

The platform is seeking a **Thai Audio Generalist Evaluator Expert** to contribute to a high-impact audio AI research project with a leading research lab. In this role, you will work on transcription, annotation, and evaluation tasks that help train and benchmark advanced language models.

This is a **short-term, structured engagement** ideal for candidates with strong academic or analytical backgrounds who are fluent in **Thai and English** and enjoy translating complex audio and visual information into precise, well-structured text.

* * *

### Job Responsibilities

**Transcribe and Optimize Audio & Video**

- Listen to, analyze, and transcribe audio and video content in **Thai**, following detailed constraints and instructions.
- Produce high-quality written outputs with supporting work in **English** when required.
- Ensure clarity, accuracy, and adherence to formatting and stylistic guidelines.

**Define and Document Evaluation Standards**

- Establish clear expectations for correct and high-quality responses in general consumer audio contexts.
- Develop detailed evaluation rubrics and grading guidelines in **Thai and English**.
- Document standards to ensure consistency across reviewers and model evaluations.

**Conduct Model Testing and Grading**

- Run prompts through language models and assess generated outputs.
- Evaluate responses against predefined criteria for accuracy, completeness, and instructional clarity.

**Support Benchmarking and Quality Assurance**

- Participate in QA and review cycles to ensure tasks, rubrics, and outputs meet the platform’s quality bar.
- Maintain consistency and reliability before datasets are integrated into official benchmarks.
- Collaborate with project leads to resolve ambiguities and improve task design.

* * *

### Minimum Qualifications

- Strong writing, editing, and critical thinking skills.
- Ability to work independently, manage time effectively, and meet deadlines.
- **Fluency in Thai and English** (spoken and written).
- Available to commit **10–20 hours per week**.

* * *

### Preferred Qualifications

- College students or recent graduates.
- Background in linguistics, humanities, social sciences, or technical fields.
- Prior experience with transcription, annotation, evaluation, or research workflows.
- Interest in AI, language models, or applied research environments.

* * *

### Application & Onboarding Process

- Complete a short **AI-led interview** (approximately 15 minutes).
- If selected, you will be onboarded and invited to begin project work.

* * *

### Additional Role Details

- You will work in a structured, goal-oriented project environment with clear tooling, guidelines, and support.
- This role provides hands-on exposure to real-world AI research and evaluation workflows.

Remote · 4/2/2026
Apply →

**The platform is hiring a SOC Investigation Specialist** on behalf of high-growth technology and enterprise partners building next-generation SOC automation and AI-driven investigation systems. This role is ideal for experienced SOC analysts who can apply real-world investigative judgment to review, validate, and construct high-quality security investigations across SIEM, endpoint, cloud, and identity environments.

* * *

### Responsibilities

- Review, monitor, and evaluate SOC alerts and investigation outputs based on predefined scenarios and criteria.
- Distinguish true positives from false positives by validating investigative evidence and alert context.
- Perform end-to-end security investigations when required, including log analysis, entity pivoting, timeline reconstruction, and evidence correlation.
- Assess the correctness, completeness, and quality of SOC investigations produced by automated or human workflows.
- Apply consistent investigative judgment while recognizing that multiple valid investigation paths may exist for the same alert.
- Make clear binary determinations (e.g., ACCEPT / PASS) while also producing detailed ground-truth investigations when required.
- Use Splunk extensively to pivot across logs, entities, and timelines, including reading and reasoning about SPL queries.
- Maintain clear and accurate documentation of investigative steps, assumptions, evidence, and conclusions.
- Collaborate with program leads and other expert annotators to uphold high-quality investigation and annotation standards.
- Mentor or support other analysts where applicable, particularly in long-term or lead annotator roles.

* * *

### Requirements

- 3+ years of hands-on experience as a SOC analyst in a production SOC environment (Tier 2 or above strongly preferred).
- Strong understanding of alert triage, incident investigation workflows, and evidence-based decision-making under time constraints.
- Mandatory hands-on experience with **Splunk**, including:
  - Conducting investigations using Splunk
  - Reading, understanding, and reasoning about SPL queries
  - Pivoting between logs, entities, and timelines
- Proven ability to evaluate SOC investigations and determine whether conclusions are valid, incomplete, or incorrect.
- Strong investigative judgment and comfort making decisive evaluations.
- Fluent English (written and spoken) with strong documentation and communication skills.

* * *

### Nice to Have

- Experience with Endpoint Detection & Response (EDR) tools such as CrowdStrike Falcon, Microsoft Defender for Endpoint, or SentinelOne.
- Experience analyzing cloud security logs and signals:
  - AWS (CloudTrail, GuardDuty)
  - Azure (Activity Log, Defender for Cloud)
  - GCP (Cloud Audit Logs)
- Familiarity with Identity & Access Management platforms such as Okta Identity Cloud or Microsoft Entra ID (Azure AD).
- Experience with email security tools like Proofpoint or Mimecast.
- SOC leadership or mentoring experience.
- Basic scripting experience (Python or similar).
- Security certifications (optional): GCIA, GCIH, GCED, Splunk certifications, Security+, CCNA, or cloud security certifications.

* * *

### Why Join

- Work on cutting-edge SOC automation and AI-driven investigation systems.
- Apply real-world SOC expertise to shape how future security teams investigate and respond to threats.
- Take ownership of high-impact investigative evaluations and ground-truth security cases.
- Collaborate with experienced SOC practitioners, security engineers, and AI teams.
- Join the platform’s global network of vetted security professionals.

Remote · 4/2/2026
Apply →
$40 - $63 per hour

**Rust SWEs (Junior Onwards)**

Your work will focus on configuring development environments, resolving dependency issues, and ensuring tests pass across various codebases.

# Job

We are hiring software engineers to assist a leading AI research lab with environment setup and dependency management for open-source projects in Rust.

## You bring

- Expertise in Rust, ideally at least 2 years of industry experience.
- Familiarity with setting up development environments.
- High attention to detail as well as exceptional written and verbal communication skills.

## Bonus

- Strong industry experience.

# More details

Remote · 4/2/2026
Apply →

The platform is seeking **detail-oriented transcribing and writing experts** to contribute to a high-impact **audio AI research project** with a leading lab. Freelancers will listen to and transcribe audio, annotate images, and evaluate videos to help train advanced language models.

This is a **short-term, flexible opportunity** for professionals with strong academic backgrounds, **fluent in both Dutch and English**, and with a knack for instructional clarity. Ideal for those who enjoy distilling complex concepts into well-crafted text.

## Job Details

### Transcribe and Optimise Audio/Video

- Create detailed audio and video transcriptions with multiple constraints and instructions in **Dutch**, with supporting work in **English** when required.

### Define and Document Evaluation Standards

- Establish high-level expectations for correct responses in general consumer audio contexts.
- Develop comprehensive evaluation rubrics and guidelines in Dutch and English.

### Conduct Model Testing and Grading

- Run prompts through models and assess preliminary outputs against defined expectations.

### Support Benchmarking and Quality Assurance

- Collaborate in QA review processes to ensure prompt tasks and rubrics meet high standards of rigour.
- Maintain consistency and reliability before integration into official benchmarks.

* * *

## Minimum Qualifications

- Strong writing and critical thinking skills
- Ability to work independently and meet deadlines
- **Fluent in Dutch and English**
- **Able to work within GMT or PST working hours**

* * *

## Preferred Qualifications

- College students or graduates

* * *

## Application & Onboarding Process

- Complete an AI-led interview (approximately 15 minutes)
- If selected, you will be invited to work on the project

* * *

## More Details About This Role

- Expected commitment of **3–6 months**
- You’ll work in a structured project environment with clear goals and tools

Remote · 4/2/2026
Apply →

**About the Role**

The platform is seeking experienced **Sales Representatives, Wholesale and Manufacturing, Except Technical and Scientific Products** to support a leading AI lab in advancing research and infrastructure for next-generation machine learning systems. This engagement focuses on diagnosing and solving real issues in your domain. It's an opportunity to contribute your expertise to cutting-edge AI research while working independently and remotely on your own schedule.

**Key Responsibilities**

- You’ll be asked to create tasks and deliverables regarding common requests within your professional domain

**Ideal Qualifications**

- 4+ years of professional experience in your respective field
- Excellent written communication with strong grammar and spelling skills

**More About the Opportunity**

- Expected workload: ~15 hours per week, with flexibility to scale up to 30+ hours
- Project start date: immediately, lasting for around 3-4 weeks

We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.

**Earn $200 by referring**

Share the referral link below, and earn $200 for each successful referral through this unique link. There's no limit on how many people you can refer. Restrictions may apply. [Learn more](https://talent.docs.the platform.com/policies/referrals)

Remote · 4/2/2026
Apply →

Lawyers

legal
$90 - $150 per hour

**About the Role**

The platform is seeking experienced **Lawyers** to support a leading AI lab in advancing research and infrastructure for next-generation machine learning systems. This engagement focuses on diagnosing and solving real issues in your domain. It's an opportunity to contribute your expertise to cutting-edge AI research while working independently and remotely on your own schedule.

**Key Responsibilities**

- You’ll be asked to create tasks and deliverables regarding common requests within your professional domain

**Ideal Qualifications**

- 4+ years of professional experience in your respective field
- Excellent written communication with strong grammar and spelling skills

**More About the Opportunity**

- Expected workload: ~15 hours per week, with flexibility to scale up to 30+ hours
- Project start date: immediately, lasting for around 3-4 weeks

We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.

**Earn $200 by referring**

Share the referral link below, and earn $200 for each successful referral through this unique link. There's no limit on how many people you can refer. Restrictions may apply. [Learn more](https://talent.docs.the platform.com/policies/referrals)

Remote · 4/2/2026
Apply →

## **About the Role**

The platform is seeking experienced **First-Line Supervisors of Retail Sales Workers** to support a leading AI lab in advancing research and infrastructure for next-generation machine learning systems. This engagement focuses on diagnosing and solving real issues in your domain. It's an opportunity to contribute your expertise to cutting-edge AI research while working independently and remotely on your own schedule.

## **Key Responsibilities**

- You'll be asked to create deliverables regarding common requests within your professional domain
- You'll be asked to review peer-developed deliverables to improve AI research

## **Ideal Qualifications**

- 4+ years of professional experience in your respective field
- Excellent written communication with strong grammar and spelling skills

## **More About the Opportunity**

- Expected workload: ~30 hours per week, with flexibility to scale up to 40 hours
- Project start date: immediately, lasting for around 3-4 weeks

We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.

## **Earn $200 by referring**

Share the referral link below, and earn $200 for each successful referral through this unique link. There's no limit on how many people you can refer. Restrictions may apply. [Learn more](https://talent.docs.the platform.com/policies/referrals)

Remote · 4/2/2026
Apply →
$65 - $75 per hour

## **About the Role**

The platform is seeking experienced **concierge professionals** to support a leading AI lab in advancing research and infrastructure for next-generation machine learning systems. This engagement focuses on diagnosing and solving real issues in your domain. It's an opportunity to contribute your expertise to cutting-edge AI research while working independently and remotely on your own schedule.

## **Key Responsibilities**

- You'll be asked to create deliverables regarding common requests within your professional domain
- You'll be asked to review peer-developed deliverables to improve AI research

## **Ideal Qualifications**

- 4+ years of professional experience in your respective domain
- Excellent written communication with strong grammar and spelling skills

## **More About the Opportunity**

- Expected workload: 30 hours per week, with flexibility to scale up to 40 hours
- Project start date: immediately, lasting for around 3-4 weeks

We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.

## **Earn $250 by referring**

Share the referral link below, and earn $250 for each successful referral through this unique link. There's no limit on how many people you can refer. Restrictions may apply. [Learn more](https://talent.docs.the platform.com/policies/referrals)

Remote · 4/2/2026
Apply →

**About the Role**

The platform is taking applications for **Family Medicine / Primary Care Physicians (PCPs)** on behalf of a healthcare AI partner building advanced clinical decision-support tools. We hire multiple experts for this role every few weeks. In this role you will leverage your clinical expertise to **review, annotate, and validate medical data**, contributing directly to the development of **safe, accurate, and explainable medical AI systems**. This is an in-person position based in San Francisco.

### **Key Responsibilities**

- **Clinical Data Annotation:** Review and label clinical text, EHR data, and case notes for use in AI model training. Identify and validate medical entities, diagnoses, treatment pathways, and outcomes relevant to family medicine.
- **Quality Review & Validation:** Audit annotated datasets for clinical accuracy and consistency. Cross-check outputs generated by AI models to ensure medical soundness.
- **Knowledge Contribution:** Provide expert input on guidelines for annotation, taxonomy development, and edge case definitions. Collaborate with data scientists and engineers to improve AI understanding of medical context.
- **Model Evaluation & Feedback:** Evaluate AI-generated recommendations or clinical summaries, flag inaccuracies, and provide structured feedback for iterative model refinement.
- **Documentation & Training Support:** Contribute to the creation of clinical documentation standards and assist in developing onboarding materials for new annotators.

### **Requirements**

- MD or DO degree with specialization in **Family Medicine** or **Internal Medicine**.
- **Board-certified or board-eligible** in Family Medicine or Internal Medicine.
- Active **medical license** in good standing.
- Academic hospital experience preferred.
- **2+ years of clinical experience** in in-patient or hospitalist care settings.
- This is a talent network application, where we store your details for multiple upcoming projects.

Remote · 4/2/2026
Apply →
$100 per hour

The platform is recruiting U.S./UK/Canada/Europe-based SWEs for a model-training project with a leading foundational model AI lab.

You are a good fit if you:

- Have experience working at top US tech firms
- Have a proven track record of building and maintaining complex, production-grade Python systems: not just scripts or notebooks, but full-featured services, tools, or frameworks used in real-world environments
- Have a deep understanding of Python language fundamentals, including advanced features like decorators, generators, async/await, context managers, and performance tuning (e.g., profiling, memory optimization)
- Have experience designing modular, testable codebases, using modern Python tooling and best practices (e.g., FastAPI, Pydantic, type hints, dependency injection, unit/integration testing frameworks)

Interview Process:

- **The vetting process involves a technical interview conducted by a human; you will not be allowed to use an AI IDE (Integrated Development Environment), but you will be allowed to use LLMs or Stack Overflow**

Here are more details about the role:

- You must be able to commit **around 20 hours per week** for this role
- This contract is expected to last at least **1 month**
- Successful contributions increase the odds that you are selected for future projects with the platform
- The vetting process involves a 90-minute human interview centered on Python; you will hear back within two weeks

With respect to pay and legal status:

- **This role will pay $100/h** based on experience

Remote · 4/2/2026
Apply →

The platform is seeking an exceptional open source contributor with deep expertise in Python, Java, C, JavaScript, or TypeScript to collaborate on high-impact projects with global reach. This role is ideal for engineers with a strong command of core programming fundamentals and a proven track record of consistent, high-quality contributions to leading open-source repositories.

**Key Responsibilities:**

- Design and oversee the creation of evaluations for a wide range of coding tasks across multiple languages, including JavaScript, TypeScript, Python, Java, and C.
- Develop test cases to accurately assess system performance in diverse engineering scenarios.
- Analyze system behavior on real-world user use cases to uncover strengths and improvement areas.
- Communicate evaluation results effectively to the research team to support continued development and optimization.

**You're a great fit if you have:**

- A strong GitHub (or similar) presence with frequent, high-quality contributions to top open-source projects in the last 12 months.
- Expertise in one or more of the following languages: Python, Java, C, JavaScript, or TypeScript.
- Deep familiarity with widely used libraries, frameworks, and tools in your language(s) of choice.
- Excellent understanding of software architecture, performance tuning, and scalable code patterns.
- Strong collaboration skills and experience working within distributed, asynchronous teams.
- Confidence in independently identifying areas for contribution and executing improvements with minimal oversight.
- Comfort using Git and CI/CD systems and participating in open-source governance workflows.

Remote · 4/2/2026
Apply →

## Day-to-day

- Edit and proofread **Arabic-language** content for grammar, spelling, punctuation, syntax, clarity, and overall readability.
- Improve structure, logical flow, and readability without changing the writer's intended meaning.
- Keep tone, style, terminology, and formatting consistent across documents.
- Identify ambiguities, inconsistencies, factual issues, weak reasoning, unclear instructions, and other editorial risks in source material.
- Enforce Arabic style guidance, such as **Modern Standard Arabic** conventions, Arabic editorial standards, or equivalent, and maintain a high editorial bar across large volumes of written work tied to AI training initiatives.

## You bring

- **3–5+ years** of proven professional **copy editing** experience.
- Native-level written **Arabic** and exceptional grammatical command.
- A demonstrated record of high-precision editorial work with minimal error rates.
- Extremely strong attention to detail and the ability to catch subtle inconsistencies.
- Comfort following instructions in a professional written workflow, working independently, and maintaining consistent quality over time.

## Bonus

- Minimum commitment is **20 hours per week**, with ongoing and consistent engagement expected.
- The work centers on reviewing and refining **high-volume written material**.
- Application steps: submit a resume that highlights relevant editorial work, then complete the required interview and assessment. Expect follow-up on next steps within a few days.

Remote · 4/2/2026
Apply →

## **1\. Role Overview**

The platform is seeking experienced **Portuguese Copy Editors** to support high-quality written content across a range of AI training initiatives. This role is ideal for detail-oriented language professionals who can ensure clarity, consistency, grammatical precision, and stylistic excellence across complex Portuguese written materials. We are looking for people who demonstrate exceptional command of Portuguese, strong editorial judgment, and a meticulous eye for detail. This role requires consistent engagement and reliability over time.

## **2\. Key Responsibilities**

- Edit and proofread Portuguese-language content for grammar, spelling, punctuation, syntax, and clarity
- Ensure consistency in tone, style, formatting, and terminology across documents
- Improve structure, readability, and logical flow without altering intended meaning
- Identify ambiguities, inconsistencies, and factual or structural issues
- **Enforce Portuguese style guide adherence and maintain high editorial standards**
- Flag unclear instructions, weak reasoning, or editorial risks in source material
- Provide thoughtful feedback to improve overall content quality

## **3\. Ideal Qualifications**

- **3-5+ years of proven professional experience** in copy editing
- Exceptional written and grammatical command of Portuguese (native proficiency required)
- Demonstrated history of high-precision editorial work with minimal error rates
- Extremely high attention to detail and ability to catch subtle inconsistencies
- Strong familiarity with following instructions in a professional written context (**Acordo Ortográfico standards, Portuguese editorial standards, or equivalent**)
- Ability to work independently while maintaining consistent quality standards
- Reliable availability and consistent engagement

## **4\. More About the Opportunity**

- Minimum commitment: **20 hours per week**
- Ongoing, consistent engagement expected
- Work involves reviewing and refining high-volume written material

## **5\. Application Process**

- Submit your resume highlighting relevant editorial experience
- Complete the required interview and assessment
- We aim to follow up within a few days with next steps

## **6\. About the platform**

The platform is a talent marketplace that connects top experts with leading AI labs and research organizations. Our investors include Benchmark, General Catalyst, Adam D'Angelo, and Jack Dorsey. Thousands of professionals across law, research, engineering, design, and creative fields contribute to the platform projects shaping the next generation of AI.

Remote · 4/2/2026
Apply →

## **1\. Role Overview**

The platform is seeking experienced **Russian Copy Editors** to support high-quality written content across a range of AI training initiatives. This role is ideal for detail-oriented language professionals who can ensure clarity, consistency, grammatical precision, and stylistic excellence across complex Russian written materials. We are looking for people who demonstrate exceptional command of Russian, strong editorial judgment, and a meticulous eye for detail. This role requires consistent engagement and reliability over time.

## **2\. Key Responsibilities**

- Edit and proofread Russian-language content for grammar, spelling, punctuation, syntax, and clarity
- Ensure consistency in tone, style, formatting, and terminology across documents
- Improve structure, readability, and logical flow without altering intended meaning
- Identify ambiguities, inconsistencies, and factual or structural issues
- **Enforce Russian style guide adherence and maintain high editorial standards**
- Flag unclear instructions, weak reasoning, or editorial risks in source material
- Provide thoughtful feedback to improve overall content quality

## **3\. Ideal Qualifications**

- **3-5+ years of proven professional experience** in copy editing
- Exceptional written and grammatical command of Russian (native proficiency required)
- Demonstrated history of high-precision editorial work with minimal error rates
- Extremely high attention to detail and ability to catch subtle inconsistencies
- Strong familiarity with following instructions in a professional written context (**GOST, Russian editorial standards, or equivalent**)
- Ability to work independently while maintaining consistent quality standards
- Reliable availability and consistent engagement

## **4\. More About the Opportunity**

- Minimum commitment: **20 hours per week**
- Ongoing, consistent engagement expected
- Work involves reviewing and refining high-volume written material

## **5\. Application Process**

- Submit your resume highlighting relevant editorial experience
- Complete the required interview and assessment
- We aim to follow up within a few days with next steps

## **6\. About the platform**

The platform is a talent marketplace that connects top experts with leading AI labs and research organizations. Our investors include Benchmark, General Catalyst, Adam D'Angelo, and Jack Dorsey. Thousands of professionals across law, research, engineering, design, and creative fields contribute to the platform projects shaping the next generation of AI.

Remote · 4/2/2026
Apply →

## About the Role

The platform is hiring **Building Code & Permitting Specialists** on behalf of a partner developing AI systems to streamline construction permitting, regulatory compliance, and plan review workflows. In this role, you will apply your expertise in building codes, jurisdictional requirements, and permitting processes to review, annotate, and validate construction-related data, helping train AI models to accurately interpret and navigate regulatory frameworks. This role bridges real-world permitting and compliance workflows with applied AI, ensuring that jurisdiction-specific rules, edge cases, and review standards are correctly represented in model training.

## Key Responsibilities

### Regulatory Data Annotation

- Review and label construction documents, permit applications, site plans, and inspection reports
- Identify and classify code requirements, jurisdictional rules, and compliance elements across different regions

### Quality Review & Validation

- Audit annotated datasets for accuracy and alignment with local, state, and national building codes
- Evaluate AI-generated outputs for correctness in permit requirements, zoning constraints, and code compliance

### Knowledge Contribution

- Define annotation guidelines for interpreting building codes and jurisdictional differences
- Provide expertise on edge cases such as mixed-use zoning, variances, and complex permitting scenarios

### Model Evaluation & Feedback

- Review AI-generated permit reviews, compliance summaries, and recommendations
- Provide structured feedback to improve regulatory reasoning and accuracy across jurisdictions

### Documentation & Training Support

- Contribute to standards for construction permitting annotation and onboarding materials

## Requirements

We are seeking candidates with strong familiarity with building codes, permitting workflows, and jurisdictional requirements. Ideal backgrounds include:

- **Permit Expeditors**: Experience coordinating with city agencies, submitting permits, and managing inspection processes
- **Architects**: Strong knowledge of building codes, design compliance, and plan preparation
- **Government Reviewers / Plan Examiners**: Experience reviewing and approving permits within municipal or state agencies
- We are initially seeking **experts** with deep expertise in one of the following jurisdictions:
  - California
  - Texas
  - Florida

Remote · 4/4/2026
Apply →

**Google Workspace & Business Profile Owners**

Participant Qualifications

To ensure the integrity and quality of our insights, the team needs contributors who meet the following professional criteria:

Core Requirement: Profile Ownership

Applicants must be active owners or administrators of a verified Google Business Profile (GBP). The team is looking for individuals who engage with the platform regularly to manage their online presence, respond to inquiries, or update business information.

## Day-to-day

- Technical Expertise: No specialized technical background is required beyond a functional understanding of managing your own business listing
- Account Maturity: To provide the most valuable data, we prefer accounts with at least one year of active history. However, we welcome applications from newer business owners who demonstrate consistent profile engagement
- Hospitality & Lodging: Hotels, B&Bs, and specialized accommodations
- Food & Beverage: Restaurants, cafes, and catering services
- Retail & Shopping: Boutique storefronts and specialized commerce
- Professional Services: Trade services, consulting, and consumer-facing agencies

_Note: While we strive for a broad distribution across these verticals, there is no strict quota per industry. All eligible business owners are encouraged to apply._

Remote · 4/4/2026
Apply →
$90 - $150 per hour

## **About the Role**

The platform is seeking experienced **Compliance Officers** to support a leading AI lab in advancing research and infrastructure for next-generation machine learning systems. This engagement focuses on diagnosing and solving real issues in your domain. It's an opportunity to contribute your expertise to cutting-edge AI research while working independently and remotely on your own schedule.

## **Key Responsibilities**

- You'll be asked to create tasks and deliverables regarding common requests within your professional domain

## **Ideal Qualifications**

- 4+ years of professional experience in your respective field
- Excellent written communication with strong grammar and spelling skills

## **More About the Opportunity**

- Expected workload: ~15 hours per week, with flexibility to scale up to 30+ hours
- Project start date: immediately, lasting for around 3-4 weeks

We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.

## **Earn $200 by referring**

Share the referral link below, and earn $200 for each successful referral through this unique link. There's no limit on how many people you can refer. Restrictions may apply. [Learn more](https://talent.docs.the platform.com/policies/referrals)

Remote · 4/4/2026
Apply →
$500 - $1,000 one-time

**This listing will be archived. Please apply to the Accounting & Audit Expert role instead.**

## Day-to-day

- Evaluate AI-generated accounting work for **accuracy, completeness, and compliance** with **GAAP, IFRS, and applicable regulatory frameworks**.
- Calibrate AI decision processes used for **journal entries, reconciliations, and variance analysis**.
- Simulate and audit workflows tied to **financial close, consolidations, tax filings, and internal controls**.
- Provide accounting feedback on model-generated **financial statements, disclosures, audit working papers, and management reports**.
- Work asynchronously with product and AI teams to improve systems using real-world accounting practice.

## You bring

- **4+ years** of professional accounting experience, ideally in **Big Four firms (Deloitte, PwC, EY, KPMG)**, **Fortune 500** finance or accounting teams, or specialized settings such as **government, nonprofit/fund, forensic, or international/cross-border accounting**.
- **Bachelor's degree** in **Accounting, Finance, or a related quantitative field**.
- Strong technical accounting knowledge across **GAAP, IFRS, and related reporting or regulatory frameworks**.
- Expertise in core processes such as **financial reporting, budgeting, tax compliance, month-end close, reconciliations, revenue recognition, lease accounting, consolidations, and internal controls**.
- Excellent analytical, documentation, and communication skills.

## Bonus

- **Master's in Accounting** or **MBA with an accounting concentration**.
- Credentials such as **CPA, CA, CMA, or ACCA**.
- Experience in specialized areas like **fraud investigations, public sector compliance, donor and grant reporting, transfer pricing, foreign exchange, or GAAP vs. IFRS**.

## Pay

- **Immediate start**
- **Duration:** about **2 weeks**, with potential for expansion
- **Commitment:** about **15 hours per week**
- **Task completion pay:** approximately **$500 - $1000 per completed task**, subject to change as the project evolves
- **Performance bonus:** top performers receive a **weekly bonus incentive** on top of the per-task rate

**Application process:** submit a **resume and application form**, complete a short **15-minute AI interview**, and then complete a **paid 6-hour work trial** consisting of onboarding and an initial submission to evaluate your ability to interpret and apply the project guidelines.

This assignment is a fit for accountants who can translate messy real-world finance workflows into structured feedback that is precise enough for model calibration and practical enough for non-engineering stakeholders to use.

Remote · 4/4/2026
Apply →

**Role Overview**

The platform is collaborating with a leading AI lab to contract experienced business and corporate writers for an AI training data project. Contractors will write natural, high-quality prompts and responses across a range of professional writing tasks common in corporate environments: strategy memos, stakeholder updates, performance reviews, polished emails, and more. The goal is to help AI models produce business writing that is sharp, concise, and well-calibrated to the conventions of corporate communication. All work must be entirely human-written; LLM-generated content is strictly prohibited.

* * *

**Key Responsibilities**

- Write realistic prompts that reflect how professionals request written content in corporate and business settings, including:
  - Memos & strategy docs
  - Proposals
  - Stakeholder updates
  - Performance reviews and feedback
  - Emails
  - Messages
  - Notes-to-docs (e.g., converting meeting notes into a polished document)
  - Domain explainers (e.g., how to build a DCF model, how to read a P&L)
  - Tutorials and how-to guides
- Craft polished, high-quality responses to those prompts that demonstrate clear, concise writing with appropriate tone, structure, accuracy, and audience-awareness
- Adapt voice and style to match the context; an internal Slack message should read differently than a board-level strategy memo or a client proposal

* * *

**Ideal Qualifications**

- **Professional experience in finance, consulting, retail, or a closely related corporate domain where clear written communication is central to the role**
- A background that requires consistent, high-volume writing: strategy decks, client deliverables, internal communications, analyst reports, or similar
- Sharp, concise writing instincts, with comfort distilling complex information into clear, well-structured documents
- Range across corporate formats: comfort moving between a quick status update and a formal proposal
- Strong attention to detail and a high personal bar for quality

* * *

**More About the Opportunity**

- Expected commitment: ~15 hours/week
- All writing will be in English

**Application Process**

- Submit your resume to begin
- Complete the Domain Expert Interview
- Complete the Training Quiz

Remote · 4/6/2026
Apply →

The platform is seeking detail-oriented writing experts to contribute to a high-impact AI research project with a leading lab. Freelancers will use their AI input prompts to assess advanced language models. This is a short-term, flexible opportunity for professionals with strong professional backgrounds and a knack for instructional clarity. Ideal for those who enjoy distilling complex concepts into well-crafted text.

* * *

### **Job Details:**

- **Architect and Refine Prompts**: Craft detailed prompts with multiple constraints and instructions.
- **Run Model Trialing and Grading**: Run prompts through models and assess preliminary outputs against expectations, choosing and explaining model preferences.
- **Aid Benchmarking and Quality Assurance**: Take part in QA review processes to confirm that prompt tasks and rubrics meet the required rigor, maintaining consistency and reliability before integration into official benchmarks.

### **Minimum Qualifications:**

- BS or BA from a reputable institution.
- Strong writing and critical thinking skills.
- Ability to work independently and meet deadlines.
- Notable exposure to ChatGPT or similar tools for professional questions.
- Use of macOS and/or iOS on a day-to-day basis.
- US or Canada based.

### **Preferred Qualifications:**

- 3-6 years of track record in a professional domain:
  - Medicine
  - Law
  - Finance
  - Consulting
  - Accounting
  - Data Science
  - Coding
  - Engineering

### **Application & ramp-up Process:**

- Complete an AI-led interview; this should take around 15 minutes.
- If selected, you will be invited to work on the project.

### **More Details About This Role:**

- Expect to contribute at least **20 hours per week**.
- Expect a commitment starting at 2-3 weeks, with strong potential to expand.
- You'll be working in a structured project en

Remote · 4/6/2026
Apply →

**Role Overview**

We are seeking expert computer scientists to author and review high-quality academic assessment content for an AI research initiative. You will write and verify rigorous multiple-choice questions across core computer science domains, evaluate solution quality, and help establish gold-standard benchmarks used to advance AI capabilities. You will be assigned one of two task types:

- **Question Authoring**: Create original, challenging multiple-choice questions in your area of computer science expertise, rate their difficulty, and submit them for review.
- **Question Verification**: Review pre-written questions for accuracy, clarity, and rigor. Edit where needed, rate difficulty, and document any changes made.

**Computer Science Domains Covered**

Algorithms & Data Structures, Machine Learning & AI, Systems, Networks & Theory, Computer Security, Software Engineering, Miscellaneous Computer Science.

**Key Responsibilities**

- Author original computer science questions that test deep conceptual understanding, not surface-level recall
- Ensure questions are unambiguous, self-contained, and precisely defined; all necessary information must be in the problem statement
- Rate each question's difficulty: Medium (intro undergraduate), Hard (advanced undergraduate), or Expert (post-graduate and above)
- Provide 1 correct answer and 9 plausible but subtly incorrect alternatives that challenge expert-level solvers
- Write step-by-step Chain-of-Thought solutions with clear, concise intermediate steps in markdown format
- Supply 1–5 academic references per question from reputable sources (peer-reviewed journals, university repositories)
- For verification tasks: flag issues with clarity, completeness, precision, or solvability and justify any edits made

**Ideal Qualifications**

- PhD or doctoral candidate in Computer Science, Electrical Engineering, or a closely related field
- Master's degree considered for candidates with exceptional depth in a specific subdomain
- Strong command of graduate-level CS theory, algorithms, systems design, and/or machine learning
- Research publications, industry experience at top tech companies, or competitive programming background is a strong plus
- Excellent written English and ability to express complex ideas clearly and concisely

**More About the Opportunity**

- Expected commitment: 10+ hours/week
- Asynchronous, fully remote work

Remote · 4/7/2026
Apply →
$25 - $60 per hour

**Role Overview**

We are seeking expert philosophers to author and review high-quality academic assessment content for an AI research initiative. You will write and verify rigorous multiple-choice questions across core philosophy domains, evaluate solution quality, and help establish gold-standard benchmarks used to advance AI capabilities. You will be assigned one of two task types:

- **Question Authoring**: Create original, challenging multiple-choice questions in your area of philosophy expertise, rate their difficulty, and submit them for review.
- **Question Verification**: Review pre-written questions for accuracy, clarity, and rigor. Edit where needed, rate difficulty, and document any changes made.

**Philosophy Domains Covered**

Ethics & Moral Philosophy, Formal Logic, World Religions & Philosophy of Religion, Epistemology & Metaphysics, Miscellaneous Philosophy.

**Key Responsibilities**

- Author original philosophy questions that test deep conceptual understanding, not surface-level recall
- Ensure questions are unambiguous, self-contained, and precisely defined; all necessary information must be in the problem statement
- Rate each question's difficulty: Medium (intro undergraduate), Hard (advanced undergraduate), or Expert (post-graduate and above)
- Provide 1 correct answer and 9 plausible but subtly incorrect alternatives that challenge expert-level solvers
- Write step-by-step Chain-of-Thought solutions with clear, concise reasoning in markdown format
- Supply 1–5 academic references per question from reputable sources (peer-reviewed journals, university repositories)
- For verification tasks: flag issues with clarity, completeness, precision, or solvability and justify any edits made

**Ideal Qualifications**

- PhD or doctoral candidate in Philosophy or a closely related field
- Master's degree considered for candidates with exceptional depth in a specific subdomain
- Strong command of philosophical argumentation, formal logic, and canonical texts across traditions
- Research publications or teaching experience in philosophy is a strong plus
- Excellent written English and ability to express complex ideas clearly and concisely

**More About the Opportunity**

- Expected commitment: 10+ hours/week
- Asynchronous, fully remote work

Remote · 4/7/2026
Apply →

**Role Overview**

We are seeking experts in history and political science to author and review high-quality academic assessment content for an AI research initiative. You will write and verify rigorous multiple-choice questions across core history and political science domains, evaluate solution quality, and help establish gold-standard benchmarks used to advance AI capabilities. You will be assigned one of two task types:

- **Question Authoring**: Create original, challenging multiple-choice questions in your area of expertise, rate their difficulty, and submit them for review.
- **Question Verification**: Review pre-written questions for accuracy, clarity, and rigor. Edit where needed, rate difficulty, and document any changes made.

**History & Political Science Domains Covered**

World History, U.S. History, Prehistory & Archaeology, International Relations & Security Studies, Government & Politics, European History, U.S. Foreign Policy, Miscellaneous History & Political Science.

**Key Responsibilities**

- Author original history and political science questions that test deep conceptual understanding, not surface-level recall
- Ensure questions are unambiguous, self-contained, and precisely defined; all necessary information must be in the problem statement
- Rate each question's difficulty: Medium (intro undergraduate), Hard (advanced undergraduate), or Expert (post-graduate and above)
- Provide 1 correct answer and 9 plausible but subtly incorrect alternatives that challenge expert-level solvers
- Write step-by-step Chain-of-Thought solutions with clear, concise reasoning in markdown format
- Supply 1–5 academic references per question from reputable sources (peer-reviewed journals, university repositories)
- For verification tasks: flag issues with clarity, completeness, precision, or solvability and justify any edits made

**Ideal Qualifications**

- PhD or doctoral candidate in History, Political Science, International Relations, or a closely related field
- Master's degree considered for candidates with exceptional depth in a specific subdomain
- Strong command of historiographical methods, political theory, and comparative analysis
- Research publications or policy experience is a strong plus
- Excellent written English and ability to express complex ideas clearly and concisely

**More About the Opportunity**

- Expected commitment: 10+ hours/week
- Asynchronous, fully remote work

Remote · 4/7/2026
Apply →
$30 - $75 per hour

**Role Overview**

We are seeking expert biologists to author and review high-quality academic assessment content for an AI research initiative. You will write and verify rigorous multiple-choice questions across core biology domains, evaluate solution quality, and help establish gold-standard benchmarks used to advance AI capabilities.

You will be assigned one of two task types:

- **Question Authoring**: Create original, challenging multiple-choice questions in your area of biology expertise, rate their difficulty, and submit them for review.
- **Question Verification**: Review pre-written questions for accuracy, clarity, and rigor. Edit where needed, rate difficulty, and document any changes made.

**Biology Domains Covered**

Ecology and Evolutionary Biology, Genetics, Cell Biology, Biomedical Science, Microbiology, Molecular Biology, Biochemistry, Neuroscience, Miscellaneous Biology.

**Key Responsibilities**

- Author original biology questions that test deep conceptual understanding, not surface-level recall
- Ensure questions are unambiguous, self-contained, and precisely defined; all necessary information must be in the problem statement
- Rate each question's difficulty: Medium (intro undergraduate), Hard (advanced undergraduate), or Expert (post-graduate and above)
- Provide 1 correct answer and 9 plausible but subtly incorrect alternatives that challenge expert-level solvers
- Write step-by-step Chain-of-Thought solutions with clear, concise intermediate steps in markdown format
- Supply 1–5 academic references per question from reputable sources (peer-reviewed journals, university repositories)
- For verification tasks: flag issues with clarity, completeness, precision, or solvability and justify any edits made

**Ideal Qualifications**

- PhD or doctoral candidate in Biology, Molecular Biology, Biochemistry, Neuroscience, or a closely related field
- Master's degree considered for candidates with exceptional depth in a specific subdomain
- Strong command of graduate-level biological concepts, experimental design, and data interpretation
- Research publications or laboratory experience in biological sciences is a strong plus
- Excellent written English and ability to express complex ideas clearly and concisely

**More About the Opportunity**

- Expected commitment: 10+ hours/week
- Asynchronous, fully remote work

Remote · 4/7/2026
Apply →
$25 - $60 per hour

**Role Overview**

We are seeking expert psychologists to author and review high-quality academic assessment content for an AI research initiative. You will write and verify rigorous multiple-choice questions across core psychology domains, evaluate solution quality, and help establish gold-standard benchmarks used to advance AI capabilities.

You will be assigned one of two task types:

- **Question Authoring**: Create original, challenging multiple-choice questions in your area of psychology expertise, rate their difficulty, and submit them for review.
- **Question Verification**: Review pre-written questions for accuracy, clarity, and rigor. Edit where needed, rate difficulty, and document any changes made.

**Psychology Domains Covered**

Clinical & Professional Psychology, Social & Behavioral Psychology, Developmental Psychology, Miscellaneous Psychology.

**Key Responsibilities**

- Author original psychology questions that test deep conceptual understanding, not surface-level recall
- Ensure questions are unambiguous, self-contained, and precisely defined; all necessary information must be in the problem statement
- Rate each question's difficulty: Medium (intro undergraduate), Hard (advanced undergraduate), or Expert (post-graduate and above)
- Provide 1 correct answer and 9 plausible but subtly incorrect alternatives that challenge expert-level solvers
- Write step-by-step Chain-of-Thought solutions with clear, concise reasoning in markdown format
- Supply 1–5 academic references per question from reputable sources (peer-reviewed journals, university repositories)
- For verification tasks: flag issues with clarity, completeness, precision, or solvability and justify any edits made

**Ideal Qualifications**

- PhD, PsyD, or doctoral candidate in Psychology or a closely related field
- Master's degree considered for candidates with exceptional depth in a specific subdomain
- Strong command of graduate-level psychological theory, research methodology, and empirical literature
- Clinical licensure or research publications in psychology are a strong plus
- Excellent written English and ability to express complex ideas clearly and concisely

**More About the Opportunity**

- Expected commitment: 10+ hours/week
- Asynchronous, fully remote work

Remote · 4/7/2026
Apply →

**Role Overview**

We are seeking expert medical and health science professionals to author and review high-quality academic assessment content for an AI research initiative. You will write and verify rigorous multiple-choice questions across core health and medicine domains, evaluate solution quality, and help establish gold-standard benchmarks used to advance AI capabilities.

You will be assigned one of two task types:

- **Question Authoring**: Create original, challenging multiple-choice questions in your area of medical expertise, rate their difficulty, and submit them for review.
- **Question Verification**: Review pre-written questions for accuracy, clarity, and rigor. Edit where needed, rate difficulty, and document any changes made.

**Health & Medicine Domains Covered**

Clinical Specialties & Practice, Anatomy & Physiology, Nutrition & Dietetics, Public Health & Epidemiology, Medical Genetics, Virology & Infectious Disease, Human Aging & Gerontology, Immunology, Physical Therapy, Pharmacology, Pathology, Miscellaneous Health & Medicine.

**Key Responsibilities**

- Author original health and medicine questions that test deep conceptual understanding, not surface-level recall
- Ensure questions are unambiguous, self-contained, and precisely defined; all necessary information must be in the problem statement
- Rate each question's difficulty: Medium (intro undergraduate), Hard (advanced undergraduate), or Expert (post-graduate and above)
- Provide 1 correct answer and 9 plausible but subtly incorrect alternatives that challenge expert-level solvers
- Write step-by-step Chain-of-Thought solutions with clear, concise reasoning in markdown format
- Supply 1–5 academic references per question from reputable sources (peer-reviewed journals, clinical guidelines)
- For verification tasks: flag issues with clarity, completeness, precision, or solvability and justify any edits made

**Ideal Qualifications**

- MD, DO, PhD, or doctoral candidate in Medicine, Biomedical Sciences, Public Health, or a closely related field
- Master's degree considered for candidates with exceptional depth in a specific subdomain
- Strong command of graduate-level medical knowledge, clinical reasoning, and biomedical research methodology
- Board certification, clinical experience, or research publications in health fields are a strong plus
- Excellent written English and ability to express complex ideas clearly and concisely

**More About the Opportunity**

- Expected commitment: 10+ hours/week
- Asynchronous, fully remote work

Remote · 4/7/2026
Apply →

**Role Overview**

We are seeking business and commerce experts to author and review high-quality academic assessment content for an AI research initiative. You will write and verify rigorous multiple-choice questions across core business domains, evaluate solution quality, and help establish gold-standard benchmarks used to advance AI capabilities.

You will be assigned one of two task types:

- **Question Authoring**: Create original, challenging multiple-choice questions in your area of business expertise, rate their difficulty, and submit them for review.
- **Question Verification**: Review pre-written questions for accuracy, clarity, and rigor. Edit where needed, rate difficulty, and document any changes made.

**Business & Commerce Domains Covered**

Business Intelligence, Business Ethics & Management, Marketing & Public Relations, E-Commerce, Miscellaneous Business & Commerce.

**Key Responsibilities**

- Author original business questions that test deep conceptual understanding, not surface-level recall
- Ensure questions are unambiguous, self-contained, and precisely defined; all necessary information must be in the problem statement
- Rate each question's difficulty: Medium (intro undergraduate), Hard (advanced undergraduate), or Expert (post-graduate and above)
- Provide 1 correct answer and 9 plausible but subtly incorrect alternatives that challenge expert-level solvers
- Write step-by-step Chain-of-Thought solutions with clear, concise reasoning in markdown format
- Supply 1–5 academic references per question from reputable sources (peer-reviewed journals, university repositories)
- For verification tasks: flag issues with clarity, completeness, precision, or solvability and justify any edits made

**Ideal Qualifications**

- PhD, DBA, or doctoral candidate in Business Administration, Management, Marketing, or a closely related field
- MBA or Master's degree considered for candidates with exceptional depth in a specific subdomain
- Strong command of graduate-level business strategy, organizational theory, and quantitative methods
- Industry leadership experience or research publications in business fields are a strong plus
- Excellent written English and ability to express complex ideas clearly and concisely

**More About the Opportunity**

- Expected commitment: 10+ hours/week
- Asynchronous, fully remote work

Remote · 4/7/2026
Apply →
$35 - $75 per hour

**Role Overview**

We are seeking expert engineers to author and review high-quality academic assessment content for an AI research initiative. You will write and verify rigorous multiple-choice questions across core engineering domains, evaluate solution quality, and help establish gold-standard benchmarks used to advance AI capabilities.

You will be assigned one of two task types:

- **Question Authoring**: Create original, challenging multiple-choice questions in your area of engineering expertise, rate their difficulty, and submit them for review.
- **Question Verification**: Review pre-written questions for accuracy, clarity, and rigor. Edit where needed, rate difficulty, and document any changes made.

**Engineering Domains Covered**

Mechanical & Thermal Engineering, Electrical Engineering, Aerospace Engineering, Materials Science & Engineering, Industrial & Systems Engineering, Civil Engineering, Miscellaneous Engineering.

**Key Responsibilities**

- Author original engineering questions that test deep conceptual understanding, not surface-level recall
- Ensure questions are unambiguous, self-contained, and precisely defined; all necessary information must be in the problem statement
- Rate each question's difficulty: Medium (intro undergraduate), Hard (advanced undergraduate), or Expert (post-graduate and above)
- Provide 1 correct answer and 9 plausible but subtly incorrect alternatives that challenge expert-level solvers
- Write step-by-step Chain-of-Thought solutions with clear, concise intermediate steps in markdown format
- Supply 1–5 academic references per question from reputable sources (peer-reviewed journals, university repositories)
- For verification tasks: flag issues with clarity, completeness, precision, or solvability and justify any edits made

**Ideal Qualifications**

- PhD or doctoral candidate in Engineering or a closely related field
- Master's degree considered for candidates with exceptional depth in a specific subdomain
- Strong command of graduate-level engineering principles, applied mathematics, and domain-specific standards
- Professional engineering licensure (PE) or industry experience is a strong plus
- Excellent written English and ability to express complex ideas clearly and concisely

**More About the Opportunity**

- Expected commitment: 10+ hours/week
- Asynchronous, fully remote work

Remote · 4/7/2026
Apply →
$35 - $75 per hour

**Role Overview**

We are seeking expert chemists to author and review high-quality academic assessment content for an AI research initiative. You will write and verify rigorous multiple-choice questions across core chemistry domains, evaluate solution quality, and help establish gold-standard benchmarks used to advance AI capabilities.

You will be assigned one of two task types:

- **Question Authoring**: Create original, challenging multiple-choice questions in your area of chemistry expertise, rate their difficulty, and submit them for review.
- **Question Verification**: Review pre-written questions for accuracy, clarity, and rigor. Edit where needed, rate difficulty, and document any changes made.

**Chemistry Domains Covered**

Physical Chemistry, Inorganic Chemistry, Analytical Chemistry, Organic Chemistry, Miscellaneous Chemistry.

**Key Responsibilities**

- Author original chemistry questions that test deep conceptual understanding, not surface-level recall
- Ensure questions are unambiguous, self-contained, and precisely defined; all necessary information must be in the problem statement
- Rate each question's difficulty: Medium (intro undergraduate), Hard (advanced undergraduate), or Expert (post-graduate and above)
- Provide 1 correct answer and 9 plausible but subtly incorrect alternatives that challenge expert-level solvers
- Write step-by-step Chain-of-Thought solutions with clear, concise intermediate steps in markdown format
- Supply 1–5 academic references per question from reputable sources (peer-reviewed journals, university repositories)
- For verification tasks: flag issues with clarity, completeness, precision, or solvability and justify any edits made

**Ideal Qualifications**

- PhD or doctoral candidate in Chemistry, Biochemistry, Chemical Engineering, or a closely related field
- Master's degree considered for candidates with exceptional depth in a specific subdomain
- Strong command of graduate-level chemistry concepts, reaction mechanisms, and quantitative analysis
- Experience with rigorous academic problem design or chemistry olympiad writing is a strong plus
- Excellent written English and ability to express complex ideas clearly and concisely

**More About the Opportunity**

- Expected commitment: 10+ hours/week
- Asynchronous, fully remote work

Remote · 4/7/2026
Apply →
$25 - $60 per hour

**Role Overview**

We are seeking expert mathematicians to author and review high-quality academic assessment content for an AI research initiative. You will write and verify rigorous multiple-choice questions across core mathematics domains, evaluate solution quality, and help establish gold-standard benchmarks used to advance AI capabilities.

You will be assigned one of two task types:

- **Question Authoring**: Create original, challenging multiple-choice questions in your area of mathematical expertise, rate their difficulty, and submit them for review.
- **Question Verification**: Review pre-written questions for accuracy, clarity, and rigor. Edit where needed, rate difficulty, and document any changes made.

**Mathematics Domains Covered**

Algebra (incl. Linear Algebra), Probability & Statistics, Analysis (incl. Calculus), Discrete Mathematics (incl. Combinatorics & Graph Theory), Number Theory, Geometry & Topology, ODE/PDE & Dynamical Systems, Optimization & Operations Research (incl. Game Theory), Computational & Numerical Mathematics, Logic, Set Theory & Foundations.

**Key Responsibilities**

- Author original math questions that test deep conceptual understanding, not surface-level recall
- Ensure questions are unambiguous, self-contained, and precisely defined; all necessary information must be in the problem statement
- Rate each question's difficulty: Medium (intro undergraduate), Hard (advanced undergraduate), or Expert (post-graduate and above)
- Provide 1 correct answer and 9 plausible but subtly incorrect alternatives that challenge expert-level solvers
- Write step-by-step Chain-of-Thought solutions with clear, concise intermediate steps in markdown format
- Supply 1–5 academic references per question from reputable sources (peer-reviewed journals, university repositories)
- For verification tasks: flag issues with clarity, completeness, precision, or solvability and justify any edits made

**Ideal Qualifications**

- PhD or doctoral candidate in Mathematics, Applied Mathematics, Statistics, or a closely related field
- Master's degree considered for candidates with exceptional depth in a specific subdomain
- Strong command of graduate-level mathematical concepts and formal proof writing
- Experience with rigorous academic problem design or mathematical competition writing is a strong plus
- Excellent written English and ability to express complex ideas clearly and concisely

**More About the Opportunity**

- Expected commitment: 10+ hours/week
- Asynchronous, fully remote work

Remote · 4/7/2026
Apply →
$40.56 per hour

**Generalist - English & Hebrew**

We're looking for bilingual Hebrew–English speakers to evaluate AI-generated responses, check for accuracy, and provide clear written feedback to help improve model quality. You'll play a direct role in shaping how AI systems communicate with millions of users.

## Day-to-day

- Review AI-generated responses in Hebrew and English
- Check responses for factual accuracy and clarity
- Provide structured written feedback
- Flag reasoning errors or misleading information
- Follow clear evaluation guidelines to ensure consistency

## You bring

- You hold a Bachelor's degree or are currently pursuing one.
- Native or fluent Hebrew & English
- Strong writing skills
- Detail-oriented and thoughtful
- Comfortable reviewing content across different topics
- Experience using AI tools (helpful but not required)

Remote · 4/7/2026
Apply →
$80 - $100 per hour

The platform is seeking skilled evaluators to support an essential evaluation workflow in partnership with a leading AI research lab. This project focuses on improving model performance in the **Legal domain** by evaluating agent-generated research reports related to legal analysis, regulatory updates, case law developments, and emerging legal trends.

### Who We're Hiring

We are looking for evaluators with experience or an academic/professional background in **Law, Legal Research, Compliance, or related fields**. This workflow is ideal for individuals who:

- Have experience in legal analysis, legal practice, compliance, or academic legal research.
- Are familiar with case law, statutes, regulatory frameworks, and legal terminology.
- Can critically evaluate the structure, reasoning, and accuracy of legal arguments.

### Key Responsibilities

- Evaluate the quality, accuracy, and relevance of agent-generated **legal research reports**.
- Assess legal reasoning, factual accuracy, and the applicability of cited laws or precedents.
- Provide structured feedback using a provided rubric and include written justifications for your evaluations.
- Ensure all reports demonstrate clarity, legal soundness, and adherence to the rubric.

### You're a Strong Fit If You Have:

- Experience in law, legal research, litigation support, compliance, policy analysis, or related fields.
- Strong analytical skills to assess legal arguments, factual application, and the relevance of cited authorities.
- Excellent written communication skills to provide precise, cogent, and actionable feedback.

### Role Details

- Part-time (15–30 hours/week) with flexible scheduling.
- Competitive rates: $80–$100/hour depending on expertise.

Remote · 4/7/2026
Apply →
$90 - $110 per hour

**The platform is seeking legal contractors with clean energy transactional or project finance experience to support the development of AI-enabled workflows for clean energy transaction professionals.** These roles involve reviewing AI-generated outputs, validating their accuracy, and providing structured feedback to improve model performance.

**Key Responsibilities:**

- Review and validate AI-generated outputs related to clean energy transactions (e.g., tax equity, transferable tax credits).
- Provide clear, structured feedback to improve accuracy, completeness, and usability of outputs.
- Identify gaps, inconsistencies, or risks in generated content and suggest improvements.
- Follow defined evaluation frameworks and documentation standards.
- Collaborate asynchronously with the project team and meet weekly deliverables.

**You're a strong fit if you have:**

- Experience in clean energy transactions (e.g., tax equity, transferable tax credits), **or** strong project finance experience.
- Background in corporate, transactional, or finance-related legal work.
- Familiarity with deal structures, financing documents, or regulatory considerations in energy or infrastructure.
- Strong attention to detail and ability to critically assess complex outputs.
- Ability to provide structured, actionable feedback.
- Availability of ~10 hours per week, starting immediately.

We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.

Remote · 4/7/2026
Apply →
$50 - $100 per hour

**We are looking for an experienced Korean–English translator/interpreter to support real-time translation during upcoming sales and business meetings.** The ideal candidate will have prior experience facilitating bilingual communication in corporate or professional settings, particularly where business discussions, negotiations, or client presentations are involved.

You'll play a critical role in ensuring smooth, accurate, and culturally appropriate communication between English- and Korean-speaking participants.

* * *

### **Responsibilities**

- Provide simultaneous or consecutive translation during virtual or in-person business meetings.
- Translate both verbal discussions and written follow-ups (chat messages, key points, or summary notes) where needed.
- Ensure professional tone and cultural sensitivity in all translations.
- Coordinate with internal stakeholders before the meeting to understand context, terminology, and objectives.
- Maintain confidentiality and professionalism at all times.

* * *

### **Requirements**

- Proven experience as an interpreter or translator in corporate, sales, or client-facing environments.
- Native-level fluency in Korean and English (both spoken and written).
- Strong understanding of business terminology, particularly in sales, partnerships, or technology.
- Excellent interpersonal and communication skills.
- Ability to think quickly and convey ideas accurately under time constraints.

* * *

### **Nice to Have**

- Prior experience translating for startups, tech firms, or multinational companies.
- Background in sales enablement, client success, or business development contexts.
- Familiarity with meeting platforms like Zoom, Google Meet, or MS Teams.

Remote · 4/8/2026
Apply →
$35 - $55 per hour

## **1\. Role Overview**

The platform is seeking experienced **Bilingual STEM Experts** to support high-quality content across a range of AI training initiatives. This role involves creating image-based STEM training examples (finding relevant images, writing prompts, and crafting ideal model responses) in both English and your native language. We are looking for people who demonstrate strong scientific and technical reasoning, clear written communication in two languages, and a meticulous eye for accuracy. This role requires consistent engagement and reliability over time.

**Languages we are hiring for:** Arabic, Portuguese, Spanish, Korean, French, Chinese, and Russian.

## **2\. Key Responsibilities**

- Create high-quality training examples consisting of an image, a STEM-related prompt, and an ideal response
- Solve problems in non-math STEM fields (physics, chemistry, biology, engineering, computer science, etc.) with clear, step-by-step explanations
- Interpret and analyze charts, diagrams, graphs, infographics, and data visualizations with precision
- Write prompts and responses that sound natural and fluent in your native language, not translated from English
- Ensure proper use of scientific notation, technical terminology, and formatting conventions
- Identify and correct errors in logic, reasoning, or factual content within source material
- Follow style guides, system prompts, and formatting requirements with precision
- Source appropriate images (textbook pages, scientific diagrams, charts, lab results, data visualizations) that meet project quality standards

## **3\. Ideal Qualifications**

- Bachelor's degree (or higher) in a STEM field such as Physics, Chemistry, Biology, Engineering, Computer Science, or a closely related discipline (excluding pure Mathematics)
- Bilingual proficiency in English and one of the following languages: Arabic, Portuguese, Spanish, Korean, French, Chinese, or Russian (native-level proficiency in at least one required)
- Strong ability to communicate scientific reasoning clearly in written form across both languages
- Demonstrated experience with STEM problem-solving at a college level or above
- Ability to read and interpret charts, diagrams, and data visualizations accurately
- Extremely high attention to detail and ability to catch subtle errors in reasoning and factual content
- Familiarity with LaTeX notation and standard scientific formatting conventions is a plus
- Ability to work independently while maintaining consistent quality standards
- Reliable availability and consistent engagement

## **4\. More About the Opportunity**

- Minimum commitment: **20 hours per week**
- Ongoing, consistent engagement expected
- Work involves creating and refining high-volume STEM content in a multilingual context

## **5\. Application Process**

- Submit your resume highlighting relevant STEM and language experience
- Complete the attached required interview and assessment
- We aim to follow up within a few days with next steps

## **6\. About the platform**

The platform is a talent marketplace that connects top experts with leading AI labs and research organizations. Our investors include Benchmark, General Catalyst, Adam D'Angelo, and Jack Dorsey. Thousands of professionals across law, research, engineering, design, and creative fields contribute to the platform's projects shaping the next generation of AI.

Remote · 4/8/2026
Apply →
$500 - $1,000 one-time

## **About the Role**

The platform is seeking experienced **Buying and Purchasing Agents** to support a leading AI lab in advancing research and infrastructure for next-generation machine learning systems. This engagement focuses on diagnosing and solving real issues in your domain. It's an opportunity to contribute your expertise to cutting-edge AI research while working independently and remotely on your own schedule.

## **Key Responsibilities**

- You'll be asked to create deliverables regarding common requests within your professional domain
- You'll be asked to review peer-developed deliverables to improve AI research

## **Ideal Qualifications**

- 4+ years professional experience in your respective field
- Excellent written communication with strong grammar and spelling skills

## Project Timeline

- **Start Date:** Immediate
- **Duration:** ~2 weeks (with the potential for project expansion)
- **Commitment:** ~15 hours/week required

## Compensation & Contract

- **Task Completion Pay:** Competitive and based on task quality (~$500 - $1000 per completed task, subject to change as the project evolves)
- **Performance Bonus:** Top performers receive a weekly bonus incentive on top of their per-task rate!

Remote · 4/9/2026
Apply →

**Legal Expert - AI Data Research Study**

We are looking for legal professionals to participate in a short-term, paid AI research study with Machina.

## Day-to-day

- Complete structured legal reasoning tasks as part of an AI training dataset
- Apply your legal knowledge to evaluate, draft, or analyze legal materials
- Provide feedback on AI-generated content within a guided platform
- Work independently at your own pace within a 15-hour/week commitment

## You bring

- Licensed attorneys, law graduates, or legal professionals with substantive practice experience
- Any legal specialization welcome (corpo

## Pay

- Must be available to start April 15 and commit 15 hrs/week

Remote · 4/10/2026
Apply →
$150 - $180 per hour

The platform is hiring **Fixed Income Trading Experts** on behalf of a leading AI research lab developing advanced financial intelligence systems. In this role, you will contribute your domain expertise to help train, evaluate, and improve AI models designed to understand and operate within fixed income markets. Your insights will directly shape how AI systems interpret bond markets, pricing dynamics, and trading strategies.

This is a **flexible, project-based opportunity** ideal for professionals with deep experience in rates, credit, or structured products trading.

* * *

### **Key Responsibilities**

- Evaluate and improve AI-generated outputs related to:
  - Bond pricing, yield curves, and spread analysis
  - Interest rate products (Treasuries, swaps, futures, repos)
  - Credit markets (IG, HY, distressed debt)
- Design realistic trading scenarios and edge cases based on real-world market behavior
- Review model reasoning on:
  - Trade structuring and execution decisions
  - Risk management (duration, convexity, DV01, liquidity)
  - Relative value and macro-driven strategies
- Provide feedback on accuracy, market intuition, and institutional realism
- Annotate datasets and create high-quality training examples for financial AI systems
- Collaborate with researchers to refine model performance on trading workflows

* * *

### **Ideal Candidate Profile**

- **Experience:**
  - 3–10+ years in fixed income trading, sales & trading, or portfolio management
  - Background at investment banks, hedge funds, asset managers, or proprietary trading firms
- **Core Requirements:**
  - Day-to-day experience executing fixed income trades (government bonds, corporate credit, rates, securitized products, etc.)
  - Hands-on experience using **Tradeweb or similar electronic trading platforms**
  - Strong familiarity with **RFQ workflows, price discovery, and order management systems** in fixed income markets
  - Willingness to provide **candid, experience-based feedback** on trading tools, workflows, and market structure
- **Domain Expertise (one or more):**
  - Rates trading (USTs, swaps, curve trades)
  - Credit trading (corporates, HY, CDS)
  - Structured products (MBS, ABS, CLOs)
  - Emerging markets debt
- **Core Skills:**
  - Strong understanding of bond math (duration, convexity, carry, roll-down)
  - Market intuition around liquidity, execution, and pricing
  - Ability to break down complex trades into clear reasoning steps
- **Nice to Have:**
  - Experience with quant tools (Python, Excel modeling)
  - Familiarity with Bloomberg, MarketAxess, or similar platforms
  - Interest in AI/ML applications in finance

* * *

### **Why Join**

- Contribute to cutting-edge AI systems transforming financial markets
- Apply your trading expertise in a novel, high-impact setting
- Flexible engagement alongside your current role
- Collaborate with top-tier AI researchers and engineers

Remote | 4/10/2026
$120 - $150 per hour

The platform is partnering with a leading AI research lab to engage experienced **Market Research, Consumer Insights, and Marketing Strategy professionals** to act as **end-users of a cutting-edge AI product**. In this role, you will interact with the product as if you were using it in your day-to-day work: evaluating outputs, providing feedback, and helping improve how the system supports real-world decision-making.

This is a **short-term, remote engagement (~2 hours total)** designed for professionals currently working at or with **Fortune 1000 companies or equivalent environments**.

* * *

### **What You’ll Do**

- Use an AI-powered product designed for:
  - Market research synthesis
  - Consumer insights generation
  - Marketing strategy recommendations
- Evaluate outputs from an **end-user perspective**, focusing on:
  - Relevance and usefulness of insights
  - Strategic quality and business applicability
  - Clarity, structure, and actionability
- Provide structured feedback and annotations on:
  - What works well vs. what doesn’t
  - Gaps, inaccuracies, or missing context
  - How outputs could be improved for real-world use
- Simulate real workflows (e.g., reviewing reports, validating insights, making decisions based on outputs)

* * *

### **Who We’re Looking For**

We are seeking professionals who regularly **consume, interpret, and act on market research and insights**, such as:

- Market Research Analysts / Managers / Directors
- Consumer Insights Managers / Leads
- Marketing Strategy / Brand Managers
- Product Marketing / Growth professionals with strong research exposure

**Requirements:**

- 3+ years of experience in market research, insights, or marketing strategy
- Current or recent experience at a **Fortune 1000 company, top consulting firm, or leading agency**
- Hands-on experience using insights to drive decisions (not just producing reports)
- Familiarity with:
  - Consumer research, segmentation, or survey analysis
  - Market sizing, competitive intelligence, or trend analysis
  - Translating insights into business or marketing strategy
- Strong critical thinking and ability to give clear, actionable feedback
- Fluent in English

* * *

### **Engagement Details**

- **Duration:** ~2 hours total
- **Compensation:** Competitive, based on experience

* * *

### **Why Join**

- Influence how next-generation AI products are designed for real business users
- Act as a **true end-user voice** in shaping product quality and usability
- Flexible, low-commitment opportunity alongside your current role
- Opportunity for future product testing and feedback engagements

Remote | 4/10/2026
$35 - $75 per hour

**Role Overview**

We are seeking expert physicists to author and review high-quality academic assessment content for an AI research initiative. You will write and verify rigorous multiple-choice questions across core physics domains, evaluate solution quality, and help establish gold-standard benchmarks used to advance AI capabilities.

You will be assigned one of two task types:

- **Question Authoring:** Create original, challenging multiple-choice questions in your area of physics expertise, rate their difficulty, and submit them for review.
- **Question Verification:** Review pre-written questions for accuracy, clarity, and rigor. Edit where needed, rate difficulty, and document any changes made.

**Physics Domains Covered**

Theoretical Classical Mechanics, Optics & Acoustics, Electromagnetism & Photonics, Statistical Mechanics & Thermodynamics, Astrophysics & Cosmology, Particle & Nuclear Physics, Quantum Mechanics, Fluid Mechanics, Relativity, Solid State & Atomic Physics.

**Key Responsibilities**

- Author original physics questions that test deep conceptual understanding, not surface-level recall
- Ensure questions are unambiguous, self-contained, and precisely defined; all necessary information must be in the problem statement
- Rate each question's difficulty: Medium (intro undergraduate), Hard (advanced undergraduate), or Expert (post-graduate and above)
- Provide 1 correct answer and 9 plausible but subtly incorrect alternatives that challenge expert-level solvers
- Write step-by-step Chain-of-Thought solutions with clear, concise intermediate steps in markdown format
- Supply 1–5 academic references per question from reputable sources (peer-reviewed journals, university repositories)
- For verification tasks: flag issues with clarity, completeness, precision, or solvability and justify any edits made

**Ideal Qualifications**

- PhD or doctoral candidate in Physics, Applied Physics, Astrophysics, or a closely related field
- Master's degree considered for candidates with exceptional depth in a specific subdomain
- Strong command of graduate-level physics concepts and mathematical formalism
- Experience with rigorous academic problem design or physics olympiad writing is a strong plus
- Excellent written English and ability to express complex ideas clearly and concisely

**More About the Opportunity**

- Expected commitment: 10+ hours/week
- Asynchronous, fully remote work

Remote | 4/10/2026
$60 - $85 per hour

The platform is seeking expert bioinformaticians for a Docker toolkit curation project with one of the world's leading AI labs. In this role, you will specify the complete set of bioinformatics tools needed to solve specific classes of computational biology problems. You will review representative problems and validate that the final toolkit can solve them. Your selections directly shape how frontier models approach real scientific problems.

This is a toolkit specification role, not a problem-solving role. You are selecting and validating tools, not writing code or producing solutions.

**Ideal Qualifications:**

- PhD or Master's degree in bioinformatics, computational biology, genomics, or a closely related field.
- Hands-on experience with at least 3 of the following: protein structure analysis, variant effect prediction, RNA-seq/expression analysis, ChIP-seq/epigenomics, single-cell analysis, CRISPR screen analysis, molecular docking, or cheminformatics.
- Deep familiarity with the standard tool ecosystem for your subdomain (e.g., Rosetta, FoldX, DESeq2, MAGeCK, AutoDock Vina, HOMER, scanpy, etc.).
- Proficiency in Python and/or R for bioinformatics workflows.
- Familiarity with command-line bioinformatics tools (e.g., samtools, BEDTools, BLAST).
- Strong opinions on tool selection: you know what you'd install to solve a problem before you start.

**Key Responsibilities:**

- Review representative problems from your assigned problem class to understand the data shape and expected output.
- Specify a complete toolkit (Python/R packages, system tools, etc.), each item with a brief rationale.

**Timeline:**

- 2-week engagement starting upon onboarding.
- Assignments are distributed by problem class based on your domain expertise.

**Interview Process:**

- Short screening questionnaire to assess subdomain expertise and tool familiarity.

Remote | 4/10/2026
$60 - $130 per hour

The platform is seeking experienced **SAP ABAP developers** for a fast-paced pilot project with a leading AI research partner. In this role, you will apply your hands-on ABAP expertise to support structured data production workflows that help train and evaluate advanced AI systems. Your work will directly contribute to improving how large language models understand enterprise software and real-world development tasks.

You’re a great fit if you:

- **Have 3–5 years of recent, hands-on experience as an SAP ABAP developer**
- **Have worked in roles such as ABAP Developer or SAP Consultant (recent experience preferred over senior leadership titles)**
- Are comfortable writing, reviewing, and reasoning about ABAP code in production environments
- Can quickly ramp up in a structured, tool-driven workflow with clear guidelines
- Have strong attention to detail and can follow technical instructions precisely
- Can prioritize short-term, high-impact engagements
- Have solid written communication skills

Here are more details about the role:

- **Immediate start with a live onboarding**
- Short-term, high-intensity pilot (approximately 1 week) with potential to expand
- **Expected commitment of 10–20 hours during the week (flexible based on workload)**

Screening Process:

- Brief technical assessment focused on ABAP proficiency and attention to detail

Remote | 4/11/2026

**Japanese STEM Translation Reviewer**

We're looking for native Japanese speakers with a strong STEM background to evaluate AI-generated translations of science, technology, engineering, and math content.

## Day-to-day

- Review translated responses to STEM questions and

## You bring

- Native or near-native Japanese speaker
- Strong STEM background (degree or professional experience in science, engineering, math, or technology)
- Ability to evaluate both technical accuracy and linguistic quality
- Remote, hourly | ~1.25 hrs per task | Flexible

Remote | 4/11/2026

**Hindi STEM Translation Reviewer**

We're looking for native Hindi speakers with a strong STEM background to evaluate AI-generated translations of science, technology, engineering, and math content.

## Day-to-day

- Review translated responses to STEM questions and

## You bring

- Native or near-native Hindi speaker
- Strong STEM background (degree or professional experience in science, engineering, math, or technology)
- Ability to evaluate both technical accuracy and linguistic quality
- Remote, hourly | ~1.25 hrs per task | Flexible

Remote | 4/11/2026

**Mandarin STEM Translation Reviewer**

We're looking for native Mandarin speakers with a strong STEM background to evaluate AI-generated translations of science, technology, engineering, and math content.

## Day-to-day

- Review translated responses to STEM questions and

## You bring

- Native or near-native Mandarin speaker
- Strong STEM background (degree or professional experience in science, engineering, math, or technology)
- Ability to evaluate both technical accuracy and linguistic quality
- Remote, hourly | ~1.25 hrs per task | Flexible

Remote | 4/11/2026
$42 per hour

**STEM Undergraduates**

This listing is for STEM Undergraduates.

## You bring

- Currently enrolled STEM undergraduate, or holder of an undergraduate degree only (no advanced education)
- No explosives, pyrotechnics, demolition, or military EOD experience
- U.S. person currently located in a Five Eyes country

## Bonus

- Chemistry, physics, or engineering major (closer to relevant fundamentals)
- Comfortable with independent, open-ended research tasks
- Strong information literacy: can evaluate source credibility

Remote | 4/11/2026
$105 per hour

**Requirements**

- PhD in chemistry or closely related field (chemical engineering, materials science)
- **No prior chemical weapons synthesis experience**
- U.S. person currently located in a Five Eyes country

**Preferred qualifications**

- Strong organic or synthetic chemistry background
- Comfortable working independently on open-ended research tasks
- Ability to assess technical accuracy of procedural information

Remote | 4/11/2026

**Biosafety / Laboratory Operations - Senior**

_Provides the practical implementation layer: understands equipment procurement, facility setup, containment protocols, and how labs actually ope_

## You bring

- Experience working in BSL-2 or higher facility
- Knowledge of laboratory equipment, procurement channels, and containment protocols
- U.S. person currently located in a Five Eyes country

## Bonus

- Biosafety officer certification or training (e.g., ABSA credentials)
- Hands-on procurement and supply chain experience for lab consumables and equipment
- Understanding of biosafety failure modes and incident response
- Experience with institutional biosafety committees (IBC) or regulatory compliance

Remote | 4/11/2026
$105 per hour

**Biologist (PhD)**

_Provides the scientific knowledge layer: understands what is biologically feasible, relevant pathogen biology, and laboratory protocols._

## You bring

- PhD in biology, microbiology, virology, immunology, or closely related field
- U.S. person currently located in a Five Eyes country

## Bonus

- Familiarity with molecular biology techniques (cloning, PCR, cell culture)
- Publication record in relevant subdomains
- Experience interpreting primary research literature in biosecurity-adjacent areas

Remote | 4/11/2026

**Role Overview**

We are seeking expert operations research professionals to author and verify high-quality open-ended prompts for AI model evaluation. You will craft and review challenging, unambiguous optimization and decision-science problems across core subdomains, assessing AI reasoning quality and helping establish rigorous evaluation standards for frontier language models.

You will be assigned one of two task types:

- **Authoring Task:** Create 5 original, open-ended prompts from your assigned subdomain at varying difficulty levels (undergraduate, advanced undergraduate, or graduate/professional). Prompts should require human judgment to evaluate the quality of the AI's response, such as optimization modeling, algorithmic analysis, or stochastic reasoning.
- **Verification Task:** Review 5 authored prompts for clarity, scope alignment, difficulty accuracy, and uniqueness. Edit prompts and difficulty ratings where needed.

**Operations Research Subdomains Covered**

Linear & Integer Programming, Network Optimization & Graph Theory, Stochastic Models & Queuing Theory, Game Theory & Decision Analysis, Supply Chain & Logistics Optimization, Simulation & Metaheuristics.

**Key Responsibilities**

- Author clear, unambiguous, open-ended operations research prompts that elicit evaluable AI responses
- Verify prompts are within the scope of the assigned subdomain and correctly rated for difficulty
- Ensure all 5 prompts in a task are sufficiently distinct from one another with varying difficulty levels
- Apply expert judgment to assess the depth and quality of quantitative reasoning required
- Edit prompts and difficulty assignments where standards are not met

**Ideal Qualifications**

- Master's degree or higher in Operations Research, Industrial Engineering, Applied Mathematics, or a closely related field
- 2–6 years of professional or research experience in optimization, logistics, or decision science
- Strong command of mathematical programming, probabilistic modeling, and algorithmic methods
- Experience with solvers (Gurobi, CPLEX) or simulation tools is a strong plus
- Excellent written English and ability to craft precise, well-scoped technical questions

**More About the Opportunity**

- Expected commitment: 10+ hours/week
- Asynchronous, fully remote work

Remote | 4/11/2026

**Join a leading AI lab’s cutting-edge GenAI team to be at the core of the AI revolution, where your expertise fuels the development of the most advanced Large Language Models.**

## 1\. Overview

We are seeking **Professors across Finance, Accounting, Law, and other professional services domains** to contribute to a project supporting a frontier-model evaluation effort focused on coding and agentic workflows. You’ll design and validate challenging benchmark tasks to help surface and diagnose reasoning and problem-solving gaps in a target model. The work centers on building robust, real-world tasks with executable tests and then analyzing model/agent behavior.

This is a W-2 employment position with Cincinnatus LLC, with the opportunity to be placed at a leading AI Lab as part of their extended workforce. You will join a team of domain experts and together, you will guide the next generation of frontier AI tools.

## 2\. Key Responsibilities

- Task Design and Development: Design challenging, real-world domain-specific problems that serve as the foundation for agentic tasks. Problems should be constructed to target specific core capability loss failures identified in a frontier AI model.
- Spec & Golden Solution Generation: Integrate the problems into an Agentic development environment, preparing all necessary components using Python, which include:
  - Detailed Instructions and an overview of the required task.
  - A Golden solution that follows the instruction.
  - Any specific consultations and feedback with domain-specific knowledge.
- Evaluation and Analysis: Evaluate the cross model’s performance on the tasks
- Headroom Identification: Identify tasks where the target model fails to pass all tests, specifically classifying the failure as a logical reasoning failure
- Loss Extraction: Analyze the agent’s steps (Agent Trajectory) to observe and extract core capability loss patterns from the model.

## 3\. Core Qualifications

- Current or retired professor within Finance, Accounting, Law, etc.
- Degree in finance, accounting, law or relevant field
- Ability to engage reliably for at least 30 hours/week during weekdays (i.e., at least 6 hours/day during weekdays)
- Past experience in AI training, model evaluation and data annotation is preferred
- Basic ability to work independently and manage one’s time
- Verbal and written communication skills, problem solving skills, and interpersonal skills

## About Cincinnatus LLC

Cincinnatus LLC is an enterprise staffing company that partners with leading technology companies to source and employ highly skilled professionals for contingent and contract-based opportunities. Cincinnatus serves as the employer of record for these engagements, providing W-2 employment, payroll, benefits, and compliance, while placing employees directly within client teams to work on high-impact initiatives.

Roles hired through Cincinnatus are not project-based or freelance engagements. They are structured, role-based positions that typically involve part-time or full-time commitments, close collaboration with a client's internal teams, and integration into standard enterprise workflows.

Cincinnatus is a legal entity separate from the platform. While opportunities may be discovered through the platform, employment, onboarding, payroll, and benefits for these roles are administered by Cincinnatus LLC.

## Equal Employment Opportunity

Cincinnatus is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or any other legally protected characteristic.
Cincinnatus is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans throughout the job application process.

Remote | 4/11/2026
$70 - $95 per hour

**Join a leading AI lab’s cutting-edge GenAI team to be at the core of the AI revolution, where your expertise fuels the development of the most advanced Large Language Models.**

## 1\. Overview

We are seeking **Professors across STEM domains, including ML, Coding, Data Science, etc.** to contribute to a project supporting a frontier-model evaluation effort focused on coding and agentic workflows. You’ll design and validate challenging benchmark tasks to help surface and diagnose reasoning and problem-solving gaps in a target model. The work centers on building robust, real-world tasks with executable tests and then analyzing model/agent behavior.

This is a W-2 employment position with Cincinnatus LLC, with the opportunity to be placed at a leading AI Lab as part of their extended workforce. You will join a team of domain experts and together, you will guide the next generation of frontier AI tools.

## 2\. Key Responsibilities

- Task Design and Development: Design challenging, real-world domain-specific problems that serve as the foundation for agentic tasks. Problems should be constructed to target specific core capability loss failures identified in a frontier AI model.
- Spec & Golden Solution Generation: Integrate the problems into an Agentic development environment, preparing all necessary components using Python, which include:
  - Detailed Instructions and an overview of the required task.
  - A Golden solution that follows the instruction.
  - Any specific consultations and feedback with domain-specific knowledge.
- Evaluation and Analysis: Evaluate the cross model’s performance on the tasks
- Headroom Identification: Identify tasks where the target model fails to pass all tests, specifically classifying the failure as a logical reasoning failure
- Loss Extraction: Analyze the agent’s steps (Agent Trajectory) to observe and extract core capability loss patterns from the model.

## 3\. Core Qualifications

- Current or retired professor within STEM, including ML, coding, data science, etc.
- Degree in computer science, data science, or relevant STEM field
- Ability to engage reliably for at least 30 hours/week during weekdays (i.e., at least 6 hours/day during weekdays)
- Past experience in AI training, model evaluation and data annotation is preferred
- Basic ability to work independently and manage one’s time
- Verbal and written communication skills, problem solving skills, and interpersonal skills

## About Cincinnatus LLC

Cincinnatus LLC is an enterprise staffing company that partners with leading technology companies to source and employ highly skilled professionals for contingent and contract-based opportunities. Cincinnatus serves as the employer of record for these engagements, providing W-2 employment, payroll, benefits, and compliance, while placing employees directly within client teams to work on high-impact initiatives.

Roles hired through Cincinnatus are not project-based or freelance engagements. They are structured, role-based positions that typically involve part-time or full-time commitments, close collaboration with a client's internal teams, and integration into standard enterprise workflows.

Cincinnatus is a legal entity separate from the platform. While opportunities may be discovered through the platform, employment, onboarding, payroll, and benefits for these roles are administered by Cincinnatus LLC.

## Equal Employment Opportunity

Cincinnatus is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or any other legally protected characteristic.
Cincinnatus is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans throughout the job application process.

Remote | 4/11/2026
$80 - $110 per hour

**Biology Expert (General)**

The team needs PhD-level experts with experience applying design, optimization, prediction, and scientific reasoning to biological or chemical systems. Ideal candidates have worked across experimental and/or computational settings, and can interpret complex data to guide research and development decisions.

## Day-to-day

- Ability to extract insights from noisy, high-dimensional data
- Ability to generate testable hypotheses and evaluate competing explanations
- Experience synthesizing evidence across experiments or datasets
- Able to communicate complex ideas clearly and concisely

## You bring

- 3-8+ years of relevant research or industry experience
- Strong quantitative and analytical skills
- Comfortable working across data

Remote | 4/12/2026
$56 - $70 per hour

**Business Operations Associate**

This role supports a fast-growing AI lab building advanced technology for enterprise use cases. The team operates with high velocity and is led by an experienced founder/CEO.

## Details

- Support event planning (conferences, customer dinners, demo days)
- Act as a project coordinator across engineering, sales, and customer success teams

## Day-to-day

- 1) Day-to-Day Execution
  - Own vendor logistics and coordination across software tools, service providers, contractors, and compliance platforms
  - Drive follow-ups across teams to ensure accountability and timely execution
  - Conduct research, data entry, and preparation to support sales, fundraising, and partnership initiatives
- 2) Process & Operations
  - Build and manage recurring workflows (e.g., investor updates, onboarding checklists, compliance cycles)
  - Create and maintain SOPs and documentation as systems the team
  - Coordinate hiring logistics

## You bring

- 1-3 years of experience in operations, Chief of Staff, EA, consulting, or startup environments
- Exceptional organisational skills with a strong bias toward documentation and systems
- Ability to context-switch effectively across tasks (e.g., vendor management, investor communication, onboarding workflows)
- Strong written communication skills
- Comfortable drafting on behalf of leadership
- Highly proactive
- Identifies and solves problems before they escalate

## Bonus

- Direct exposure to the founder and company-building at an early stage
- High ownership with rapidly expanding scope
- Opportunity to build foundational systems in a fast-scaling AI company
- Work at the intersection of operations, strategy, and execution in one of the most important technology sectors today

Remote | 4/12/2026
$100 per hour

**About the Role**

We're looking for licensed attorneys to review a corpus of legal documents and assess whether their meaning and logical flow are fully preserved. Reviewers will evaluate documents at the sentence and paragraph level, flagging any instances where content appears incomplete or ambiguous, or where the intended legal meaning has been lost or altered.

This is a short-term, part-time engagement with an expected duration of 3 days.

**Responsibilities**

- Review redacted legal documents for semantic coherence and completeness
- Flag passages where meaning is unclear, inconsistent, or appears to have been disrupted
- Provide brief written rationale for any flagged content
- Complete assigned document batches within daily targets

**Qualifications**

- JD required
- Prior experience with legal records, court filings, or litigation documents strongly preferred
- Experience with AI or document annotation projects is a plus
- Strong attention to detail and ability to work efficiently under time constraints

Remote | 4/14/2026
$80 - $150 per hour

**About the Role**

We're looking for creative, detail-oriented experts to help train the next generation of AI agents. You'll design realistic, complex digital worlds and craft challenging scenarios that test how well AI navigates real-world tasks: think scheduling conflicts, information overload, competing priorities, and more.

**What You'll Do**

- Build richly detailed personas and simulated digital environments (Gmail, Slack, Calendar, WhatsApp, Google Drive, and more)
- Write tasks that challenge an AI agent's ability to reason, filter, and prioritize
- Run the agent against your scenarios, evaluate its performance, and guide it to success through structured hints
- Document your work clearly in Airtable and Crucible

**You're a Good Fit If You**

- Have strong written communication and attention to detail
- Think creatively and enjoy constructing layered, realistic scenarios
- Are comfortable working with structured tools like JSON editors, Airtable, and AI platforms
- Can spot inconsistencies and think critically about information quality
- Work independently and manage your own workflow
- **Must Have**
  - Undergrad degree, 2+ years professional experience.
- **Nice to have:**
  - **Writing chops:** copywriter, UX writer, journalist, content strategist, editor, published blog/newsletter, creative writing background. Strong writers make the best task creators _and_ reviewers
  - **Multi-stakeholder coordination:** PM, account manager, consultant, client success, event coordinator, producer. Creates richer, more authentic professional scenarios
  - **World-building / creative design:** game designer, curriculum designer, UX researcher, simulation designer, tabletop RPG creator
  - **Technical / data comfort:** JSON familiarity, Jira/Asana power user, technical PM, data analyst, SQL/Python exposure. A plus but not required; we'll provide a JSON onboarding guid

Remote | 4/14/2026

**Location:** US-Based and Non-US-Based

**Type**: Full-time or Part-time Contract Work

**Fluent Language Skills Required:** English

**Why This Role Exists**

The platform partners with leading AI teams to improve the quality, usefulness, and reliability of general-purpose conversational AI systems. These systems are used across a wide range of everyday and professional scenarios, and their effectiveness depends on how clearly, accurately, and helpfully they respond to real user questions.

In coding and software engineering contexts, conversational AI systems must demonstrate correct reasoning, strong problem-solving ability, and adherence to real-world engineering best practices. This project focuses on evaluating and improving how models reason about code, generate solutions, and explain technical concepts across a variety of programming tasks and complexity levels.

**What You’ll Do**

- **Evaluate LLM-generated responses** to coding and software engineering queries for accuracy, reasoning, clarity, and completeness
- **Conduct fact-checking** using trusted public sources and authoritative references
- Conduct accuracy testing by **executing code and validating outputs using appropriate tools**
- **Annotate model responses** by identifying strengths, areas of improvement, and factual or conceptual inaccuracies
- Assess code quality, readability, algorithmic soundness, and explanation quality
- Ensure **model responses align with expected conversational behavior** and system guidelines
- **Apply consistent evaluation standards** by following clear taxonomies, benchmarks, and detailed evaluation guidelines

**Who You Are**

- You hold a **BS, MS, or PhD in Computer Science or a closely related field**
- You have **significant (3+ years) real-world experience in software engineering** or related technical roles
- You are an expert in **at least two relevant programming languages (e.g., Python, Java, C++, C, JavaScript, Go, Rust, Ruby, SQL, PowerShell, Bash, Swift, Kotlin, R, TypeScript, HTML/CSS)**
- You are able to solve **HackerRank or LeetCode Medium- and Hard-level problems independently**
- You have experience contributing to well-known open-source projects, including merged pull requests
- You have **significant experience using LLMs while coding** and understand their strengths and failure modes
- **You have strong attention to detail** and are **comfortable evaluating complex technical reasoning**, identifying subtle bugs or logical flaws

**Nice-to-Have Specialties**

- Prior experience with RLHF, model evaluation, or data annotation work
- Track record in competitive programming
- Experience reviewing code in production environments
- Familiarity with multiple programming paradigms or ecosystems
- Experience explaining complex technical concepts to non-expert audiences

**What Success Looks Like**

- You identify incorrect logic, inefficiencies, edge cases, or misleading explanations in model-generated code, technical concepts, and system design discussions
- Your feedback improves the correctness, robustness, and clarity of AI coding outputs
- You deliver reproducible evaluation artifacts that strengthen model performance
- The platform's customers trust AI systems to assist reliably with real-world coding tasks

**Why Join the platform**

At the platform, experienced software engineers play a direct role in shaping how AI systems reason about and generate code. This remote role allows you to apply your technical expertise to high-impact AI development work, improving systems used by developers around the world.

๐ŸŒ Remote4/14/2026
Apply โ†’
$35 - $40 per hour

The platform is seeking detail-oriented writing experts to contribute to a high-impact AI research project with a leading lab. Freelancers will author prompt–golden answer pairs that train and evaluate advanced language models. This is a short-term, flexible opportunity for professionals with strong academic backgrounds and a knack for instructional clarity. Ideal for those who enjoy distilling complex concepts into well-crafted text.

* * *

### **Job Details:**

- **Design and Optimize Prompts**: Create detailed prompts with multiple constraints and instructions.
- **Define and Document Evaluation Standards**: Establish high-level expectations for correct responses in general consumer contexts, and develop a comprehensive rubric.
- **Conduct Model Testing and Grading**: Run prompts through models and assess preliminary outputs against expectations.
- **Support Benchmarking and Quality Assurance**: Collaborate in QA review processes to ensure prompt tasks and rubrics meet rigor, maintaining consistency and reliability before integration into official benchmarks.

### **Minimum Qualifications:**

- BS or BA from a reputable institution, completed or in progress.
- Strong writing and critical thinking skills.
- Ability to work independently and meet deadlines.
- Significant familiarity with ChatGPT or similar tools for personal decision-making or hobbies / general interests.
- US or Canada based.

### **Preferred Qualifications:**

- Experience in teaching or research.

### **Application & Onboarding Process:**

- Complete an AI-led interview, which should take around 15 minutes.
- Complete a 45-minute written assessment that will guide you through writing rubrics.
- If selected, you will be invited to work on the project.

### **More Details About This Role:**

- Expect to contribute at least **20 hours per week**.
- Expect a commitment of around 2 months.
- You’ll be working in a structured project environment with clear goals and tools.

๐ŸŒ Remote4/14/2026
Apply โ†’

**The platform is hiring experienced Mathematics, Biology, Chemistry, and Physics professionals** to support a variety of high-impact research collaborations with leading AI labs. Freelancers will help improve AI systems through work on code validation, prompt refinement, algorithmic evaluation, and model benchmarking. This is a unique opportunity to apply your engineering expertise toward shaping the next generation of intelligent systems.

**Key Responsibilities**

- **Scientific Research Translation:** Identify and analyze high-impact research papers in Physics, Chemistry, Math, or Science that introduce novel, code-implementable methods.
- **Complex Problem Decomposition:** Break down high-level scientific problems into a series of logical, independently testable subproblems.
- **Golden Solution Authoring:** Develop "Gold Standard" Python solutions for each subproblem and create corresponding unit tests to verify accuracy.
- **Quality Control and Review:** Review peer prompts, golden solutions, and unit tests for accuracy, comprehensiveness, and difficulty.

**Ideal Qualifications**

- **Degree:** Bachelor’s, Master’s, or PhD in a STEM field (Physics, Chemistry, Mathematics, Biochemistry, or related Computational Sciences).
- **Expert Python Proficiency:** Strong experience in scientific computing libraries (e.g., NumPy, SciPy, Matplotlib, JAX, or domain-specific packages like RDKit or OpenMM).
- **Technical Writing:** Ability to translate dense academic research into clear, structured problem statements for a technical audience.
- **Version Control:** Proficiency with Git, including branch management and pull request workflows.
- **Unit Testing Experience:** Familiarity with testing frameworks (like `pytest`) and the ability to write robust validation scripts for mathematical/scientific code.

**Project Timeline**

- Start Date: Immediate
- Duration: 1–2 months
- Commitment: Part-time (15–25 hours/week, with flexibility up to 80 hours/week)

**Application & Onboarding Process**

- Upload your resume
- AI interview: 3 short, 15-minute conversational sessions to understand your background, experience, and interest in the role
- Follow-up communication within a few days with next steps and onboarding details

**Apply today and leverage your data science expertise to help build the future of AI-driven systems!**

๐ŸŒ Remote
๐Ÿ‡บ๐Ÿ‡ธ
4/2/2026
Apply โ†’

### **Role Overview**

The platform is collaborating with a leading AI lab to contract experienced professionals for an AI model evaluation project. Contractors will assess the quality, accuracy, and safety of AI-generated responses across health/medical subjects. The project offers an opportunity to directly improve the reliability of AI systems in high-stakes contexts where inaccurate information carries serious risk.

**Key Responsibilities**

- Write realistic prompts that reflect how professionals and consumers seek domain-specific guidance
- Evaluate AI-generated responses for factual accuracy, regulatory or clinical correctness, and practical usefulness
- Identify fabricated claims, incorrect references, or misleading reasoning across model outputs
- Score and rank multiple model responses using structured rubrics across dimensions
- Provide written justifications with specific evidence for each evaluation

**Ideal Qualifications**

- Master’s degree or higher in Health or a relevant professional field
- Professional experience applying domain expertise in a practitioner or advisory capacity
- Familiarity with industry-specific standards, regulations, or clinical guidelines
- Strong written communication and critical reasoning skills

**More About the Opportunity**

- Expected commitment: ~20 hours/week

**Application Process**

- Submit your resume to begin
- Complete the Model Response Evaluation assessment

๐ŸŒ Remote
๐Ÿ‡บ๐Ÿ‡ธ
4/2/2026
Apply โ†’

Professional Domain Expert (Government/Non-Profit): hourly contract role on the platform.

๐ŸŒ Remote
๐Ÿ‡บ๐Ÿ‡ธ
4/2/2026
Apply โ†’
$50 - $75 per hour

### **Role Overview**

The platform is collaborating with a leading AI lab to contract experienced professionals for an AI model evaluation project. Contractors will assess the quality, accuracy, and safety of AI-generated responses across legal subjects. The project offers an opportunity to directly improve the reliability of AI systems in high-stakes contexts where inaccurate information carries serious risk.

**Key Responsibilities**

- Write realistic prompts that reflect how professionals and consumers seek domain-specific guidance
- Evaluate AI-generated responses for factual accuracy, regulatory or clinical correctness, and practical usefulness
- Identify fabricated claims, incorrect references, or misleading reasoning across model outputs
- Score and rank multiple model responses using structured rubrics across dimensions
- Provide written justifications with specific evidence for each evaluation

**Ideal Qualifications**

- Master’s degree or higher in Legal or a relevant professional field
- Professional experience applying domain expertise in a practitioner or advisory capacity
- Familiarity with industry-specific standards, regulations, or clinical guidelines
- Strong written communication and critical reasoning skills

**More About the Opportunity**

- Expected commitment: ~20 hours/week

**Application Process**

- Submit your resume to begin
- Complete the Model Response Evaluation assessment

๐ŸŒ Remote
๐Ÿ‡บ๐Ÿ‡ธ
4/2/2026
Apply โ†’

We are seeking a Software Engineer III to join an onsite team in Redmond, focused on building and maintaining OpenXR-based XR applications. In this role, you will collaborate closely with engineers and cross-functional partners to prototype, implement, and iterate on immersive experiences that run across OpenXR-capable runtimes and devices.

## Responsibilities

- Develop XR applications using OpenXR, covering architecture, implementation, testing, and iteration.
- Build interactive features including:
  - Input handling (controllers, hands, action sets, haptics)
  - Scene and interaction systems (grabbing, ray interactions, UI in 3D)
  - Rendering and performance optimizations (frame pacing, latency-sensitive updates)
- Integrate platform/runtime features where applicable (tracking spaces, anchor-like constructs, passthrough/scene understanding via extensions).
- Create clean, testable code and contribute to basic CI/build scripts as needed.
- Debug runtime and device issues related to graphics, tracking, and input; provide clear reproduction steps and fixes.
- Collaborate with product, UX, and engineering stakeholders; document designs and tradeoffs.

## Qualifications

- Minimum Qualifications:
  - 3+ years of professional software development experience (or equivalent).
  - Hands-on experience shipping 3D real-time applications such as XR, games, simulation, or visualization.
  - Practical experience with OpenXR core concepts including instance/session, swapchains, spaces, and actions.
  - Strong skills in C/C++ and/or C#, along with solid debugging abilities.
  - Experience with a real-time engine or framework such as:
    - Unity (C#) and OpenXR plugin ecosystem, or
    - Unreal (C++) XR pipeline, or
    - Custom/native OpenXR rendering with Vulkan/OpenGL/DirectX.
  - Understanding of rendering and performance constraints for XR (72/90/120 FPS targets, GPU/CPU bottlenecks, latency).
- Preferred Qualifications:
  - Experience shipping at least one OpenXR-based app or feature to production.
  - Familiarity with OpenXR extensions (e.g., hand tracking, eye gaze, foveated rendering, scene/space-related extensions).
  - Graphics experience with Vulkan/OpenGL/DirectX, shaders, and profiling tools such as RenderDoc or engine profilers.
  - Android XR experience including Gradle, NDK, JNI, or low-level platform integration.
  - Experience building reusable components or frameworks for XR interaction or app scaffolding.

Pursuant to the California Fair Chance Act, Los Angeles County Fair Chance Ordinance for Employers, Los Angeles Fair Chance Initiative for Hiring Ordinance, and San Francisco Fair Chance Ordinance, qualified applicants will be considered for assignment with arrest and conviction records. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness, meet client expectations, standards, and accompanying requirements, and safeguard business operations and company reputation.

## Work Authorization

Applicants must be located in the United States. Cincinnatus does not sponsor visas and will not provide visa sponsorship for this role.

## About Cincinnatus

Cincinnatus is an enterprise staffing company that partners with leading technology companies to source and employ highly skilled professionals for full-time and long-term contingent roles. Cincinnatus serves as the employer of record for these engagements, providing W-2 employment, payroll, benefits, and compliance, while placing employees directly within client teams to work on high-impact initiatives.

Roles hired through Cincinnatus are not project-based or freelance engagements. They are structured, role-based positions that typically involve full-time or fixed-term commitments, close collaboration with a client's internal teams, and integration into standard enterprise workflows.

Cincinnatus is a legal entity separate from the platform. While opportunities may be discovered through the platform, employment, onboarding, payroll, and benefits for these roles are administered by Cincinnatus.

## Equal Employment Opportunity

Cincinnatus is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or any other legally protected characteristic. Cincinnatus is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans throughout the job application process.

๐ŸŒ Remote
๐Ÿ‡บ๐Ÿ‡ธ
4/2/2026
Apply โ†’

Partner with Industrial Design, Color Material Finish, and Product Design Engineering teams to enable the definition and execution of next-generation hardware products, New Technology Introductions, and Special Edition products. Advocate for the Industrial Design (ID) and Color, Material, and Finish (CMF) vision by partnering closely with cross-functional teams to ensure the ID CMF point of view is authentically represented and meaningfully integrated into the product development process.

## Responsibilities

- Coordinate and drive Industrial Design deliverables including specifications, DOE, and validation plans across multiple concurrent projects.
- Work with cross-functional teams (Industrial Design, CMF, SMEs, System Product Development, Reliability, Operations) and other groups to develop best practices and technology development processes in a fast-paced environment.
- Manage delivery of ID CMF configurations for development builds and drive qualification to meet mass production timelines.
- Create development, delivery, and qualification schedules based on program requirements, technical challenges, lead-times, and business needs.
- Manage program structure to optimize clarity and efficiency for program teams, leadership, and external customers.
- Communicate program statuses, identifying potential setbacks, including vendor tooling schedules and CMF design sprints.
- Manage communication within and between internal cross-functional teams.
- Ensure program documents are complete, current, and available for staff and leadership review.
- Contribute to resource planning to ensure program success.

## Qualifications

- 7+ years of experience working in a mechanical engineering or industrial design field.
- Experience in first-party and/or second-party product development, defining requirements and program constraints, and collaborating with both internal and external stakeholders.
- Experience in colors, plastics/textiles color management, understanding of color palettes, managing color schedules, and communications with ID/PD teams to keep projects on time.
- Preferred:
  - Experience working in big tech.
  - Experience working in a fast-paced environment.
  - Experience in consumer electronics, hardware, or mechanical products.

## Work Authorization

Applicants must be located in the United States. Cincinnatus does not sponsor visas and will not provide visa sponsorship for this role.

## About Cincinnatus

Cincinnatus is an enterprise staffing company that partners with leading technology companies to source and employ highly skilled professionals for full-time and long-term contingent roles. Cincinnatus serves as the employer of record for these engagements, providing W-2 employment, payroll, benefits, and compliance, while placing employees directly within client teams to work on high-impact initiatives.

Roles hired through Cincinnatus are not project-based or freelance engagements. They are structured, role-based positions that typically involve full-time or fixed-term commitments, close collaboration with a client's internal teams, and integration into standard enterprise workflows.

Cincinnatus is a legal entity separate from the platform. While opportunities may be discovered through the platform, employment, onboarding, payroll, and benefits for these roles are administered by Cincinnatus.

## Equal Employment Opportunity

Cincinnatus is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or any other legally protected characteristic. Cincinnatus is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans throughout the job application process.

๐ŸŒ Remote
๐Ÿ‡บ๐Ÿ‡ธ
4/2/2026
Apply โ†’

### **Role Overview**

The platform is collaborating with a leading AI lab to contract experienced professionals for an AI model evaluation project. Contractors will assess the quality, accuracy, and safety of AI-generated responses across retail/wholesale subjects. The project offers an opportunity to directly improve the reliability of AI systems in high-stakes contexts where inaccurate information carries serious risk.

**Key Responsibilities**

- Write realistic prompts that reflect how professionals and consumers seek domain-specific guidance
- Evaluate AI-generated responses for factual accuracy, regulatory or clinical correctness, and practical usefulness
- Identify fabricated claims, incorrect references, or misleading reasoning across model outputs
- Score and rank multiple model responses using structured rubrics across dimensions
- Provide written justifications with specific evidence for each evaluation

**Ideal Qualifications**

- Master’s degree in a relevant professional field
- Professional experience applying domain expertise in a practitioner or advisory capacity
- Familiarity with industry-specific standards, regulations, or clinical guidelines
- Strong written communication and critical reasoning skills

**More About the Opportunity**

- Expected commitment: ~20 hours/week

**Application Process**

- Submit your resume to begin
- Complete the Model Response Evaluation assessment

๐ŸŒ Remote
๐Ÿ‡บ๐Ÿ‡ธ
4/2/2026
Apply โ†’
$25 - $35 per hour

### **Role Overview**

The platform is collaborating with a leading AI lab to contract experienced professionals for an AI model evaluation project. Contractors will assess the quality, accuracy, and safety of AI-generated responses across manufacturing subjects. The project offers an opportunity to directly improve the reliability of AI systems in high-stakes contexts where inaccurate information carries serious risk.

**Key Responsibilities**

- Write realistic prompts that reflect how professionals and consumers seek domain-specific guidance
- Evaluate AI-generated responses for factual accuracy, regulatory or clinical correctness, and practical usefulness
- Identify fabricated claims, incorrect references, or misleading reasoning across model outputs
- Score and rank multiple model responses using structured rubrics across dimensions
- Provide written justifications with specific evidence for each evaluation

**Ideal Qualifications**

- Professional experience applying domain expertise in a practitioner or advisory capacity
- Familiarity with industry-specific standards, regulations, or clinical guidelines
- Strong written communication and critical reasoning skills

**More About the Opportunity**

- Expected commitment: ~20 hours/week

**Application Process**

- Submit your resume to begin
- Complete the Model Response Evaluation assessment

๐ŸŒ Remote
๐Ÿ‡บ๐Ÿ‡ธ
4/2/2026
Apply โ†’

### **Role Overview**

The platform is collaborating with a leading AI lab to contract experienced professionals for an AI model evaluation project. Contractors will assess the quality, accuracy, and safety of AI-generated responses across real estate related subjects. The project offers an opportunity to directly improve the reliability of AI systems in high-stakes contexts where inaccurate information carries serious risk.

**Key Responsibilities**

- Write realistic prompts that reflect how professionals and consumers seek domain-specific guidance
- Evaluate AI-generated responses for factual accuracy, regulatory or clinical correctness, and practical usefulness
- Identify fabricated claims, incorrect references, or misleading reasoning across model outputs
- Score and rank multiple model responses using structured rubrics across dimensions
- Provide written justifications with specific evidence for each evaluation

**Ideal Qualifications**

- Professional experience applying domain expertise in a practitioner or advisory capacity
- Familiarity with industry-specific standards, regulations, or clinical guidelines
- Strong written communication and critical reasoning skills

**More About the Opportunity**

- Expected commitment: ~20 hours/week

**Application Process**

- Submit your resume to begin
- Complete the Model Response Evaluation assessment

๐ŸŒ Remote
๐Ÿ‡บ๐Ÿ‡ธ
4/2/2026
Apply โ†’
$200 - $400 one-time

Excel/PowerPoint/Document Style Experts: part-time contract role on the platform.

๐ŸŒ Remote
๐Ÿ‡บ๐Ÿ‡ธ
4/2/2026
Apply โ†’

The platform is seeking detail-oriented transcribing/writing experts to contribute to a high-impact audio AI research project with a leading lab. Freelancers will listen to and transcribe audio, annotate images, and evaluate videos to help train advanced language models. This is a ~6 month short-term, flexible opportunity for professionals with strong academic backgrounds and a knack for instructional clarity. Ideal for those who enjoy distilling complex concepts into well-crafted text.

## **Job Details:**

- **Transcribe and Optimize Audio/Video**: Create detailed transcriptions from Romanian audio, following multiple constraints and instructions.
- **Define and Document Evaluation Standards**: Establish high-level expectations for correct responses in general audio consumer contexts, and develop a comprehensive rubric.
- **Conduct Model Testing and Grading**: Run prompts through models and assess preliminary outputs against expectations.
- **Support Benchmarking and Quality Assurance**: Collaborate in QA review processes to ensure prompt tasks and rubrics meet rigor, maintaining consistency and reliability before integration into official benchmarks.

## **Minimum Qualifications:**

- BS or BA from a reputable institution, completed or in progress.
- Strong writing and critical thinking skills.

## **Preferred Qualifications:**

- College students/grads

## **Application & Onboarding Process:**

- Complete an AI-led interview, which should take around 15 minutes.
- If selected, you will be invited to work on the project.

## **More Details About This Role:**

- Expect to contribute at least **20 hours per week**.
- Expect a commitment of around 2-4 weeks.

๐ŸŒ Remote
ro
4/2/2026
Apply →
$70 - $95 per hour

Design Engineer III

This role involves performing comprehensive power analysis throughout various design stages, from RTL to GDSII. The engineer will contribute to the development, improvement, and automation of power analysis flows, including gathering and analyzing large datasets for power modeling through automated processes. Additionally, the role requires investigating and addressing power inefficiencies and providing actionable feedback to the RTL design team.

## Responsibilities

- Perform comprehensive power analysis at various design stages, spanning from RTL to GDSII.
- Contribute to the development, improvement, and automation of various power analysis flows, including gathering and analyzing large datasets for power modeling via automated processes.
- Investigate and address power inefficiencies, providing actionable feedback to the RTL design team.

## Qualifications

- Demonstrated experience with RTL-to-GDSII design flow usage and development in advanced technology nodes (7nm and below).
- Expertise in low-power implementation and signoff, including power gating, multiple voltage rails, and Unified Power Format (UPF) usage.
- Proven experience in power analysis and reduction utilizing industry-standard tools such as PrimeTime PX/PrimePower.
- Proficiency in scripting languages, with a strong emphasis on Python and ML/AI frameworks.
- Working knowledge of RTL design principles.
- Experience in RTL power optimization using specialized tools like PowerArtist.
- Ability to learn quickly, explore new ideas, and iterate rapidly to achieve optimal results.
- Experience with synthesis and Place and Route (PnR) flows (preferred).
- Experience analyzing Intellectual Property (IP) design for power characteristics and building run-time estimation models for use by software/firmware teams (preferred).
- Skills in data analysis and data modeling, including the application of machine learning approaches (preferred).
- Bachelor’s degree in Electrical/Computer Engineering or Computer Science required.
- Master’s degree preferred but not required.

Pursuant to the California Fair Chance Act, Los Angeles County Fair Chance Ordinance for Employers, Los Angeles Fair Chance Initiative for Hiring Ordinance, and San Francisco Fair Chance Ordinance, qualified applicants will be considered for assignment with arrest and conviction records. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness, meet client expectations, standards, and accompanying requirements, and safeguard business operations and company reputation.

## Work Authorization

Applicants must be located in the United States. Cincinnatus does not sponsor visas and will not provide visa sponsorship for this role.

## About Cincinnatus

Cincinnatus is an enterprise staffing company that partners with leading technology companies to source and employ highly skilled professionals for full-time and long-term contingent roles. Cincinnatus serves as the employer of record for these engagements, providing W-2 employment, payroll, benefits, and compliance, while placing employees directly within client teams to work on high-impact initiatives.

Roles hired through Cincinnatus are not project-based or freelance engagements. They are structured, role-based positions that typically involve full-time or fixed-term commitments, close collaboration with a client's internal teams, and integration into standard enterprise workflows.

Cincinnatus is a legal entity separate from the platform. While opportunities may be discovered through the platform, employment, onboarding, payroll, and benefits for these roles are administered by Cincinnatus.

## Equal Employment Opportunity

Cincinnatus is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or any other legally protected characteristic. Cincinnatus is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans throughout the job application process.

๐ŸŒ Remote
๐Ÿ‡บ๐Ÿ‡ธ
4/2/2026
Apply โ†’
$25 - $50 per hour

**Role Overview**

We are seeking expert mathematicians to author and verify high-quality open-ended prompts for AI model evaluation. You will craft and review challenging, unambiguous mathematical problems across core subdomains, assessing AI reasoning quality and helping establish rigorous evaluation standards for frontier language models.

**You will be assigned one of two task types:**

**Authoring Task**

Create 5 original, open-ended prompts from your assigned subdomain at varying difficulty levels (undergraduate, advanced undergraduate, or graduate/professional). Prompts should require human judgment to evaluate the quality of the AI's response, such as chain-of-thought reasoning or proof construction.

**Verification Task**

Review 5 authored prompts for clarity, scope alignment, difficulty accuracy, and uniqueness. Edit prompts and difficulty ratings where needed.

**Mathematics Subdomains Covered**

Probability & Statistics, Algebra (incl. Linear Algebra), Ordinary/Partial Differential Equations & Dynamical Systems, Geometry, Graph Theory, Number Theory.

**Key Responsibilities**

- Author clear, unambiguous, open-ended mathematical prompts that elicit evaluable AI responses
- Verify prompts are within the scope of the assigned subdomain and correctly rated for difficulty
- Ensure all 5 prompts in a task are sufficiently distinct from one another with varying difficulty levels
- Apply expert judgment to assess the depth and quality of mathematical reasoning required
- Edit prompts and difficulty assignments where standards are not met

**Ideal Qualifications**

- Master's degree or higher in Mathematics, Applied Mathematics, Statistics, or a closely related field
- 2–6 years of professional or research experience in a quantitative field
- Strong command of graduate-level mathematical concepts including proof writing, analysis, and formal reasoning
- Experience in academic research, mathematical competition design, or quantitative industry roles is a plus
- Excellent written English and ability to craft precise, well-scoped technical questions

**More About the Opportunity**

- Expected commitment: 10+ hours/week
- Asynchronous, fully remote work

๐ŸŒ Remote
๐Ÿ‡บ๐Ÿ‡ธ
4/2/2026
Apply โ†’

The platform is working with a **leading intelligence AI lab** to identify the most important open questions in core AI/ML fields and to build structured knowledge bases that could meaningfully accelerate progress over the next decade. We’re looking for exceptional PhD students and PostDocs with a clear point of view on what problems truly matter in their field and the depth to define how those problems could be tackled.

### Eligibility Requirement

You will need to fill out a short form to be eligible for this role. The form appears alongside the AI interview in your application process. Below is guidance on what you will need in order to fill out the form:

Consider the biggest open questions in your field, for example, the 10–15 questions where a breakthrough would make headline news. From this set, select those closest to your area of expertise: questions within or adjacent to your specialty, or those where you could mentor an expert toward meaningful progress.

Specifically, we are looking for questions where:

- A major breakthrough would be widely recognized as transformative (e.g., headline news in _Nature_, _Science_, or top field-specific venues)
- Meaningful progress is plausible within the next decade (not purely speculative or dependent on unknown technology)
- The question is concrete enough that progress can be evaluated (avoid umbrella questions like “How does the ML work?”)
- You have the relevant expertise to assemble a comprehensive knowledge base directly relevant to the question

### What You’ll Do

**1. Identify high-impact open questions**

- Propose major open questions where a breakthrough would be transformative
- Focus on problems that are concrete, tractable, and close to your expertise

**2. Build a knowledge base for selected questions**

- Seminal papers, key datasets, methods, recent advances, and “hidden gems”
- Assume an extremely strong expert who has all knowledge up until 6 months ago (1st October, 2025)

**Time commitment:** ~8–16 hours per selected question

### Who We’re Looking For

- PhD candidates or PostDocs from top-tier institutions
- Deep expertise in AI/ML/Engineering
- Strong judgment about significance, tractability, and research quality
- Ability to synthesize large bodies of literature into clear learning paths
- **Openings:** ~50 experts per domain

### Expected Outputs

- Clearly articulated, high-impact open research questions
- Structured reading lists (~30–100 sources per question)
- Brief expert commentary on why each source and approach matters

๐ŸŒ Remote
๐Ÿ‡บ๐Ÿ‡ธ
4/2/2026
Apply โ†’

**Location:** Remote **(must have access to a physical Mac with M1 processor or higher and macOS 15.6 or higher)**

**Fluent Language Skills Required:** English

**Why This Role Exists**

The platform is supporting a high-priority data collection initiative aimed at improving how AI systems understand complex software interfaces and real-world, multi-step workflows. Current datasets lack the fidelity and expert grounding needed to reflect authentic professional software usage. This project addresses that gap by collecting high-quality screen annotations and screen recordings performed by experienced domain experts working in real digital environments.

**What You’ll Do**

Depending on the task phase, you may be asked to complete one or both of the following:

- Record screen sessions demonstrating specific tasks, accompanied by clear verbal narration explaining each step
- Annotate screenshots of professional software by drawing precise bounding boxes around relevant UI elements
- Follow provided staging instructions to set up specific UI states prior to recording
- Use a custom capture tool to record workflows accurately and consistently
- Adhere closely to task guidelines to ensure data quality and usability

**Who You Are**

- You have strong familiarity with professional software tools used in your domain, including:
  - Blender
  - Godot
  - GIMP
  - R
  - Wings 3D
- You are detail-oriented and capable of following precise instructions
- You are comfortable working independently and meeting tight deadlines
- You have access to a physical Mac (M1 chip or higher and macOS 15.6 or higher) and can create a fresh macOS user profile if required

**Nice-to-Have**

- Prior experience with data collection, annotation, or QA work
- Experience recording or documenting workflows
- Comfort working with new tools and staged environments

**What Success Looks Like**

- Screen annotations are precise, consistent, and aligned with guidelines
- Screen recordings accurately capture realistic, expert workflows
- Tasks are completed efficiently while maintaining high quality
- Collected data is usable at scale for downstream AI research and development

We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.

๐ŸŒ Remote
๐Ÿ‡บ๐Ÿ‡ธ
4/2/2026
Apply โ†’

**Location:** Remote **(must have access to a physical Mac)**

**Fluent Language Skills Required:** English

**Why This Role Exists**

The platform is supporting a high-priority data collection initiative aimed at improving how AI systems understand complex software interfaces and real-world, multi-step workflows. Current datasets lack the fidelity and expert grounding needed to reflect authentic professional software usage. This project addresses that gap by collecting high-quality screen annotations and screen recordings performed by experienced domain experts working in real digital environments.

**What You’ll Do**

Depending on the task phase, you may be asked to complete one or both of the following:

- Record screen sessions demonstrating specific tasks, accompanied by clear verbal narration explaining each step
- Annotate screenshots of professional software by drawing precise bounding boxes around relevant UI elements
- Follow provided staging instructions to set up specific UI states prior to recording
- Use a custom capture tool to record workflows accurately and consistently
- Adhere closely to task guidelines to ensure data quality and usability

**Who You Are**

- You have strong familiarity with professional software tools used in your domain, including:
  - MATLAB
  - Origin
  - Stata
  - EViews
- You are detail-oriented and capable of following precise instructions
- You are comfortable working independently and meeting tight deadlines
- You have access to a physical Mac and can create a fresh macOS user profile if required

**Nice-to-Have**

- Prior experience with data collection, annotation, or QA work
- Experience recording or documenting workflows
- Comfort working with new tools and staged environments

**What Success Looks Like**

- Screen annotations are precise, consistent, and aligned with guidelines
- Screen recordings accurately capture realistic, expert workflows
- Tasks are completed efficiently while maintaining high quality
- Collected data is usable at scale for downstream AI research and development

We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.

๐ŸŒ Remote
๐Ÿ‡บ๐Ÿ‡ธ
4/2/2026
Apply โ†’
$100 per hour

**Location:** Remote **(must have access to a physical Mac)**

**Fluent Language Skills Required:** English

**Why This Role Exists**

The platform is supporting a high-priority data collection initiative aimed at improving how AI systems understand complex software interfaces and real-world, multi-step workflows. Current datasets lack the fidelity and expert grounding needed to reflect authentic professional software usage. This project addresses that gap by collecting high-quality screen annotations and screen recordings performed by experienced domain experts working in real digital environments.

**What You’ll Do**

Depending on the task phase, you may be asked to complete one or both of the following:

- Record screen sessions demonstrating specific tasks, accompanied by clear verbal narration explaining each step
- Annotate screenshots of professional software by drawing precise bounding boxes around relevant UI elements
- Follow provided staging instructions to set up specific UI states prior to recording
- Use a custom capture tool to record workflows accurately and consistently
- Adhere closely to task guidelines to ensure data quality and usability

**Who You Are**

- You have strong familiarity with professional software tools used in your domain, including:
  - Windows
  - macOS
  - Linux
- You are detail-oriented and capable of following precise instructions
- You are comfortable working independently and meeting tight deadlines
- You have access to a physical Mac and can create a fresh macOS user profile if required

**Nice-to-Have**

- Prior experience with data collection, annotation, or QA work
- Experience recording or documenting workflows
- Comfort working with new tools and staged environments

**What Success Looks Like**

- Screen annotations are precise, consistent, and aligned with guidelines
- Screen recordings accurately capture realistic, expert workflows
- Tasks are completed efficiently while maintaining high quality
- Collected data is usable at scale for downstream AI research and development

We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.

๐ŸŒ Remote
๐Ÿ‡บ๐Ÿ‡ธ
4/2/2026
Apply โ†’
$100 per hour

**Location:** Remote **(must have access to a physical Mac)**

**Fluent Language Skills Required:** English

**Why This Role Exists**

The platform is supporting a high-priority data collection initiative aimed at improving how AI systems understand complex software interfaces and real-world, multi-step workflows. Current datasets lack the fidelity and expert grounding needed to reflect authentic professional software usage. This project addresses that gap by collecting high-quality screen annotations and screen recordings performed by experienced domain experts working in real digital environments.

**What You’ll Do**

Depending on the task phase, you may be asked to complete one or both of the following:

- Record screen sessions demonstrating specific tasks, accompanied by clear verbal narration explaining each step
- Annotate screenshots of professional software by drawing precise bounding boxes around relevant UI elements
- Follow provided staging instructions to set up specific UI states prior to recording
- Use a custom capture tool to record workflows accurately and consistently
- Adhere closely to task guidelines to ensure data quality and usability

**Who You Are**

- You have strong familiarity with professional software tools used in your domain, including:
  - Word
  - PowerPoint
  - Excel
- You are detail-oriented and capable of following precise instructions
- You are comfortable working independently and meeting tight deadlines
- You have access to a physical Mac and can create a fresh macOS user profile if required

**Nice-to-Have**

- Prior experience with data collection, annotation, or QA work
- Experience recording or documenting workflows
- Comfort working with new tools and staged environments

**What Success Looks Like**

- Screen annotations are precise, consistent, and aligned with guidelines
- Screen recordings accurately capture realistic, expert workflows
- Tasks are completed efficiently while maintaining high quality
- Collected data is usable at scale for downstream AI research and development

We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.

๐ŸŒ Remote
๐Ÿ‡บ๐Ÿ‡ธ
4/2/2026
Apply โ†’

**Location:** Remote **(must have access to a physical Mac)**

**Fluent Language Skills Required:** English

**Why This Role Exists**

The platform is supporting a high-priority data collection initiative aimed at improving how AI systems understand complex software interfaces and real-world, multi-step workflows. Current datasets lack the fidelity and expert grounding needed to reflect authentic professional software usage. This project addresses that gap by collecting high-quality screen annotations and screen recordings performed by experienced domain experts working in real digital environments.

**What You’ll Do**

Depending on the task phase, you may be asked to complete one or both of the following:

- Record screen sessions demonstrating specific tasks, accompanied by clear verbal narration explaining each step
- Annotate screenshots of professional software by drawing precise bounding boxes around relevant UI elements
- Follow provided staging instructions to set up specific UI states prior to recording
- Use a custom capture tool to record workflows accurately and consistently
- Adhere closely to task guidelines to ensure data quality and usability

**Who You Are**

- You have strong familiarity with professional software tools used in your domain, including:
  - AutoCAD Mechanical
  - SolidWorks
  - Inventor
  - Vivado
- You are detail-oriented and capable of following precise instructions
- You are comfortable working independently and meeting tight deadlines
- You have access to a physical Mac and can create a fresh macOS user profile if required

**Nice-to-Have**

- Prior experience with data collection, annotation, or QA work
- Experience recording or documenting workflows
- Comfort working with new tools and staged environments

**What Success Looks Like**

- Screen annotations are precise, consistent, and aligned with guidelines
- Screen recordings accurately capture realistic, expert workflows
- Tasks are completed efficiently while maintaining high quality
- Collected data is usable at scale for downstream AI research and development

We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.

๐ŸŒ Remote
๐Ÿ‡บ๐Ÿ‡ธ
4/2/2026
Apply โ†’

**Location:** Remote **(must have access to a physical Mac)**

**Fluent Language Skills Required:** English

**Why This Role Exists**

The platform is supporting a high-priority data collection initiative aimed at improving how AI systems understand complex software interfaces and real-world, multi-step workflows. Current datasets lack the fidelity and expert grounding needed to reflect authentic professional software usage. This project addresses that gap by collecting high-quality screen annotations and screen recordings performed by experienced domain experts working in real digital environments.

**What You’ll Do**

Depending on the task phase, you may be asked to complete one or both of the following:

- Record screen sessions demonstrating specific tasks, accompanied by clear verbal narration explaining each step
- Annotate screenshots of professional software by drawing precise bounding boxes around relevant UI elements
- Follow provided staging instructions to set up specific UI states prior to recording
- Use a custom capture tool to record workflows accurately and consistently
- Adhere closely to task guidelines to ensure data quality and usability

**Who You Are**

- You have strong familiarity with professional software tools used in your domain, including:
  - Audio MIDI Setup (Creative)
  - Audio MIDI Setup.app (Creative)
  - Blender (Creative)
  - Digital Color Meter (Creative)
  - Digital Color Meter.app (Creative)
  - Font Book (Creative)
  - Font Book.app (Creative)
  - Freeform (Creative)
  - Freeform.app (Creative)
  - GIMP (Creative)
  - Godot Engine (Creative)
  - Inkscape (Creative)
  - Inkscape.app (Creative)
  - LMMS (Creative)
  - LMMS (Linux MultiMedia Studio) (Creative)
  - Penpot (Creative)
  - Shotcut (Creative)
  - Wings 3D (Creative)
- You are detail-oriented and capable of following precise instructions
- You are comfortable working independently and meeting tight deadlines
- You have access to a physical Mac and can create a fresh macOS user profile if required

**Nice-to-Have**

- Prior experience with data collection, annotation, or QA work
- Experience recording or documenting workflows
- Comfort working with new tools and staged environments

**What Success Looks Like**

- Screen annotations are precise, consistent, and aligned with guidelines
- Screen recordings accurately capture realistic, expert workflows
- Tasks are completed efficiently while maintaining high quality
- Collected data is usable at scale for downstream AI research and development

We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.

๐ŸŒ Remote
๐Ÿ‡บ๐Ÿ‡ธ
4/2/2026
Apply โ†’

**Location:** Remote **(must have access to a physical Mac)**

**Fluent Language Skills Required:** English

**Why This Role Exists**

The platform is supporting a high-priority data collection initiative aimed at improving how AI systems understand complex software interfaces and real-world, multi-step workflows. Current datasets lack the fidelity and expert grounding needed to reflect authentic professional software usage. This project addresses that gap by collecting high-quality screen annotations and screen recordings performed by experienced domain experts working in real digital environments.

**What You’ll Do**

Depending on the task phase, you may be asked to complete one or both of the following:

- Record screen sessions demonstrating specific tasks, accompanied by clear verbal narration explaining each step
- Annotate screenshots of professional software by drawing precise bounding boxes around relevant UI elements
- Follow provided staging instructions to set up specific UI states prior to recording
- Use a custom capture tool to record workflows accurately and consistently
- Adhere closely to task guidelines to ensure data quality and usability

**Who You Are**

- You have strong familiarity with professional software tools used in your domain, including:
  - Visual Studio Code
  - Android Studio
  - Quartus
  - VMware
- You are detail-oriented and capable of following precise instructions
- You are comfortable working independently and meeting tight deadlines
- You have access to a physical Mac and can create a fresh macOS user profile if required

**Nice-to-Have**

- Prior experience with data collection, annotation, or QA work
- Experience recording or documenting workflows
- Comfort working with new tools and staged environments

**What Success Looks Like**

- Screen annotations are precise, consistent, and aligned with guidelines
- Screen recordings accurately capture realistic, expert workflows
- Tasks are completed efficiently while maintaining high quality
- Collected data is usable at scale for downstream AI research and development

We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.

๐ŸŒ Remote
๐Ÿ‡บ๐Ÿ‡ธ
4/2/2026
Apply โ†’

## **1. Role Overview**

The platform is seeking experienced **Spanish Copy Editors** to support high-quality written content across a range of AI training initiatives. This role is ideal for detail-oriented language professionals who can ensure clarity, consistency, grammatical precision, and stylistic excellence across complex Spanish written materials. We are looking for people who demonstrate exceptional command of Spanish, strong editorial judgment, and a meticulous eye for detail. This role requires consistent engagement and reliability over time.

## **2. Key Responsibilities**

- Edit and proofread Spanish-language content for grammar, spelling, punctuation, syntax, and clarity
- Ensure consistency in tone, style, formatting, and terminology across documents
- Improve structure, readability, and logical flow without altering intended meaning
- Identify ambiguities, inconsistencies, and factual or structural issues
- **Enforce Spanish style guide adherence and maintain high editorial standards**
- Flag unclear instructions, weak reasoning, or editorial risks in source material
- Provide thoughtful feedback to improve overall content quality

## **3. Ideal Qualifications**

- **3-5+ years of proven professional experience** in copy editing
- Exceptional written and grammatical command of Spanish (native proficiency required)
- Demonstrated history of high-precision editorial work with minimal error rates
- Extremely high attention to detail and ability to catch subtle inconsistencies
- Strong familiarity with following instructions in a professional written context (**RAE standards, Spanish editorial standards, or equivalent**)
- Ability to work independently while maintaining consistent quality standards
- Reliable availability and consistent engagement

## **4. More About the Opportunity**

- Minimum commitment: **20 hours per week**
- Ongoing, consistent engagement expected
- Work involves reviewing and refining high-volume written material

## **5. Application Process**

- Submit your resume highlighting relevant editorial experience
- Complete the required interview and assessment
- We aim to follow up within a few days with next steps

## **6. About the platform**

The platform is a talent marketplace that connects top experts with leading AI labs and research organizations. Our investors include Benchmark, General Catalyst, Adam D'Angelo, and Jack Dorsey. Thousands of professionals across law, research, engineering, design, and creative fields contribute to the platform's projects shaping the next generation of AI.

๐ŸŒ Remote
๐Ÿ‡ช๐Ÿ‡ธ
4/2/2026
Apply โ†’

## **1. Role Overview**

The platform is seeking experienced **Korean Copy Editors** to support high-quality written content across a range of AI training initiatives. This role is ideal for detail-oriented language professionals who can ensure clarity, consistency, grammatical precision, and stylistic excellence across complex Korean written materials. We are looking for people who demonstrate exceptional command of Korean, strong editorial judgment, and a meticulous eye for detail. This role requires consistent engagement and reliability over time.

## **2. Key Responsibilities**

- Edit and proofread Korean-language content for grammar, spelling, punctuation, syntax, and clarity
- Ensure consistency in tone, style, formatting, and terminology across documents
- Improve structure, readability, and logical flow without altering intended meaning
- Identify ambiguities, inconsistencies, and factual or structural issues
- **Enforce Korean style guide adherence and maintain high editorial standards**
- Flag unclear instructions, weak reasoning, or editorial risks in source material
- Provide thoughtful feedback to improve overall content quality

## **3. Ideal Qualifications**

- **3-5+ years of proven professional experience** in copy editing
- Exceptional written and grammatical command of Korean (native proficiency required)
- Demonstrated history of high-precision editorial work with minimal error rates
- Extremely high attention to detail and ability to catch subtle inconsistencies
- Strong familiarity with following instructions in a professional written context (**National Institute of Korean Language standards, Korean editorial standards, or equivalent**)
- Ability to work independently while maintaining consistent quality standards
- Reliable availability and consistent engagement

## **4. More About the Opportunity**

- Minimum commitment: **20 hours per week**
- Ongoing, consistent engagement expected
- Work involves reviewing and refining high-volume written material

## **5. Application Process**

- Submit your resume highlighting relevant editorial experience
- Complete the required interview and assessment
- We aim to follow up within a few days with next steps

## **6. About the platform**

The platform is a talent marketplace that connects top experts with leading AI labs and research organizations. Our investors include Benchmark, General Catalyst, Adam D'Angelo, and Jack Dorsey. Thousands of professionals across law, research, engineering, design, and creative fields contribute to the platform's projects shaping the next generation of AI.

๐ŸŒ Remote
๐Ÿ‡ฐ๐Ÿ‡ท
4/2/2026
Apply โ†’

## **1. Role Overview**

The platform is seeking experienced **Chinese Copy Editors** to support high-quality written content across a range of AI training initiatives. This role is ideal for detail-oriented language professionals who can ensure clarity, consistency, grammatical precision, and stylistic excellence across complex Chinese written materials. We are looking for people who demonstrate exceptional command of Chinese, strong editorial judgment, and a meticulous eye for detail. This role requires consistent engagement and reliability over time.

## **2. Key Responsibilities**

- Edit and proofread Chinese-language content for grammar, spelling, punctuation, syntax, and clarity
- Ensure consistency in tone, style, formatting, and terminology across documents
- Improve structure, readability, and logical flow without altering intended meaning
- Identify ambiguities, inconsistencies, and factual or structural issues
- **Enforce Chinese style guide adherence and maintain high editorial standards**
- Flag unclear instructions, weak reasoning, or editorial risks in source material
- Provide thoughtful feedback to improve overall content quality

## **3. Ideal Qualifications**

- **3-5+ years of proven professional experience** in copy editing
- Exceptional written and grammatical command of Chinese (native proficiency required)
- Demonstrated history of high-precision editorial work with minimal error rates
- Extremely high attention to detail and ability to catch subtle inconsistencies
- Strong familiarity with following instructions in a professional written context (**GB/T standards, Chinese editorial standards, or equivalent**)
- Ability to work independently while maintaining consistent quality standards
- Reliable availability and consistent engagement

## **4. More About the Opportunity**

- Minimum commitment: **20 hours per week**
- Ongoing, consistent engagement expected
- Work involves reviewing and refining high-volume written material

## **5. Application Process**

- Submit your resume highlighting relevant editorial experience
- Complete the required interview and assessment
- We aim to follow up within a few days with next steps

## **6. About the platform**

The platform is a talent marketplace that connects top experts with leading AI labs and research organizations. Our investors include Benchmark, General Catalyst, Adam D'Angelo, and Jack Dorsey. Thousands of professionals across law, research, engineering, design, and creative fields contribute to the platform's projects shaping the next generation of AI.

๐ŸŒ Remote
๐Ÿ‡จ๐Ÿ‡ณ
4/2/2026
Apply โ†’
$35 - $55 per hour

## **1. Role Overview**

The platform is seeking experienced **Bilingual Software Engineers** to support high-quality content across a range of AI training initiatives. This role involves creating image-based coding training examples (finding relevant images, writing prompts, and crafting ideal model responses) in both English and your native language. We are looking for people who demonstrate strong programming ability across multiple languages, clear technical communication in two languages, and a meticulous eye for detail. This role requires consistent engagement and reliability over time.

**Languages we are hiring for:** Arabic, Portuguese, Spanish, Korean, French, Chinese, and Russian.

## **2. Key Responsibilities**

- Create high-quality training examples consisting of an image, a code-related prompt, and an ideal response
- Write, debug, and explain code based on visual inputs such as screenshots, error tracebacks, architecture diagrams, flowcharts, and UI mockups
- Generate working code from visual specifications including wireframes, UML diagrams, and system design sketches
- Identify and fix bugs shown in code screenshots and terminal output
- Write prompts and responses that sound natural and fluent in your native language, not translated from English
- Produce clean, well-commented code with proper error handling inside fenced, language-tagged code blocks
- Follow style guides, system prompts, and formatting requirements with precision
- Source appropriate images (code screenshots, terminal output, error tracebacks, architecture diagrams, flowcharts, UI mockups) that meet project quality standards

## **3. Ideal Qualifications**

- Bachelor's degree (or higher) in Computer Science, Software Engineering, or a closely related field
- Bilingual proficiency in English and one of the following languages: Arabic, Portuguese, Spanish, Korean, French, Chinese, or Russian (native-level proficiency in at least one required)
- Proficiency in one or more popular programming languages (Python, JavaScript, Java, C, C++, HTML/CSS, or equivalent)
- Demonstrated professional or academic experience writing, reviewing, and debugging code
- Ability to read and interpret system architecture diagrams, flowcharts, and UI mockups
- Strong ability to explain technical concepts clearly in written form across both languages
- Extremely high attention to detail and ability to catch subtle bugs and logic errors
- Ability to work independently while maintaining consistent quality standards
- Reliable availability and consistent engagement

## **4. More About the Opportunity**

- Minimum commitment: **20 hours per week**
- Ongoing, consistent engagement expected
- Work involves creating and refining high-volume code-related content in a multilingual context

## **5. Application Process**

- Submit your resume highlighting relevant software engineering and language experience
- Complete the attached required interview and assessment
- We aim to follow up within a few days with next steps

## **6. About the platform**

The platform is a talent marketplace that connects top experts with leading AI labs and research organizations. Our investors include Benchmark, General Catalyst, Adam D'Angelo, and Jack Dorsey. Thousands of professionals across law, research, engineering, design, and creative fields contribute to the platform's projects shaping the next generation of AI.

๐ŸŒ Remote
๐Ÿ‡บ๐Ÿ‡ธ
4/8/2026
Apply โ†’

**The platform is hiring experienced Software Engineers** to support high-impact research collaborations with leading AI labs. Freelancers will evaluate and compare the performance of AI-powered CLI coding agents on real-world infrastructure debugging tasks. This is a unique opportunity to apply your systems engineering expertise toward producing rigorous comparative analyses that directly inform product decisions at frontier AI companies.

### About the Project

You'll solve **TerminalBench tasks**: real-world broken infrastructure scenarios running inside Docker containers. You'll use AI CLI agents to help you. Each task presents a failing system (databases, networking, security, pipelines) that you must diagnose and fix by writing a bash script, guided by AI agents in turn.

### Key Responsibilities

- Solve the same infrastructure debugging task with each CLI-based AI coding agent in turn
- Diagnose broken systems inside Docker containers (databases, TLS, pipelines, replication, access control)
- Write bash scripts that fix the root cause and survive service restarts
- Compare agents' approaches and rank their performance after each task

### Ideal Qualifications

- 3+ years of experience in software engineering, with hands-on debugging of systems and infrastructure
- Strong bash/shell scripting proficiency: you'll be writing non-trivial fix scripts from scratch
- Docker and containerization experience: every task runs inside a Docker container you'll need to explore via `docker exec`
- Infrastructure and systems debugging skills: experience with PostgreSQL, MySQL, Redis, nginx, TLS, systemd, log analysis, or similar
- Familiarity with version control workflows (Git, PRs, issue tracking)
- Experience with AI coding tools (Copilot, Cursor, Claude, or similar) is a plus: you need to effectively prompt and evaluate AI output, not just code yourself

### Project Timeline

- **Start Date:** Immediate
- **Duration:** 1-2 weeks
- **Commitment:** Part-time (15-25 hours/week, with flexibility up to 40 hours/week)

### Application & Onboarding Process

1. Upload your resume
2. AI interview: A short, 15-minute conversational session to understand your background, experience, and interest in the role
3. Follow-up communication within a few days with next steps and onboarding details

**Apply today** and leverage your systems engineering expertise to help evaluate the next generation of AI coding agents!

This is a pay-per-task opportunity for writers, who are eligible for promotion to reviewer on an as-needed basis.

๐ŸŒ Remote
๐Ÿ‡บ๐Ÿ‡ธ
4/10/2026
Apply โ†’
$35 - $50 per hour

**Role Overview**

We are seeking experienced e-commerce professionals to author and verify high-quality open-ended prompts for AI model evaluation. You will craft and review practical, unambiguous e-commerce problems across core subdomains, assessing AI reasoning quality and helping establish rigorous evaluation standards for frontier language models. You will be assigned one of two task types:

**Authoring Task**

Create 5 original, open-ended prompts from your assigned subdomain at varying difficulty levels (undergraduate, advanced undergraduate, or graduate/professional). Prompts should require human judgment to evaluate the quality of the AI's response, such as strategic analysis, campaign design, or data-driven decision making.

**Verification Task**

Review 5 authored prompts for clarity, scope alignment, difficulty accuracy, and uniqueness. Edit prompts and difficulty ratings where needed.

**E-Commerce Subdomains Covered**

Pricing Strategy, Sales Analytics, Marketing (incl. digital marketing, customer acquisition, retention, and campaign management).

**Key Responsibilities**

- Author clear, unambiguous, open-ended e-commerce prompts that elicit evaluable AI responses
- Verify prompts are within the scope of the assigned subdomain and correctly rated for difficulty
- Ensure all 5 prompts in a task are sufficiently distinct from one another with varying difficulty levels
- Apply expert judgment to assess the depth and quality of e-commerce reasoning required
- Edit prompts and difficulty assignments where standards are not met

**Ideal Qualifications**

- Master's degree or higher in Business, Marketing, Economics, Data Science, or a closely related field
- 2–6 years of professional experience in e-commerce, digital marketing, retail analytics, or a related field
- Strong understanding of pricing strategy, sales funnel analytics, and performance marketing
- Experience with platforms such as Amazon Seller Central, Shopify, or major ad platforms is a plus
- Excellent written English and ability to craft precise, well-scoped technical questions

**More About the Opportunity**

- Expected commitment: 10+ hours/week
- Asynchronous, fully remote work

Remote
🇺🇸
4/10/2026
Apply →
$40 - $45 per hour

**AI Data Annotator & Reviewer** We're looking for a detail-oriented contributor to support AI model training and data preparation for leading AI research labs. **What You'll Do** You'll annotate and review training data, audit work produced by other annotators, and assist with data conversion workflows that improve model performance. Day-to-day work involves following precise instructions, adapting quickly to evolving requirements, and helping shape the capabilities of frontier AI models. **What We're Looking For** - Strong ability to learn and apply new instructions quickly - High attention to detail and consistency in reviewing data - Comfort working with structured data formats and conversion tasks - Motivated by contributing to cutting-edge AI development **Why It Matters** This work directly impacts how next-generation AI models learn and perform. You'll be working at the frontier of AI capabilities development alongside top-tier research labs.

Remote
🇺🇸
4/10/2026
Apply →
$90 - $150 per hour

**Overview** We're looking for experienced Grafana power users to design expert-level evaluation tasks that test whether AI agents can use Grafana the way a real professional does. Your domain expertise is what makes these tasks authentic. **What You'll Do** - Design realistic, multi-step Grafana workflows: dashboards, alerting rules, data source configuration, panel setup, cross-module operations - Perform each workflow yourself on a hosted Grafana instance **to produce a reference trajectory** - Write clear, specific task prompts with measurable outcomes that can be verified programmatically - Implement programmatic graders that check whether each instruction was completed correctly - Review AI agent attempts at your tasks, identify where and why they fail, and tag root causes - Calibrate task difficulty so tasks are challenging but solvable, iterating on prompts and constraints based on model performance **Requirements** - 2+ years of daily, professional Grafana experience (SRE, Platform Engineering, Observability, or similar) - Deep familiarity with PromQL, dashboard templating, alerting pipelines, and data source configuration (Prometheus, InfluxDB, etc.) - Ability to articulate workflows clearly enough for programmatic verification - Comfort writing basic grading scripts (Python; engineering support provided as needed) **Nice to Have** - Experience with Grafana API automation - Kubernetes/infrastructure monitoring background - Familiarity with AI evaluation or benchmarking **Time Commitment** - 10-15 hrs/week minimum during the project - Fast turnaround expected; responsiveness matters
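As a rough illustration of the "programmatic grader" idea above, the sketch below checks an exported dashboard against a task's instructions and returns one pass/fail result per instruction. It is only a minimal example under stated assumptions: the function name `grade_dashboard`, the sample dashboard, and the choice to read only the `title` and `panels` fields are hypothetical, and the project's actual grading harness may look quite different.

```python
from __future__ import annotations

from typing import Any


def grade_dashboard(dashboard: dict[str, Any],
                    expected_title: str,
                    required_panels: list[str]) -> dict[str, bool]:
    """Return one pass/fail entry per instruction in the task prompt."""
    # Collect the titles of every panel present in the dashboard JSON.
    panel_titles = {panel.get("title") for panel in dashboard.get("panels", [])}

    results = {"dashboard_title": dashboard.get("title") == expected_title}
    for name in required_panels:
        results[f"panel:{name}"] = name in panel_titles
    return results


# Hypothetical exported dashboard, trimmed to the fields the grader reads.
sample = {
    "title": "API Latency",
    "panels": [
        {"title": "p99 latency", "type": "timeseries"},
        {"title": "Error rate", "type": "stat"},
    ],
}

report = grade_dashboard(sample, "API Latency", ["p99 latency", "Request volume"])
print(report)
```

Reporting each instruction separately, rather than a single overall verdict, makes it easier to see which step an agent missed when reviewing failed attempts.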

Remote
🇺🇸
4/10/2026
Apply →
$75 - $100 per hour

**Role Overview** We are seeking expert legal professionals to author and verify high-quality open-ended prompts for AI model evaluation. You will craft and review challenging, unambiguous legal problems across core subdomains, assessing AI reasoning quality and helping establish rigorous evaluation standards for frontier language models. You will be assigned one of two task types: - **Authoring Task**: Create 5 original, open-ended prompts from your assigned subdomain at varying difficulty levels (undergraduate, advanced undergraduate, or graduate/professional). Prompts should require human judgment to evaluate the quality of the AI's response, such as legal reasoning, statutory interpretation, or case analysis. - **Verification Task**: Review 5 authored prompts for clarity, scope alignment, difficulty accuracy, and uniqueness. Edit prompts and difficulty ratings where needed. **Law Subdomains Covered** Professional & Statutory Law, Jurisprudence & Legal Theory, International Law, Contract Law, Criminal Law, Constitutional Law, Regulatory & Administrative Law. 
**Key Responsibilities** - Author clear, unambiguous, open-ended legal prompts that elicit evaluable AI responses - Verify prompts are within the scope of the assigned subdomain and correctly rated for difficulty - Ensure all 5 prompts in a task are sufficiently distinct from one another with varying difficulty levels - Apply expert judgment to assess the depth and quality of legal reasoning required - Edit prompts and difficulty assignments where standards are not met **Ideal Qualifications** - JD, LLM, SJD, or Master's degree or higher in Law or a closely related field - 2–6 years of professional experience in legal practice, academia, or policy - Strong command of legal reasoning, statutory interpretation, and jurisprudential theory - Bar admission, judicial clerkship, or legal research experience is a strong plus - Excellent written English and ability to craft precise, well-scoped legal questions **More About the Opportunity** - Expected commitment: 10+ hours/week - Asynchronous, fully remote work

Remote
🇺🇸
4/11/2026
Apply →
$60 - $80 per hour

**Finance Model Prompt Evaluator** The team needs expert finance and economics professionals to author and verify high-quality open-ended prompts for AI model evaluation. You will craft and review challenging, unambiguous financial problems across core subdomains, assessing AI reasoning quality and helping establish rigorous evaluation standards for frontier language models. ## Details - Expected commitment: 10+ hours/week - Asynchronous, remote work ## Day-to-day - Author clear, unambiguous, open-ended financial prompts that elicit evaluable AI responses - Verify prompts are within the scope of the assigned subdomain and correctly rated for difficulty - Ensure all 5 prompts in a task are sufficiently distinct from one another with varying difficulty levels - Apply expert judgment to assess the depth and quality of financial reasoning required - Edit prompts and difficulty assignments where standards are not met ## Bonus - Master's degree or higher in Finance, Economics, Financial Engineering, or a closely related field - 2-6 years of professional experience in financial services, investment banking, asset management, or a related field - Strong command of financial modeling, quantitative methods, and domain-specific regulatory frameworks - CFA, FRM, CPA, or equivalent professional certification is a strong plus

Remote
4/2/2026
Apply →
$15 - $95 per hour

**Generalist Language Experts (Search)** This engagement focuses on real interactions you might have with an AI model. It's an opportunity to contribute your expertise to cutting-edge AI research. ## Day-to-day - You'll be asked to create tasks and deliverables regarding common requests within your professional domain ## You bring - At least 2 years of education and/or experience - Excellent written communication with strong grammar and spelling skills ## More About the Opportunity - Expected workload: ~10 hours per week, with flexibility to scale up to 20+ hours - Project

Remote
4/2/2026
Apply →

**Indonesian STEM Translation Reviewer** We're looking for native Indonesian speakers with a strong STEM background to evaluate AI-generated translations of science, technology, engineering, and math content. ## Day-to-day - Review translated responses to STEM questions and ## You bring - Native or near-native Indonesian speaker - Strong STEM background (degree or professional experience in science, engineering, math, or technology) - Ability to evaluate both technical accuracy and linguistic quality - Remote, hourly | ~1.25 hrs per task | Flexible

Remote
4/11/2026
Apply →

**Spanish (LATAM) STEM Translation Reviewer** We're looking for native Spanish (LATAM) speakers with a strong STEM background to evaluate AI-generated translations of science, technology, engineering, and math content. ## Day-to-day - Review translated responses to STEM questions and ## You bring - Native or near-native Spanish (LATAM) speaker - Strong STEM background (degree or professional experience in science, engineering, math, or technology) - Ability to evaluate both technical accuracy and linguistic quality - Remote, hourly | ~1.25 hrs per task | Flexible

Remote
4/11/2026
Apply →

**Korean STEM Translation Reviewer** We're looking for native Korean speakers with a strong STEM background to evaluate AI-generated translations of science, technology, engineering, and math content. ## Day-to-day - Review translated responses to STEM questions and ## You bring - Native or near-native Korean speaker - Strong STEM background (degree or professional experience in science, engineering, math, or technology) - Ability to evaluate both technical accuracy and linguistic quality - Remote, hourly | ~1.25 hrs per task | Flexible

Remote
4/11/2026
Apply →

**Arabic (Modern Standard Arabic) STEM Translation Reviewer** We're looking for native Arabic (Modern Standard Arabic) speakers with a strong STEM background to evaluate AI-generated translations of science, technology, engineering, and math content. ## Day-to-day - Review translated responses to STEM questions and ## You bring - Native or near-native Arabic (Modern Standard Arabic) speaker - Strong STEM background (degree or professional experience in science, engineering, math, or technology) - Ability to evaluate both technical accuracy and linguistic quality - Remote, hourly | ~1.25 hrs per task | Flexible

Remote
4/11/2026
Apply →
$55 - $135 per hour

We are looking for experienced legal professionals to write complex legal reasoning problems from CourtListener, a comprehensive database of U.S. court opinions and legal records. In this role, you will author legal reasoning problems grounded in real court opinions, orders, and judicial records. Your work will directly support the development of AI systems trained on legal reasoning and judicial language. **Responsibilities:** - Review and annotate U.S. court opinions and legal filings sourced from CourtListener - Apply legal judgment to evaluate case outcomes, reasoning quality, and document classification - Complete structured tasks with consistency and accuracy **Requirements:** - Have **3+ years of professional experience practicing law** at a law firm, in-house legal department, government agency, or legal research organization - Hold a **J.D., LL.M. or equivalent legal degree** from a top U.S. university - Experience with U.S. federal or state court systems - Strong reading comprehension of legal texts - Ability to commit ~15 hours/week **Nice to have:** - Prior experience in legal data annotation or AI/ML projects - Familiarity with CourtListener

Remote
🇺🇸
4/11/2026
Apply →
$20 - $30 per hour

The Vendor Management Associate plays a critical role in coordinating and managing vendor users on the company's proprietary localization tools, as well as maintaining approved vendor resource databases. Approximately 80% of this role involves executing core operational and transactional tasks, including reviewing vendor onboarding requests, performing validations, and data entry with a minimum accuracy of 97%. This role supports the company's Procure-to-Pay processes by ensuring supplier data accuracy, compliance, and payment integrity. ## Responsibilities - Maintain and update the global Supplier Master System, including supplier tax, contact, banking, and payment-related details with minimal oversight. - Review and approve supplier onboarding requests, ensuring compliance with internal controls and documented processes. - Execute supplier profile changes and modifications, including critical data such as tax IDs, bank accounts, and legal names. - Perform thorough validations of supplier information to ensure accuracy and prevent duplicate or defective records. - Manage communication with suppliers globally to obtain required documentation, resolve discrepancies, and support onboarding and ongoing supplier maintenance. - Collaborate with cross-functional teams and tooling partners to improve and automate Procure-to-Pay workflows. - Support Supplier Master Quality Assurance by validating controls related to supplier data and compliance before approving profiles. - Track and prioritize issues affecting operational efficiency and supplier data quality. - Maintain and update desktop procedures and documentation related to supplier onboarding and maintenance. - Work closely with suppliers to ensure all required documentation is obtained timely, complete, and accurate, including tax forms and bank account proofs. 
- Support risk management efforts by ensuring supplier data complies with tax regulations and payment policies, and by managing holds related to missing or invalid documentation. - Participate in projects and initiatives as assigned. ## Qualifications - Strong analytical and problem-solving skills with keen attention to detail. - Ability to work independently, prioritize tasks, and meet deadlines in a fast-paced environment. - Experience with database management and data validation processes. - Effective verbal and written communication skills in English; additional languages are a plus. - Ability to manage multiple projects simultaneously under pressure. - Familiarity with Procure-to-Pay processes, vendor management, and accounts payable functions. - Bachelor's degree in a relevant field or equivalent experience. - Preferred experience in Vendor/Supplier Management, Procurement, Risk Management, Accounts Payable, or Audit. California Fair Chance Act: The company will consider for employment qualified applicants with arrest and conviction records. ## Work Authorization Applicants must be located in the United States. Cincinnatus does not sponsor visas and will not provide visa sponsorship for this role. ## About Cincinnatus Cincinnatus is an enterprise staffing company that partners with leading technology companies to source and employ highly skilled professionals for full-time and long-term contingent roles. Cincinnatus serves as the employer of record for these engagements, providing W-2 employment, payroll, benefits, and compliance, while placing employees directly within client teams to work on high-impact initiatives. Roles hired through Cincinnatus are not project-based or freelance engagements. They are structured, role-based positions that typically involve full-time or fixed-term commitments, close collaboration with a client's internal teams, and integration into standard enterprise workflows. Cincinnatus is a legal entity separate from the platform. 
Opportunities may be discovered through the platform, but employment, onboarding, payroll, and benefits for these roles are administered by Cincinnatus. ## Equal Employment Opportunity Cincinnatus is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or any other legally protected characteristic. Cincinnatus is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans throughout the job application process.

Remote
🇺🇸
4/2/2026
Apply →

Product Operations Specialist (Technical + LLM) The Product & Regulatory Operations organization is a vital part of a prestigious tech company's commitment to user and business safety on its platforms, delivering operations for emerging and critical priorities across the company and Global Operations in close partnership with product, engineering, legal, and cross-functional stakeholders. As a Technical Product Operations Specialist, you will work on projects that drive growth, engagement, and quality of the company's products through data-driven decision making and technical tooling. We are looking for experienced professionals with strong technical acumen and product sense who can independently build analytical solutions, leverage AI tools to accelerate workflows, and drive programmatic improvements through data analysis and automation. ## Responsibilities - Support program execution strategy for multiple products and platforms, including kickstarting 0 to 1 efforts, accelerating execution, and improving quality/outcomes for product objectives via programmatic and technical solutions - Write and optimize SQL queries to collect, transform, and analyze product data from various sources, identifying trends, patterns, and insights that inform business decisions - Build and maintain dashboards, reports, and visualizations (e.g., internal dashboard tools, Google Sheets) to effectively communicate findings to stakeholders and support real-time decision-making - Leverage AI agents and tools to accelerate data analysis, automate repetitive tasks, and prototype solutions - Proactively identify program risks, develop data-backed mitigation plans, and communicate rationale and updates clearly across cross-functional teams - General program management to identify opportunities and predict roadblocks through data analysis, strengthen cross-functional relationships, and execute on plans with technical rigor - Labeling and large language model (LLM) training across all 
organizations ## Qualifications - 8+ years of relevant experience in consulting, strategy, operations, or equivalent program management experience with particular focus on technical product operations - Proficient in SQL, able to write advanced queries including CTEs, window functions, joins, aggregations, and data transformations - Experience with AI agents and AI-assisted workflows, comfortable prompting and iterating with AI tools to accelerate work - Expert-level proficiency in Google Workspace, especially Google Sheets, able to create detailed, step-by-step documentation, complex formulas, pivot tables, and automated workflows - Strong multi-tasking abilities with demonstrated experience managing multiple workstreams simultaneously - Self-starter mentality, able to work independently, take initiative, and drive projects forward without close supervision - Eager to learn new tools, technologies, and domains quickly; adapts to changing technical requirements - Effective critical thinking and experience leveraging data to anticipate and unblock problems and drive solutions - Proven time-management and organizational skills - Data annotation/labeling skills and LLM training experience among product quality teams Preferred Qualifications: - Experience with VS Code or similar development environments - Experience with low-code/no-code prototyping or AI-assisted code generation - Experience building dashboards and data visualizations (e.g., internal dashboard tools, Tableau, Looker, or similar) - Familiarity with data pipelines, ETL processes, or data infrastructure concepts - Experience working with engineering teams on technical specifications and requirements - Background in quality assurance, triage operations, or ML/AI validation workflows - Experience with testing, developing, and implementing test strategies (manual and automated) for products and features Pursuant to the California Fair Chance Act, Los Angeles County Fair Chance Ordinance for Employers, Los 
Angeles Fair Chance Initiative for Hiring Ordinance, and San Francisco Fair Chance Ordinance, qualified applicants will be considered for assignment with arrest and conviction records. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness, meet client expectations, standards, and accompanying requirements, and safeguard business operations and company reputation. ## About Cincinnatus Cincinnatus is an enterprise staffing company that partners with leading technology companies to source and employ highly skilled professionals for full-time and long-term contingent roles. Cincinnatus serves as the employer of record for these engagements, providing W-2 employment, payroll, benefits, and compliance, while placing employees directly within client teams to work on high-impact initiatives. Roles hired through Cincinnatus are not project-based or freelance engagements. They are structured, role-based positions that typically involve full-time or fixed-term commitments, close collaboration with a client's internal teams, and integration into standard enterprise workflows. Cincinnatus is a legal entity separate from the platform. Opportunities may be discovered through the platform, but employment, onboarding, payroll, and benefits for these roles are administered by Cincinnatus. ## Equal Employment Opportunity Cincinnatus is proud to be an Equal Employment Opportunity employer. 
We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or any other legally protected characteristic. Cincinnatus is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans throughout the job application process.

Remote
🇺🇸
4/2/2026
Apply →
$60 - $85 per hour

Project Manager, 1P Retail Operations + Technology Location: Burlingame, CA; Menlo Park, CA; Los Angeles, CA; or New York, NY (Hybrid, 3 days/week on-site) The 1P Retail team at a prestigious tech company is responsible for launching and managing innovative retail stores that create immersive experiences to showcase the company's products. This dynamic and fast-paced team operates at the intersection of technology and retail, dedicated to building the future of in-person brand connection. We are seeking a highly motivated and experienced Project Manager to join our 1P Retail Operations + Technology team. This key executional role involves hands-on management and delivery of complex projects that form the operational and technological foundation of our retail stores. The ideal candidate is an exceptional operator who excels at breaking down complex initiatives into manageable tasks and driving them to completion in a fast-paced environment. ## Responsibilities - Manage and deliver cross-functional initiatives and projects through all stages of the lifecycle, from initiation to completion. - Serve as a conduit between technical and non-technical teams, translating complex technical details into clear, understandable language and action plans. - Break down complex issues into manageable components, employing systematic approaches to execute solutions. - Receive directional guidance where required, and proactively exercise best judgment to select effective techniques to implement and evaluate outcomes. - Thrive in environments of ambiguity and velocity, maintaining clear focus and a path to execution. - Collaborate with cross-functional teams including Retail Operations, Product Marketing, Enterprise Systems and Product, Retail Design and Implementation, Retail Finance, and Product Management to ensure seamless project execution. - Drive the implementation and development of retail systems, including POS, ERP, CRM, AV, and in-store demo technologies. 
- Curate and present project status, risks, and progress from working teams to leadership. ## Qualifications - Bachelor's Degree. - 5+ years of experience in project or program management within a Direct-to-Consumer (DTC) consumer electronics retailer, manufacturer, technology, or premium hospitality company. - Proven competence and experience with project management tools (e.g., JIRA, Asana, MS Project, Wrike, or in-house tools) and general productivity software (Microsoft, Apple, GSuite). - Deep experience with the implementation and development of retail systems (POS, ERP, CRM, AV, Demo). Preferred Qualifications: - MBA or other advanced degree. - PMP, Agile, or similar advanced project/portfolio management certification. - Demonstrated ability to leverage AI in daily work to boost efficiency and productivity. - Strong analytical and problem-solving skills with a data-driven approach to execution. - Excellent communication and interpersonal skills, with the ability to influence at all levels of the organization and collaborate effectively with a wide range of stakeholders. Pursuant to the California Fair Chance Act, Los Angeles County Fair Chance Ordinance for Employers, Los Angeles Fair Chance Initiative for Hiring Ordinance, and San Francisco Fair Chance Ordinance, qualified applicants will be considered for assignment with arrest and conviction records. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness, meet client expectations, standards, and accompanying requirements, and safeguard business operations and company reputation. 
## About Cincinnatus Cincinnatus is an enterprise staffing company that partners with leading technology companies to source and employ highly skilled professionals for full-time and long-term contingent roles. Cincinnatus serves as the employer of record for these engagements, providing W-2 employment, payroll, benefits, and compliance, while placing employees directly within client teams to work on high-impact initiatives. Roles hired through Cincinnatus are not project-based or freelance engagements. They are structured, role-based positions that typically involve full-time or fixed-term commitments, close collaboration with a client's internal teams, and integration into standard enterprise workflows. Cincinnatus is a legal entity separate from the platform. Opportunities may be discovered through the platform, but employment, onboarding, payroll, and benefits for these roles are administered by Cincinnatus. ## Equal Employment Opportunity Cincinnatus is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or any other legally protected characteristic. Cincinnatus is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans throughout the job application process.

Remote
🇺🇸
4/2/2026
Apply →

The Infrastructure Accounting Analyst is responsible for supporting the financial management and accounting processes related to a prestigious tech company's global infrastructure investments. This role partners closely with the company's Infrastructure Supply Chain to manage end-to-end accounting processes and operationalize new accounting programs. This position is essential for ensuring accurate financial reporting and delivering accounting guidance to cross-functional teams, all while adapting to the company's rapid growth. ## Responsibilities - Prepare, review, and analyze monthly, quarterly, and annual financial statements related to infrastructure assets (e.g., data centers, network equipment, real estate). - Support end-to-end accounting processes and operationalize any new accounting program to support Infrastructure Supply Chain. - Support the month-end and quarter-end close processes, including cash flow, journal entries, reconciliations, and variance analysis. - Maintain dashboards and reports to monitor infrastructure spend and capital project progress. - Identify opportunities to streamline accounting processes and improve data accuracy. - Support the implementation and documentation of internal controls related to infrastructure accounting. - Assist with audits and compliance activities as needed. ## Qualifications - Bachelor's degree in Accounting, Finance, or related field. - 2+ years of experience in accounting, preferably with exposure to fixed assets. - Strong understanding of US GAAP and/or IFRS. - Proficiency in Excel and experience with ERP systems (e.g., Oracle Financials and Hyperion Essbase). - Excellent analytical, organizational, and communication skills. 
Pursuant to the California Fair Chance Act, Los Angeles County Fair Chance Ordinance for Employers, Los Angeles Fair Chance Initiative for Hiring Ordinance, and San Francisco Fair Chance Ordinance, qualified applicants will be considered for assignment with arrest and conviction records. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness, meet client expectations, standards, and accompanying requirements, and safeguard business operations and company reputation. ## About Cincinnatus Cincinnatus is an enterprise staffing company that partners with leading technology companies to source and employ highly skilled professionals for full-time and long-term contingent roles. Cincinnatus serves as the employer of record for these engagements, providing W-2 employment, payroll, benefits, and compliance, while placing employees directly within client teams to work on high-impact initiatives. Roles hired through Cincinnatus are not project-based or freelance engagements. They are structured, role-based positions that typically involve full-time or fixed-term commitments, close collaboration with a client's internal teams, and integration into standard enterprise workflows. Cincinnatus is a legal entity separate from the platform. Opportunities may be discovered through the platform, but employment, onboarding, payroll, and benefits for these roles are administered by Cincinnatus. ## Equal Employment Opportunity Cincinnatus is proud to be an Equal Employment Opportunity employer. 
We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or any other legally protected characteristic. Cincinnatus is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans throughout the job application process.

Remote
🇺🇸
4/2/2026
Apply →
$50 - $200 per hour

The platform is recruiting experienced professionals to join a **first-of-its-kind AI research project** at the frontier of artificial intelligence. This initiative focuses on helping AI systems better understand and complete real-world tasks by designing high-quality prompts, executions, and evaluation rubrics. AI is already strong at answering factual questions, but it struggles with nuanced, real-life activities. Your expertise will help bridge that gap. ### **About the Role** As an **Expert Contributor**, you will use your real-world experience to design and execute tasks that reflect how people actually interact with AI systems. Your responsibilities will include: - Claiming tasks across **100+ real-world archetypes or activities** that people commonly ask AI to help with _(e.g., attending museums, running routines, volunteering, reading, laundry, daily planning, etc.)_ - Writing a **clear, realistic prompt** for the task - **Executing the task** while recording your screen - Writing a **detailed evaluation rubric** to assess AI performance on that task ### **Working Hours & Flexibility** - Minimum commitment of **15 hours per week** - **You should be able to turn around tasks within 24 hours** ### **Additional Details** - This is an **ongoing research project** with long-term potential - A **desktop or laptop computer** is required (Chromebooks are not supported)

🌍 Remote
🇺🇸
4/7/2026
Apply →
$80 - $100 per hour

**UI/UX SWE Experts**

**Job type:** task

On this project, you will be asked to improve a given document, spreadsheet, or slide deck based on a pre-set style guide and then document all of the changes you made.

## Pay

- You will be paid per task, not hourly.
- $200: document
- $300: spreadsheet
- $400: slide deck or multi-file

🌍 Remote
4/10/2026
Apply →

**Psychologist – EQ & Theory of Mind Annotator**

The team is looking for licensed psychologists, clinical psychologists, or researchers with expertise in social cognition to join Project TOM.

## Details

- Remote, flexible schedule | ~1-1.25 hrs per task | Paid hourly
- Pilot phase: 30 entries before full production
- Triple-blind annotation: each task reviewed by 3 independent annotators
- Target accuracy: 80%+ label consistency with Gold Standard

## Day-to-day

- Evaluate AI model responses across 7 Theory of Mind (ToM) dimensions: Intentions (T1), Emotions (T2), Desires (T3), Percepts (T4), Knowledge (T5), Beliefs (T6), and Non-literal Communication / Subtext (T7)
- Compare two AI model outputs (Model A vs. Model B) and select the preferred response with a written rationale (~50 words per dimension)
- Assess whether AI models correctly diagnose mental states (Stage 1), form coherent behavioral strategies (Stage 2), and produce psychologically authentic final responses (Stage 3)
- Identify false beliefs, emotional misattributions, hallucinated knowledge states, and failures of Theory of Mind reasoning

## You bring

- Licensed psychologist, clinical psychologist, counselor, or researcher with expertise in social cognition, ToM, or interpersonal psychology
- Strong ability to perceive subtext, emotional undercurrents, and causal chains in human behavior
- Experience with sentiment analysis, behavioral assessment, or text-based psychological evaluation
- Familiarity with Eastern and Western cultural norms in interpersonal conflict resolution

🌍 Remote
4/11/2026
Apply →

Join Mercor

Sign up through our referral link to get started

🎁 $50-$2000 per hire (avg $458)

Sign Up Now →

Platform Stats

Active Gigs: 170
Languages: 10
Last Updated: April 14, 2026

Available Categories

writing: 19
finance: 8
stem: 27
medical: 7
coding: 17
evaluation: 18
general: 69
legal: 5