Loading...
Loading...
**Biology Expert (General)** The team needs PhD-level experts with experience applying design, optimization, prediction, and scientific reasoning to biological or chemical systems Ideal candidates have worked across experimental and/or computational settings, and can interpret complex data to guide
**Business Operations Associate** This role supports a fast-growing AI lab building advanced technology for enterprise use cases The team operates with high velocity and is led by an experienced founder/CEO ## Details - Support event planning (conferences, customer dinners, demo days) - Act as a
**About the Role** We're looking for licensed attorneys to review a corpus of legal documents and assess whether their meaning and logical flow are fully preserved. Reviewers will evaluate documents at the sentence and paragraph level, flagging any instances where content appears incomplete, ambiguous, or where the intended legal meaning has been lost or altered. This is a short-term, part-time engagement with an expected duration of 3 days. **Responsibilities** - Review redacted legal documents for semantic coherence and completeness - Flag passages where meaning is unclear, inconsistent, or appears to have been disrupted - Provide brief written rationale for any flagged content - Complete assigned document batches within daily targets **Qualifications** - JD required - Prior experience with legal records, court filings, or litigation documents strongly preferred - Familiarity with AI or document annotation project experience is a plus - Strong attention to detail and ability to work efficiently under time constraints
**About the Role** We're looking for creative, detail-oriented experts to help train the next generation of AI agents. You'll design realistic, complex digital worlds and craft challenging scenarios that test how well AI navigates real-world tasks โ think scheduling conflicts, information overload, competing priorities, and more. **What You'll Do** - Build richly detailed personas and simulated digital environments (Gmail, Slack, Calendar, WhatsApp, Google Drive, and more) - Write tasks that challenge an AI agent's ability to reason, filter, and prioritize - Run the agent against your scenarios, evaluate its performance, and guide it to success through structured hints - Document your work clearly in Airtable and Crucible **You're a Good Fit If You** - Have strong written communication and attention to detail - Think creatively and enjoy constructing layered, realistic scenarios - Are comfortable working with structured tools like JSON editors, Airtable, and AI platforms - Can spot inconsistencies and think critically about information quality - Work independently and manage your own workflow - **Must Have** - Undergrad degree, 2+ years professional experience. - **Nice to have:** - **Writing chops** โ copywriter, UX writer, journalist, content strategist, editor, published blog/newsletter, creative writing background. Strong writers make the best task creators _and_ reviewers - **Multi-stakeholder coordination** โ PM, account manager, consultant, client success, event coordinator, producer. Creates richer, more authentic professional scenarios - **World-building / creative design** โ game designer, curriculum designer, UX researcher, simulation designer, tabletop RPG creator - **Technical / data comfort** โ JSON familiarity, Jira/Asana power user, technical PM, data analyst, SQL/Python exposure. A plus but not required โ we'll provide a JSON onboarding guid
**Location:** US-Based and Non-US-Based **Type**: Full-time or Part-time Contract Work **Fluent Language Skills Required:** English **Why This Role Exists** the platform partners with leading AI teams to improve the quality, usefulness, and reliability of general-purpose conversational AI systems. These systems are used across a wide range of everyday and professional scenarios, and their effectiveness depends on how clearly, accurately, and helpfully they respond to real user questions. In coding and software engineering contexts, conversational AI systems must demonstrate correct reasoning, strong problem-solving ability, and adherence to real-world engineering best practices. This project focuses on evaluating and improving how models reason about code, generate solutions, and explain technical concepts across a variety of programming tasks and complexity levels. **What Youโll Do** - **Evaluate LLM-generated responses** to coding and software engineering queries for accuracy, reasoning, clarity, and completeness - **Conduct fact-checking** using trusted public sources and authoritative references - Conduct accuracy testing by **executing code and validating outputs using appropriate tools** - **Annotate model responses** by identifying strengths, areas of improvement, and factual or conceptual inaccuracies - Assess code quality, readability, algorithmic soundness, and explanation quality - Ensure **model responses align with expected conversational behavior** and system guidelines - **Apply consistent evaluation standards** by following clear taxonomies, benchmarks, and detailed evaluation guidelines **Who You Are** - You hold a **BS, MS, or PhD in Computer Science or a closely related field** - You have **significant (3+ years) real-world experience in software engineering** or related technical roles - You are an expert in at **least two relevant programming languages (e.g., Python, Java, C++, C, JavaScript, Go, Rust, Ruby, SQL, Powershell, Bash, Swift, Kotlin, R, TypeScript, HTML/CSS)** - You are able to solve **HackerRank or LeetCode Medium and Hardโlevel problems independently** - You have experience contributing to well-known open-source projects, including merged pull requests - You have **significant experience using LLMs while coding** and understand their strengths and failure modes - **You have strong attention to detail** and are **comfortable evaluating complex technical reasoning**, identifying subtle bugs or logical flaws **Nice-to-Have Specialties** - Prior experience with RLHF, model evaluation, or data annotation work - Track record in competitive programming - Experience reviewing code in production environments - Familiarity with multiple programming paradigms or ecosystems - Experience explaining complex technical concepts to non-expert audiences **What Success Looks Like** - You identify incorrect logic, inefficiencies, edge cases, or misleading explanations in model-generated code, technical concepts, and system design discussions - Your feedback improves the correctness, robustness, and clarity of AI coding outputs - You deliver reproducible evaluation artifacts that strengthen model performance - the platform customers trust AI systems to assist reliably with real-world coding tasks **Why Join the platform** At the platform, experienced software engineers play a direct role in shaping how AI systems reason about and generate code. This remote role allows you to apply your technical expertise to high-impact AI development work, improving systems used by developers around the world.
the platform is seeking detail-oriented writing experts to contribute to a high-impact AI research project with a leading lab. Freelancers will author promptโgolden answer pairs that train and evaluate advanced language models. This is a short-term, flexible opportunity for professionals with strong academic backgrounds and a knack for instructional clarity. Ideal for those who enjoy distilling complex concepts into well-crafted text. * * * ### **Job Details:** - **Design and Optimize Prompts**: Create detailed prompts with multiple constraints and instructions. - **Define and Document Evaluation Standards**: Establish high-level expectations for correct responses in general consumer contexts, and develop comprehensive rubric. - **Conduct Model Testing and Grading**: Run prompts through models and assess preliminary outputs against expectations. - **Support Benchmarking and Quality Assurance**: Collaborate in QA review processes to ensure prompt tasks and rubrics meet rigor, maintaining consistency and reliability before integration into official benchmarks. ### **Minimum Qualifications:** - BS or BA from a reputable institution completed or in progress - Strong writing and critical thinking skills. - Ability to work independently and meet deadlines. - Significant familiarity with ChatGPT or similar tools for personal decision-making or hobbies / general interests. - US or Canada based. ### **Preferred Qualifications:** - Experience in teaching or research. ### **Application & Onboarding Process:** - Complete an AI-led interview, this should take around 15 minutes. - Complete a 45-minute written assessment that will guide you through writing rubrics. - If selected, you will be invited to work on the project. ### **More Details About This Role:** - Expect to contribute at least **20 hours per week**. - Expect a commitment of around 2 month. - Youโll be working in a structured project environment with clear goals and tools.
This is a unique opportunity to apply your engineering expertise toward shaping the next generation of intelligent systems. Up to $90/hr.
Professional Domain Expert (Health/Med) โ hourly contract role . Up to $75/hr.
Professional Domain Expert (Government/Non-Profit) โ hourly contract role . Up to $35/hr.
Professional Domain Expert (Legal) โ hourly contract role . Up to $75/hr.
Software Engineer III - OpenXR Developer โ full-time contract role . Up to $105/hr.
Technical Program Manager I - Hardware โ full-time contract role . Up to $95/hr.
Professional Domain Expert (Retail/Wholesale Trade) โ hourly contract role . Up to $30/hr.
Professional Domain Expert (Manufacturing) โ hourly contract role . Up to $35/hr.
Professional Domain Expert (Real Estate and Leasing) โ hourly contract role . Up to $35/hr.
Excel/PowerPoint/Document Style Experts โ part-time contract role .
We are seeking detail-oriented transcribing/writing experts to contribute to a high-impact audio AI research project with a leading lab. Up to $50/hr.
This role involves performing comprehensive power analysis throughout various design stages, from RTL to GDSII. Up to $95/hr.
We are seeking expert mathematicians to author and verify high-quality open-ended prompts for AI model evaluation. Up to $60/hr.
we is working with a leading intelligence AI lab to identify the most important open questions in core AI/ML fields and to build structured knowledge bases that could me...