We are looking for a QA Specialist for generative AI systems who is detail-oriented, systematic, and accountable for ensuring AI-driven applications function reliably across prompts, edge cases, and real-world scenarios. This role is critical in safeguarding output quality, consistency, and safety before and after deployment. If you take ownership of AI quality, think in edge cases, and believe that precision builds trust and credibility in intelligent systems, this role is for you.
What You Will Do
As our Generative AI QA Specialist, you will:
- Conduct structured testing across prompts, outputs, and edge-case scenarios
- Validate response accuracy, consistency, and contextual relevance
- Test prompt flows, system instructions, and multi-turn interactions
- Evaluate AI behavior across different models, configurations, and environments
- Identify hallucinations, bias, safety risks, and output inconsistencies
- Perform pre-deployment and post-deployment validation checks
- Monitor logs, errors, and failure cases in AI responses
- Assess performance metrics such as latency, token usage, and response stability
- Document issues with clear prompts, outputs, and reproduction steps
- Collaborate with developers and prompt engineers to refine and validate improvements
What We Are Looking For
You may be a strong fit if you have hands-on experience with:
- QA methodologies for generative AI, including prompt and output validation
- Testing AI models, LLM-powered applications, or conversational systems
- Prompt engineering concepts and multi-turn interaction testing
- Evaluating response accuracy, consistency, and contextual relevance
- Identifying hallucinations, bias, and safety risks in AI outputs
- Working with APIs for AI services and model integrations
- Monitoring logs, outputs, and performance metrics such as latency and token usage
- Basic NLP concepts and LLM behavior
Additional Strengths:
- High attention to detail and structured, analytical thinking
- Ability to anticipate edge cases and unpredictable AI behaviors
- Clear and precise documentation of prompts, outputs, and test scenarios
- Strong ownership of quality, safety, and reliability in AI systems
- A systematic and disciplined approach to testing non-deterministic systems
Your Work Setup
- Remote freelance work
- 4 days per week, 7:00 AM to 4:00 PM (UTC+8)
- Stable workload with clear project timelines and deliverables
- Opportunity to grow into a leadership role as Chagible AI Lab expands