Quality Engineer (AI)
Maya
Date: 5 hours ago
City: Mandaluyong City
Contract type: Full time

Overview:
The Quality Engineer collaborates with Data Scientists and Machine Learning Engineers to assess and validate machine learning models for production use. They ensure that models meet quality standards not only in terms of functionality and performance, but also with respect to trade-offs such as thresholds, data variability, latency, and real-world feasibility.
What you will do:
- Develop manual and automated test designs, including test automation scripts.
- Design and implement manual and automated tests for machine learning and deep learning models across training, validation, and inference pipelines.
- Develop CI/CD-integrated testing workflows to automate model evaluation at every release stage.
- Execute model validation strategies including functional, regression, performance, robustness, and fairness testing.
- Conduct adversarial and edge-case testing to identify brittle behavior in ML models.
- Evaluate large language models using prompt-based test suites and metrics such as coherence, factuality, and hallucination rate.
- Employ a variety of testing techniques, including functional, regression, performance, and system tests, to deliver product releases successfully.
- Work closely with data scientists and machine learning engineers to ensure quality in model deployment and continuous integration pipelines.
What we are looking for:
- 3–5 years of experience in QA, with at least 1–2 years working with ML models or AI systems; LLM or conversational agent experience is a plus.
- Experience in Backend Testing and App Testing
- Experience with test automation frameworks for data and model validation
- Working knowledge of Python and familiarity with ML libraries (scikit-learn, TensorFlow, PyTorch, LangChain)
- Understanding of ML concepts including classification, regression, overfitting, model drift, and evaluation metrics, plus a baseline understanding of generative model risks (e.g., hallucinations, toxic outputs)
- Experience in testing applications in different domains (e-commerce, banking and finance)
- Working knowledge of at least one test automation framework (BDD, KDT, DDT)
- Solid foundation in common tools for test documentation, bug logging, and Agile practices
- Basic understanding of fairness, bias detection, and explainability in ML models is a plus