Testing Agents

Chat-Based Testing

Navigate to Agents → Select agent → Start Chat. Test with different conversation types:

Simple queries: Basic understanding and responses
Complex requests: Multi-step tasks requiring reasoning
Edge cases: Unusual inputs or boundary conditions
Error handling: Invalid inputs and how agent recovers

Document behavior including tool usage, response quality, and accuracy.

Creating Test Cases

Define expected behavior for validation: What to test:

Task completion accuracy
Tool selection and usage
Response format consistency
Error handling
Response time

Example test: “Extract key information about Apple Inc.”

Expected tools: web_search
Required fields: name, industry, headquarters
Success: All fields present and accurate

Batch Testing

Test agents on sample datasets before production use.

Create test dataset with 10-50 records
Run agent on test data
Review output quality and accuracy
Fix issues and retest
Deploy when quality meets standards

Compare outputs against expected results to catch issues early.

Performance Validation

Speed: Simple queries < 2 seconds, complex analysis < 30 seconds Quality: Check response relevance, accuracy, completeness, and clarity Consistency: Same input should yield similar results across tests Monitor resource usage and tool call patterns for optimization opportunities.

Common Issues

Inconsistent results: Model variability is normal. Run multiple tests to identify patterns. Slow performance: Review tool usage, consider caching frequent queries, optimize system prompt. Accuracy problems: Refine system prompt instructions, adjust tool configurations, add examples. Tool failures: Verify API credentials, check rate limits, test connectivity.

Production Testing

Gradual rollout: Test on small dataset subset first, then expand gradually. Monitor continuously: Track accuracy, response times, error rates in production. User feedback: Collect and review user reports to identify improvement areas.

Next Steps

Chat with Agents

Start testing through chat

Managing Tools

Test tool configurations

Configuring Models

Optimize model settings

Creating Agents

Create new agents

Workspaces

Datasets

Agents

Tools

Pages

Chat-Based Testing

Creating Test Cases

Batch Testing

Performance Validation

Common Issues

Production Testing

Next Steps

Chat with Agents

Managing Tools

Configuring Models

Creating Agents

Workspaces

Datasets

Agents

Tools

Pages

​Chat-Based Testing

​Creating Test Cases

​Batch Testing

​Performance Validation

​Common Issues

​Production Testing

​Next Steps

Chat with Agents

Managing Tools

Configuring Models

Creating Agents

Chat-Based Testing

Creating Test Cases

Batch Testing

Performance Validation

Common Issues

Production Testing

Next Steps