Learn how to optimize Cloud Native MCP Server for predictable latency and higher throughput in real production workloads.
Cache and Response Strategy
The server already includes internal caching and response-shaping mechanisms. You can improve performance further by narrowing the scope of each call:
- Prefer namespace-scoped queries over cluster-wide queries.
- Use pagination parameters (for tools that support them) on large datasets.
- Query only the fields you actually need for the current decision.
Example: Limit payload size
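As a rough illustration of the three guidelines above, the sketch below builds an MCP `tools/call` request that is namespace-scoped, paginated, and restricted to a few fields. The JSON-RPC envelope follows the MCP specification, but the tool name `kubernetes_list_pods` and the argument names `namespace`, `limit`, `fields`, and `continue` are hypothetical placeholders; check the tool schema reported by your server for the names it actually accepts.

```python
import json
from typing import Optional, Sequence


def build_tool_call(
    request_id: int,
    tool: str,
    namespace: str,
    limit: int = 50,
    fields: Optional[Sequence[str]] = None,
    continue_token: Optional[str] = None,
) -> dict:
    """Build a JSON-RPC 2.0 `tools/call` request with a narrow response scope.

    The argument names below are assumptions for illustration only.
    """
    if limit <= 0:
        raise ValueError("limit must be a positive integer")

    # Scope the query to one namespace and cap the page size.
    arguments: dict = {"namespace": namespace, "limit": limit}

    # Ask only for the fields needed for the current decision.
    if fields:
        arguments["fields"] = list(fields)

    # Pass the pagination cursor from a previous page, if there is one.
    if continue_token:
        arguments["continue"] = continue_token

    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool, "arguments": arguments},
    }


# Hypothetical tool and field names, shown only to illustrate the shape.
request = build_tool_call(
    request_id=1,
    tool="kubernetes_list_pods",
    namespace="payments",
    limit=25,
    fields=["metadata.name", "status.phase"],
)
print(json.dumps(request, indent=2))
```

Keeping the page size small and the field list short reduces both the work the server does per call and the amount of data the client has to process.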
Tune Kubernetes and Service Timeouts
Use runtime variables that are supported by the current server: