The platform applies limits to keep workflow execution stable and fair across your organization. This page explains the limits you may encounter and how to work with them.Documentation Index
Fetch the complete documentation index at: https://docs.platform.aiplanet.com/llms.txt
Use this file to discover all available pages before exploring further.
Concurrency limits
To keep the platform responsive, there’s a cap on how many runs can execute at the same time:- A limit per workflow — how many runs of a single workflow can run concurrently.
- A limit per organization — how many runs across all your workflows can run concurrently.
A single thread runs one execution at a time. Sending a new message in a thread waits for the previous run in that thread to finish.
API rate limits
Requests to the API are rate limited. If you send requests too quickly, the API responds with a rate-limit error — wait briefly and retry. Build retry handling into any integration that calls the API at volume.Working within limits
- Watch Tracing to spot runs that are slower than expected.
- Stagger bulk work instead of triggering many runs simultaneously.
- Retry on rate-limit errors with a short delay in any API integration.
Next steps
Calling workflows via API
Trigger workflows from your own applications.