Using Models in Serverless Mode

Serverless mode lets you call models immediately, with no instance to create and no server to configure. Use the prebuilt endpoints provided by Elice AI Cloud, and pay only for what you use.

Characteristics

Ready to use, no infrastructure to prepare
Per-call billing
Automatic adaptation to changing traffic
Suited to testing, prototyping, and flexible deployments

Using a Serverless Endpoint

Selecting a model that supports Serverless in the Model Library makes its Serverless endpoint immediately available.

In ML API, go to the Model Library
Choose a model that supports Serverless
Review the API call information provided
Call the endpoint with your API key included

Calling a Serverless Endpoint

Calls to a Serverless endpoint require API key authentication. Include the API key in the request header.
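As a rough illustration of header-based authentication, the sketch below builds a POST request that carries the API key in a request header. The endpoint URL, the `Authorization: Bearer` scheme, and the payload shape are assumptions made for this example, not confirmed details of Elice AI Cloud. Take the actual URL, header name, and request body from the API call information shown for your model.

```python
import json
import urllib.request


def build_request(endpoint_url: str, api_key: str, payload: dict) -> urllib.request.Request:
    """Build (but do not send) a POST request with the API key in a header.

    Assumption: the key is sent as "Authorization: Bearer <key>". Check the
    API call information for your model for the real header name and scheme.
    """
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        endpoint_url,
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Hypothetical values for illustration only.
request = build_request(
    "https://example.invalid/serverless/my-model",  # placeholder URL
    "YOUR_API_KEY",                                 # placeholder key
    {"input": "Hello"},                             # placeholder payload
)

# To actually send it (requires a real endpoint and key):
# with urllib.request.urlopen(request) as response:
#     print(response.read().decode("utf-8"))
```

Building the request separately from sending it makes it easy to inspect the headers before any call is billed.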

For API keys, see Managing API Keys.

For how to call deployed models, see API Requests.

Serverless Usage Monitoring

The Serverless Usage page in ML API shows your usage and cost.

What you can see

Total spend
Calls per time period
Usage in tokens, seconds (audio), and megapixels (image)
Filters by model and by API key

This helps you analyze usage trends and manage costs efficiently.

Pricing

Serverless is billed per call, and the pricing policy may vary by model type.

Billed by input/output tokens
Call volume shown in a chart
Use the date filter to view history
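To make token-based billing concrete, here is a minimal sketch of how a cost estimate could be computed for a single call. The per-1,000-token rates and the function name are hypothetical, chosen only to show the arithmetic. Actual prices depend on the model, and audio or image models may be metered in seconds or megapixels instead.

```python
def estimate_token_cost(
    input_tokens: int,
    output_tokens: int,
    input_price_per_1k: float,
    output_price_per_1k: float,
) -> float:
    """Estimate the cost of one call billed separately for input and output tokens.

    All prices here are hypothetical placeholders; look up the real rates
    for the model you are calling.
    """
    input_cost = (input_tokens / 1000) * input_price_per_1k
    output_cost = (output_tokens / 1000) * output_price_per_1k
    return input_cost + output_cost


# Example with made-up rates: 2,000 input tokens at 0.5 per 1k
# and 500 output tokens at 1.5 per 1k.
cost = estimate_token_cost(2000, 500, input_price_per_1k=0.5, output_price_per_1k=1.5)
print(cost)  # 1.75
```

Comparing an estimate like this with the figures on the Serverless Usage page is one way to check that spend matches expectations.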
