Aliases:Documentation Index
Fetch the complete documentation index at: https://docs.nscale.com/llms.txt
Use this file to discover all available pages before exploring further.
inferencing, infer
Subcommands
- list-models — List available models
- list-endpoints — List inferencing endpoints
- chat — Send a chat completion request
list-models
Returns a list of all models available on the serverless platform.Flags
| Flag | Description |
|---|---|
--json | Output in JSON format |
Example
list-endpoints
Returns a list of all model endpoints available for use by the specified organization.Flags
| Flag | Description |
|---|---|
--org string | Organization ID |
--json | Output in JSON format |
Example
chat
Send a chat completion request to the inference API using a configuration file. Supports both batch and interactive modes.Flags
| Flag | Description |
|---|---|
--config string | Path to a chat configuration file (JSON) |
--messages string | Path to a JSON+LD file containing additional messages |
--ui | Launch interactive chat TUI |
Examples
Related
Models
Learn about available models on the Nscale platform.
Chat Use Case
End-to-end guide for chat inferencing.