Inferencing

Aliases: inferencing, infer

Subcommands

list-models — List available models
list-endpoints — List inferencing endpoints
chat — Send a chat completion request

list-models

Returns a list of all models available on the serverless platform.

nscale inferencing list-models [flags]

Flags

Flag	Description
`--json`	Output in JSON format

Example

nscale inferencing list-models --json

list-endpoints

Returns a list of all model endpoints available for use by the specified organization.

nscale inferencing list-endpoints [flags]

Flags

Flag	Description
`--org string`	Organization ID
`--json`	Output in JSON format

Example

nscale inferencing list-endpoints --org <org-id>
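
For scripting, the two flags listed above can presumably be combined so the endpoint list comes back as JSON. The following is a minimal sketch, not part of the CLI: the wrapper name `list_endpoints_json` and the environment variable `NSCALE_ORG_ID` are both assumptions introduced here for illustration, and the CLI itself is not known to read that variable.

```shell
#!/bin/sh
# list_endpoints_json is a hypothetical wrapper, not part of the nscale CLI.
# NSCALE_ORG_ID is an assumed variable name chosen for this sketch.
list_endpoints_json() {
  # Fail early if no organization ID was provided.
  if [ -z "${NSCALE_ORG_ID:-}" ]; then
    echo "NSCALE_ORG_ID is not set" >&2
    return 2
  fi
  # Fail with the conventional "command not found" status if the CLI is missing.
  if ! command -v nscale >/dev/null 2>&1; then
    echo "nscale CLI not found on PATH" >&2
    return 127
  fi
  # --org and --json are the two flags documented for list-endpoints.
  nscale inferencing list-endpoints --org "$NSCALE_ORG_ID" --json
}
```

After exporting `NSCALE_ORG_ID`, calling `list_endpoints_json > endpoints.json` would save the result to a file. The shape of the JSON output is not documented here, so the sketch does not attempt to parse it.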

chat

Sends a chat completion request to the inference API using a configuration file. Supports both batch and interactive modes; pass `--ui` to launch the interactive TUI.

nscale inferencing chat [flags]

Flags

Flag	Description
`--config string`	Path to a chat configuration file (JSON)
`--messages string`	Path to a JSON file containing additional messages
`--ui`	Launch interactive chat TUI

Examples

# Send a chat completion using a config file
nscale inferencing chat --config chat-config.json

# Launch the interactive chat TUI
nscale inferencing chat --config chat-config.json --ui

# Include additional messages from a file
nscale inferencing chat --config chat-config.json --messages extra-messages.json
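
Because both `--config` and `--messages` take JSON files, it can help to confirm that the files parse before sending a request. The sketch below assumes `python3` is available for the syntax check. The helper names `is_valid_json` and `chat_checked` are hypothetical and are not part of the CLI. It checks JSON syntax only, since the configuration schema is not described in this reference.

```shell
#!/bin/sh
# is_valid_json: returns 0 when the given file parses as JSON.
# Uses Python's standard json.tool module; output is discarded.
is_valid_json() {
  python3 -m json.tool "$1" >/dev/null 2>&1
}

# chat_checked is a hypothetical wrapper around `nscale inferencing chat`.
# $1 = config file (required), $2 = messages file (optional).
chat_checked() {
  config=$1
  messages=${2:-}
  is_valid_json "$config" || { echo "invalid JSON: $config" >&2; return 1; }
  if [ -n "$messages" ]; then
    is_valid_json "$messages" || { echo "invalid JSON: $messages" >&2; return 1; }
    nscale inferencing chat --config "$config" --messages "$messages"
  else
    nscale inferencing chat --config "$config"
  fi
}
```

For example, `chat_checked chat-config.json extra-messages.json` would run the third command above only if both files are syntactically valid JSON.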

Related

Models

Learn about available models on the Nscale platform.

Chat Use Case

End-to-end guide for chat inferencing.
