> ## Documentation Index
> Fetch the complete documentation index at: https://docs.nscale.com/llms.txt
> Use this file to discover all available pages before exploring further.

# Inferencing

> Manage inferencing resources — list models, endpoints, and send chat completions.

**Aliases:** `inferencing`, `infer`

## Subcommands

* [list-models](#list-models) — List available models
* [list-endpoints](#list-endpoints) — List inferencing endpoints
* [chat](#chat) — Send a chat completion request

***

## list-models

Returns a list of all models available on the serverless platform.

```bash theme={null}
nscale inferencing list-models [flags]
```

### Flags

<table>
  <thead><tr><th>Flag</th><th>Description</th></tr></thead>

  <tbody>
    <tr>
      <td><code>--json</code></td>
      <td>Emit the full JSON payload (mutually exclusive with <code>-q</code>)</td>
    </tr>

    <tr>
      <td><code>-q, --query stringArray</code></td>
      <td>jq filter for value extraction (see <a href="/docs/cli/query-output">Query output with <code>-q</code></a>)</td>
    </tr>
  </tbody>
</table>

### Example

```bash theme={null}
nscale inferencing list-models --json
```

***

## list-endpoints

Returns a list of all model endpoints available for use by the specified organization.

```bash theme={null}
nscale inferencing list-endpoints [flags]
```

### Flags

<table>
  <thead><tr><th>Flag</th><th>Description</th></tr></thead>

  <tbody>
    <tr><td><code>--org string</code></td><td>Organization ID</td></tr>

    <tr>
      <td><code>--json</code></td>
      <td>Emit the full JSON payload (mutually exclusive with <code>-q</code>)</td>
    </tr>

    <tr>
      <td><code>-q, --query stringArray</code></td>
      <td>jq filter for value extraction (see <a href="/docs/cli/query-output">Query output with <code>-q</code></a>)</td>
    </tr>
  </tbody>
</table>

### Example

```bash theme={null}
nscale inferencing list-endpoints --org <org-id>
```

***

## chat

Send a chat completion request to the inference API using a configuration file. Supports both batch and interactive modes.

```bash theme={null}
nscale inferencing chat [flags]
```

### Flags

| Flag                | Description                                           |
| ------------------- | ----------------------------------------------------- |
| `--config string`   | Path to a chat configuration file (JSON)              |
| `--messages string` | Path to a JSON+LD file containing additional messages |
| `--ui`              | Launch interactive chat TUI                           |

### Reasoning content

When you use the interactive TUI (`--ui`) with a model that supports reasoning, the model's thought process appears in a separate "Thought Process" bubble above the response. The reasoning streams live as the model works through the problem, and the final answer appears in the standard response bubble once reasoning is complete.

This gives you visibility into how the model arrives at its answer without cluttering the final response.

### Examples

```bash theme={null}
# Send a chat completion using a config file
nscale inferencing chat --config chat-config.json

# Launch the interactive chat TUI
nscale inferencing chat --config chat-config.json --ui

# Include additional messages from a file
nscale inferencing chat --config chat-config.json --messages extra-messages.json
```

***

## Related

<CardGroup cols={2}>
  <Card title="Models" icon="cloud" href="/docs/ai-services/models">
    Learn about available models on the Nscale platform.
  </Card>

  <Card title="Chat Use Case" icon="message" href="/docs/use-cases/chat">
    End-to-end guide for chat inferencing.
  </Card>
</CardGroup>
