Overview

Building with Nscale

We make it easy to integrate AI into your applications. Nscale is a high-performance compute platform designed to simplify AI development, from fine-tuning to inference. It provides on-demand AI services that enable you to fine-tune, evaluate, deploy, and run AI models at scale—without the complexity of managing infrastructure. Built on Nscale’s powerful compute technology, these services are accessible in a self-serve, prepaid model. Whether you’re optimising models for production or running large-scale inference, Nscale delivers the flexibility and performance needed to power AI-driven applications.

Nscale’s capabilities

Serverless inference – Run popular LLMs effortlessly with our API, without worrying about infrastructure management.

Fine-tuning – Customise open-source models with fine-tuning and deploy them on a dedicated endpoint for high-performance inference.

Evaluation (Coming soon) – Evaluate model performance with tools designed to meet your specific requirements.

GPU clusters – Accelerate your workflow with cutting-edge infrastructure like NKS (Nscale Kubernetes Service), Slurm, or bare metal machines. Contact sales to learn more.

Quickstart

Create an account

Head to console.nscale.com and create an account

Add credit to your account

From the dashboard, add a minimum of $5 of credit to start using our service

Call your first endpoint

Call the inference endpoint with your Service token

Head to the quickstart page for a more detailed view on how to get started with Nscale.

Platform Infrastructure

Provision and manage compute, networking, and storage resources directly in the Nscale Console.

Instances

Create GPU and CPU virtual machines with SSH access

VPC Networks

Isolated private networks for your resources

Filesystem

Shared persistent NFS storage for instances and clusters

Managed Kubernetes

Provision isolated Kubernetes clusters for container workloads

Security Groups

Firewall rules controlling inbound and outbound traffic

Terraform Provider

Manage Nscale infrastructure as code

Building with Nscale

Quick links

Quickstart

Serverless Inference

Contact Support

Get GPU Clusters

Nscale’s capabilities

Quickstart

Platform Infrastructure

Instances

VPC Networks

Filesystem

Managed Kubernetes

Security Groups

Terraform Provider

Get help

Contact Support

​Building with Nscale

​Quick links

Quickstart

Serverless Inference

Contact Support

Get GPU Clusters

​Nscale’s capabilities

​Quickstart

​Platform Infrastructure

Instances

VPC Networks

Filesystem

Managed Kubernetes

Security Groups

Terraform Provider

​Get help

Contact Support

Building with Nscale

Quick links

Nscale’s capabilities

Quickstart

Platform Infrastructure

Get help