cURL
curl --request POST \ --url https://inference.api.nscale.com/v1/embeddings \ --header 'Authorization: Bearer <token>' \ --header 'Content-Type: application/json; charset=utf-8' \ --data '{ "dimensions": null, "encoding_format": "float", "input": "The food was delicious and the waiter was very friendly.", "model": "Qwen3-Embedding-8B", "user": null }'
{ "object": "<string>", "data": [ { "object": "<string>", "index": 123, "embedding": [ 123 ] } ], "model": "<string>", "usage": { "prompt_tokens": 123, "total_tokens": 123 } }
Get a vector representation of a given input that can be easily consumed by machine learning models and algorithms.
Security scheme that accepts Identity JWTs and deprecated API/User keys.
The response is of type object.
object