Interface GoogleVertexAITextInput

Interface representing the input to the Google Vertex AI model.

Hierarchy

GoogleVertexAIBaseLLMInput<GoogleAuthOptions>
- GoogleVertexAITextInput

Index

Properties

apiVersion? authOptions? cache? callbackManager? callbacks? concurrency? endpoint? location? maxConcurrency? maxOutputTokens? maxRetries? metadata? model? onFailedAttempt? tags? temperature? topK? topP? verbose?

Properties

`Optional` apiVersion

apiVersion?: string

The version of the API functions. Part of the path.

`Optional` authOptions

authOptions?: GoogleAuthOptions<JSONClient>

`Optional` cache

cache?: boolean | BaseCache<Generation[]>

`Optional` callbackManager

callbackManager?: CallbackManager

⚠️ Deprecated ⚠️

Use callbacks instead

This feature is deprecated and will be removed in the future.

It is not recommended for use.

`Optional` callbacks

callbacks?: Callbacks

`Optional` concurrency

concurrency?: number

Deprecated

Use maxConcurrency instead

`Optional` endpoint

endpoint?: string

Hostname for the API call

`Optional` location

location?: string

Region where the LLM is stored

`Optional` maxConcurrency

maxConcurrency?: number

The maximum number of concurrent calls that can be made. Defaults to Infinity, which means no limit.

`Optional` maxOutputTokens

maxOutputTokens?: number

Maximum number of tokens to generate in the completion.

`Optional` maxRetries

maxRetries?: number

The maximum number of retries that can be made for a single call, with an exponential backoff between each attempt. Defaults to 6.

`Optional` metadata

metadata?: Record<string, unknown>

`Optional` model

model?: string

Model to use

`Optional` onFailedAttempt

onFailedAttempt?: FailedAttemptHandler

Custom handler to handle failed attempts. Takes the originally thrown error object as input, and should itself throw an error if the input error is not retryable.

`Optional` tags

tags?: string[]

`Optional` temperature

temperature?: number

Sampling temperature to use

`Optional` topK

topK?: number

Top-k changes how the model selects tokens for output.

A top-k of 1 means the selected token is the most probable among all tokens in the model’s vocabulary (also called greedy decoding), while a top-k of 3 means that the next token is selected from among the 3 most probable tokens (using temperature).

`Optional` topP

topP?: number

Top-p changes how the model selects tokens for output.

Tokens are selected from most probable to least until the sum of their probabilities equals the top-p value.

For example, if tokens A, B, and C have a probability of .3, .2, and .1 and the top-p value is .5, then the model will select either A or B as the next token (using temperature).

`Optional` verbose

verbose?: boolean

Interface GoogleVertexAITextInput

Hierarchy

Index

Properties

Properties

`Optional` apiVersion

`Optional` authOptions

`Optional` cache

`Optional` callbackManager

⚠️ Deprecated ⚠️

`Optional` callbacks

`Optional` concurrency

Deprecated

`Optional` endpoint

`Optional` location

`Optional` maxConcurrency

`Optional` maxOutputTokens

`Optional` maxRetries

`Optional` metadata

`Optional` model

`Optional` onFailedAttempt

`Optional` tags

`Optional` temperature

`Optional` topK

`Optional` topP

`Optional` verbose

Settings

Member Visibility

Theme

On This Page

Interface GoogleVertexAITextInput

Hierarchy

Index

Properties

Properties

Optional apiVersion

Optional authOptions

Optional cache

Optional callbackManager

⚠️ Deprecated ⚠️

Optional callbacks

Optional concurrency

Deprecated

Optional endpoint

Optional location

Optional maxConcurrency

Optional maxOutputTokens

Optional maxRetries

Optional metadata

Optional model

Optional onFailedAttempt

Optional tags

Optional temperature

Optional topK

Optional topP

Optional verbose

Settings

Member Visibility

Theme

On This Page

`Optional` apiVersion

`Optional` authOptions

`Optional` cache

`Optional` callbackManager

`Optional` callbacks

`Optional` concurrency

`Optional` endpoint

`Optional` location

`Optional` maxConcurrency

`Optional` maxOutputTokens

`Optional` maxRetries

`Optional` metadata

`Optional` model

`Optional` onFailedAttempt

`Optional` tags

`Optional` temperature

`Optional` topK

`Optional` topP

`Optional` verbose