
improvement: Add vertex embeddings support #622

Merged
merged 7 commits on Oct 1, 2024

Conversation

elentaure (Contributor)

Title:

  • Add support for embeddings in Vertex AI

Description:
Based on the embeddings code for Google AI Studio, modified to adapt to the Vertex API differences.

@narengogi (Contributor) left a comment:

I tried making a request to this provider locally; the request failed when it was OpenAI-compliant. Suggested changes are in the comments.

Failing request

curl --location 'http://localhost:8787/v1/embeddings' \
--header 'x-portkey-provider: vertex-ai' \
--header 'x-portkey-vertex-region: us-central1' \
--header 'Content-Type: application/json' \
--header 'x-portkey-api-key: jd-' \
--header 'Authorization: ya29.c....' \
--header 'x-portkey-vertex-project-id: {{YOUR_PROJECT_ID}}' \
--data-raw '{
    "model": "textembedding-gecko@001",
    "input": "Hello this is a test"
}'
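For context, Vertex's text-embedding endpoint expects an `instances` array rather than OpenAI's `input` field, which is the transformation the gateway has to perform. A minimal sketch of that mapping (the helper name `toVertexBody` is hypothetical, not gateway code):

```typescript
// Sketch: how an OpenAI-style embeddings body maps onto the Vertex
// text-embeddings request shape ({ instances: [{ content }] }).
interface OpenAIEmbedBody {
  model: string;
  input: string | string[];
}

function toVertexBody(body: OpenAIEmbedBody): { instances: { content: string }[] } {
  // Normalize string input to an array, then wrap each item as an instance.
  const inputs = Array.isArray(body.input) ? body.input : [body.input];
  return { instances: inputs.map((content) => ({ content })) };
}

const vertexBody = toVertexBody({
  model: 'textembedding-gecko@001',
  input: 'Hello this is a test',
});
// returns { instances: [{ content: 'Hello this is a test' }] }
```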

@@ -68,3 +70,49 @@ export interface VertexLlamaChatCompleteStreamChunk {
created?: number;
provider?: string;
}

export const GoogleErrorResponseTransform: (
Contributor:

This method can be moved to a utils.ts file (preferred), or just to the embed.ts file if it's only being used there.

Contributor (Author):

Moved

import { GOOGLE_VERTEX_AI } from '../../globals';
import { generateInvalidProviderResponseError } from '../utils';

export const GoogleEmbedConfig: ProviderConfig = {
Contributor:

The request structure is incorrect: you've typed `params` as `VertexEmbedParams`, which wouldn't be OpenAI-compliant. The gateway transforms an OpenAI embeddings request into a Vertex embeddings request, so the code should be as below:

export interface EmbedInstancesData {
  content: string;
}

export const GoogleEmbedConfig: ProviderConfig = {
  input: {
    param: 'instances',
    required: true,
    transform: (params: EmbedParams): Array<EmbedInstancesData> => {
      const instances = Array<EmbedInstancesData>();
      if (Array.isArray(params.input)) {
        params.input.forEach((text) => {
          instances.push({
            content: text,
          });
        });
      } else {
        instances.push({
          content: params.input,
        });
      }
      return instances;
    },
  },
  parameters: {
    param: 'parameters',
    required: false,
  },
};
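As a quick sanity check, the suggested `transform` fans both input shapes (string and string array) into `instances`. Here is the same logic extracted as a standalone, runnable sketch, with the relevant types inlined for self-containment:

```typescript
// Types inlined so the sketch runs on its own; in the gateway these
// come from the provider type definitions.
interface EmbedInstancesData {
  content: string;
}
interface EmbedParams {
  input: string | string[];
}

// Same logic as the suggested transform, extracted as a plain function.
const transform = (params: EmbedParams): Array<EmbedInstancesData> => {
  const instances = Array<EmbedInstancesData>();
  if (Array.isArray(params.input)) {
    params.input.forEach((text) => {
      instances.push({ content: text });
    });
  } else {
    instances.push({ content: params.input });
  }
  return instances;
};

// transform({ input: ['a', 'b'] }) returns [{ content: 'a' }, { content: 'b' }]
// transform({ input: 'x' }) returns [{ content: 'x' }]
```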

Contributor:

A better implementation, with support for `task_type`:

export interface EmbedInstancesData {
  content: string;
  task_type?: TASK_TYPE | string;
}

enum TASK_TYPE {...}

interface GoogleEmbedParams extends EmbedParams {
  task_type?: TASK_TYPE | string;
}

export const GoogleEmbedConfig: ProviderConfig = {
  input: {
    param: 'instances',
    required: true,
    transform: (params: GoogleEmbedParams): Array<EmbedInstancesData> => {
      const instances = Array<EmbedInstancesData>();
      if (Array.isArray(params.input)) {
        params.input.forEach((text) => {
          instances.push({
            content: text,
            task_type: params.task_type,
          });
        });
      } else {
        instances.push({
          content: params.input,
          task_type: params.task_type,
        });
      }
      return instances;
    },
  },
  parameters: {
    param: 'parameters',
    required: false,
  },
};

Contributor (Author):

Modified according to the suggestions. Please check whether it now works correctly with task type and the SDK.

@narengogi (Contributor) left a comment:

Looks good!!

@VisargD (Collaborator) commented on Oct 1, 2024:

Thanks for the PR! We will merge this today.

narengogi added a commit to Portkey-AI/docs-core that referenced this pull request Oct 1, 2024
@VisargD VisargD merged commit 4840893 into Portkey-AI:main Oct 1, 2024
1 check passed
@VisargD (Collaborator) commented on Oct 3, 2024:

Hey @elentaure - I noticed one difference while going through the Vertex embeddings documentation. The gateway currently builds the usage object from `response.metadata.billableCharacterCount`, but that gives the character count, not the token count. To stay OpenAI-schema compliant, it would be better to use `statistics.token_count` for the usage object.

Documentation for the tokenCount parameter: https://cloud.google.com/vertex-ai/generative-ai/docs/model-reference/text-embeddings-api#response_body

{
  "predictions": [
    {
      "embeddings": {
        "statistics": {
          "truncated": boolean,
          "token_count": integer
        },
        "values": [ number ]
      }
    }
  ]
}
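Given this response schema, the fix amounts to summing `statistics.token_count` across predictions when building the OpenAI-style usage object. A hedged sketch of that change (the function name `toOpenAIUsage` is illustrative; the actual gateway response transform may be structured differently):

```typescript
// Shape of one prediction in the Vertex text-embeddings response body,
// per the schema quoted above.
interface VertexEmbedPrediction {
  embeddings: {
    statistics: { truncated: boolean; token_count: number };
    values: number[];
  };
}

// Sketch: derive an OpenAI-style usage object from statistics.token_count
// instead of metadata.billableCharacterCount (which counts characters).
function toOpenAIUsage(predictions: VertexEmbedPrediction[]) {
  const promptTokens = predictions.reduce(
    (sum, p) => sum + p.embeddings.statistics.token_count,
    0
  );
  return { prompt_tokens: promptTokens, total_tokens: promptTokens };
}
```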

I will raise a quick PR to make this change.

3 participants