
Adding support for Vertex AI Matching Engine #289

Closed

Conversation

pascalconfluent
Contributor

No description provided.

Owner

@langchain4j langchain4j left a comment

Hi @pascalconfluent thanks a lot for your contribution!
I have left some comments, please check them, thanks!

langchain4j-vertex-ai/pom.xml (resolved review comments)

@Override
protected double getCosineSimilarity(Embedding embedding, Embedding referenceEmbedding) {
return CosineSimilarity.between(embedding, referenceEmbedding);
Owner

It would be better to keep the behavior of all embedding stores the same. Score should be a number from 0 to 1.

Contributor Author

In Vertex AI it depends on the algorithm you choose when creating the index. The default one used in the unit tests is not recommended by Google. But I am happy to revert and put a note in the unit tests.

Owner

Which algorithms are supported, and which is the default one? I can't find it in the code...

Contributor Author

It is defined during index creation. Here are the different algorithms that can be configured:

- SQUARED_L2_DISTANCE: Euclidean (L2) distance.
- L1_DISTANCE: Manhattan (L1) distance.
- DOT_PRODUCT_DISTANCE: default value. Defined as the negative of the dot product.
- COSINE_DISTANCE: cosine distance. Google strongly suggests using DOT_PRODUCT_DISTANCE + UNIT_L2_NORM instead of the COSINE distance. Their algorithms have been more optimized for the DOT_PRODUCT distance, and when combined with UNIT_L2_NORM it offers the same ranking and mathematical equivalence as the COSINE distance.
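For context, here is a small standalone sketch (not part of the PR) of why DOT_PRODUCT_DISTANCE combined with UNIT_L2_NORM ranks the same as COSINE_DISTANCE: once both vectors are L2-normalized, their dot product is exactly the cosine similarity of the original vectors.

```java
// Illustrative only: the dot product of L2-normalized vectors equals
// the cosine similarity of the original vectors.
public class DotProductVsCosine {

    static double dot(double[] a, double[] b) {
        double sum = 0;
        for (int i = 0; i < a.length; i++) {
            sum += a[i] * b[i];
        }
        return sum;
    }

    static double[] l2Normalize(double[] v) {
        double norm = Math.sqrt(dot(v, v));
        double[] out = new double[v.length];
        for (int i = 0; i < v.length; i++) {
            out[i] = v[i] / norm;
        }
        return out;
    }

    public static void main(String[] args) {
        double[] a = {1.0, 2.0, 3.0};
        double[] b = {4.0, 5.0, 6.0};

        double cosine = dot(a, b) / (Math.sqrt(dot(a, a)) * Math.sqrt(dot(b, b)));
        double dotOfNormalized = dot(l2Normalize(a), l2Normalize(b));

        // Both print ~0.9746, so the two measures produce the same ranking.
        System.out.println(cosine);
        System.out.println(dotOfNormalized);
    }
}
```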

Contributor Author

I am using the default one for testing!

Owner

@langchain4j langchain4j Dec 4, 2023

Hmm... is the index created automatically by your implementation, or does the user create it manually? I see that you create a VertexAiEmbeddingIndex each time addAll() is called, but I can't find the code responsible for defining the algorithm in your implementation...

But my initial point was that instead of using CosineSimilarity.between(embedding, referenceEmbedding), please use RelevanceScore.fromCosineSimilarity(CosineSimilarity.between(embedding, referenceEmbedding)). This way it will be in line with all other integrations. And it should work the same for both cosine similarity and dot product with L2 normalization.
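A minimal sketch of the requested change (assuming langchain4j's CosineSimilarity and RelevanceScore utilities mentioned above; the wrapping class and method are just for illustration):

```java
import dev.langchain4j.data.embedding.Embedding;
import dev.langchain4j.store.embedding.CosineSimilarity;
import dev.langchain4j.store.embedding.RelevanceScore;

class RelevanceScoreSketch {

    // Map cosine similarity in [-1, 1] to a relevance score in [0, 1],
    // so this store behaves like the other embedding store integrations.
    static double relevanceScore(Embedding embedding, Embedding referenceEmbedding) {
        double cosineSimilarity = CosineSimilarity.between(embedding, referenceEmbedding);
        return RelevanceScore.fromCosineSimilarity(cosineSimilarity);
    }
}
```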

Contributor Author

Each list has an index file attached to it. This index file is used by the Matching Engine to index it. If you create a batch-mode index, the Matching Engine scans all the files to add the embeddings to the index. In short, an index is created in two phases: first you create an index in Matching Engine (that is where you define the algorithm, etc.), then you create files that represent a list of embeddings. I implemented the second phase; I will probably do the first one as well, but it can also be done manually using the UI.

For the relevance score, yes, I can do that and assume that the index was created with the right algorithm.

By the way, I looked at the implementation in Python's LangChain and they are doing the same.
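For reference, a rough sketch of what one line of such an embeddings file could look like. The JSON field names ("id", "embedding") follow my reading of the Vertex AI batch index input format and are an assumption, not taken from this PR:

```java
import java.util.List;
import java.util.stream.Collectors;

class IndexFileLineSketch {

    // Serialize one datapoint to a JSON-lines entry that a batch Matching Engine
    // index could read from GCS. Field names are an assumption (see note above).
    static String toJsonLine(String id, List<Float> vector) {
        String values = vector.stream()
                .map(String::valueOf)
                .collect(Collectors.joining(", "));
        return String.format("{\"id\": \"%s\", \"embedding\": [%s]}", id, values);
    }

    public static void main(String[] args) {
        // prints: {"id": "42", "embedding": [0.1, 0.2, 0.3]}
        System.out.println(toJsonLine("42", List.of(0.1f, 0.2f, 0.3f)));
    }
}
```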

final IndexDatapoint resultDatapoint = neighbor.getDatapoint();
return new EmbeddingMatch<>(neighbor.getDistance(),
id,
Embedding.from(resultDatapoint.getFeatureVectorList()),
Owner

Since you made returnFullDatapoint configurable, a check is needed here that the embedding was actually returned.

Contributor Author

It returns an empty array. I think we should also return an empty embedding.

Owner

An empty Embedding makes no sense; if we don't have an embedding, it is better to return null.

Contributor Author

That's debatable. The same goes for strings and other empty/null entities.
Anyway, I am happy to return null instead of empty when there is no result.
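A minimal sketch of what the agreed behavior could look like, based on the snippet quoted above (the wrapper method and the null for the embedded TextSegment are just for illustration, not the PR's final code):

```java
import com.google.cloud.aiplatform.v1.IndexDatapoint;
import dev.langchain4j.data.embedding.Embedding;
import dev.langchain4j.data.segment.TextSegment;
import dev.langchain4j.store.embedding.EmbeddingMatch;

class NeighborMappingSketch {

    // When returnFullDatapoint is false, Vertex AI returns an empty feature vector,
    // so the match carries null instead of an empty Embedding.
    static EmbeddingMatch<TextSegment> toMatch(double distance, String id, IndexDatapoint datapoint) {
        Embedding embedding = datapoint.getFeatureVectorCount() == 0
                ? null
                : Embedding.from(datapoint.getFeatureVectorList());
        // the last argument (the embedded TextSegment) is left null here for brevity
        return new EmbeddingMatch<>(distance, id, embedding, null);
    }
}
```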

@langchain4j
Owner

BTW, is this the recommended way to store and search embeddings using Vertex AI? It somehow feels overly complicated and in some places inefficient.

log.info("Uploading {} index to GCS.", filename);

// Upload the index to GCS
getGcpBlobService().upload(filename, index.toString());
Owner

The LangChain implementation seems to upload embeddings to a GCS bucket first and then update the index, providing a reference to the bucket (where to read the embeddings from). But in your implementation you provide the embeddings directly in the index, so it feels that uploading them to GCS is unnecessary. Could you please remove this line and test whether it still works? Thanks!

Owner

@langchain4j langchain4j left a comment

Hi @pascalconfluent, could you please check my comments? Thanks!

@geoand
Contributor

geoand commented Jun 5, 2024

I assume this has gone pretty stale at this point :)

@langchain4j
Owner

AFAIK @glaforge wanted to work on adding Vertex AI Search soon, so maybe this PR can be revived?

@glaforge
Collaborator

It would be interesting to know if @pascalconfluent wants to update the PR.
We'll have to add support for metadata filtering as well, I guess.
But it would be great to see Vertex AI Search support (it used to be called Matching Engine).

@pascalconfluent
Contributor Author

pascalconfluent commented Jun 17, 2024 via email

@langchain4j langchain4j added the P3 Medium priority label Sep 10, 2024
@langchain4j
Owner

@pascalconfluent closing this PR due to inactivity. Please feel free to reopen if/when needed.

Labels: P3 Medium priority
Participants: 4