[Bug] LlamaIndex embeddings using wrong method #515
Comments
The object is used to call the functions, but I agree that we might want to consider get_text_embedding_batch().
Right, but I think we also don't want to ignore the fact that it's pretty common for embedding models to embed queries and documents differently (Cohere and Nomic are both examples that need this). Maybe this needs to be a parameter on the request?
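For illustration, a minimal sketch of that idea, assuming LlamaIndex's TEI integration (the llama-index-embeddings-text-embeddings-inference package); the `input_type` request parameter, model name, and URL are hypothetical, not part of the current GenAIComps schema:

```python
# Sketch only: dispatching on a hypothetical `input_type` request
# parameter. Model name and TEI URL below are placeholder assumptions.
from llama_index.embeddings.text_embeddings_inference import (
    TextEmbeddingsInference,
)

embed_model = TextEmbeddingsInference(
    model_name="BAAI/bge-large-en-v1.5",  # placeholder model
    base_url="http://localhost:8080",     # assumed TEI endpoint
)

def embed(text: str, input_type: str = "document") -> list[float]:
    # Models like Cohere and Nomic embed queries and documents
    # differently, so the caller's intent has to reach the model.
    if input_type == "query":
        return embed_model.get_query_embedding(text)
    return embed_model.get_text_embedding(text)
```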
I've also noticed that the embeddings API in general only works by sending a single piece of text at a time. This is neither efficient nor OpenAI compatible, sadly. Related PR: run-llama/llama_index#16666
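For reference, the OpenAI /v1/embeddings endpoint accepts `input` as either a single string or a list of strings, so batching in one request is part of compatibility. A minimal sketch (the service URL and model name here are assumptions):

```python
# The OpenAI-compatible embeddings API takes `input` as a string OR a
# list of strings; URL and model name below are assumptions.
import requests

resp = requests.post(
    "http://localhost:6000/v1/embeddings",  # hypothetical service URL
    json={
        "model": "BAAI/bge-large-en-v1.5",
        "input": ["first document", "second document"],  # batched input
    },
    timeout=30,
)
resp.raise_for_status()
# One embedding per input, in order.
vectors = [item["embedding"] for item in resp.json()["data"]]
```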
Hi, this problem has been fixed in PR #892 |
@logan-markewich Could you please help close the issue if the PR works? Thanks. |
The LlamaIndex TEI embeddings are using a private method (GenAIComps/comps/embeddings/llama_index/embedding_tei.py, line 21 at commit cd83854).
It's not clear to me whether the embedding service is meant to handle query embeddings or document embeddings. Either way, we should be using embed_model.get_text_embedding(text) or embed_model.get_query_embedding(query).
Likely we should have two endpoints, one for query and one for normal text documents.
We might also want to consider using get_text_embedding_batch() instead of processing one document at a time, but again, it depends on how we want to define our embedding endpoints.
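A rough sketch of what that two-endpoint, batched shape could look like, built on LlamaIndex's public BaseEmbedding interface; the handler names, model name, and URL are illustrative assumptions, not the actual GenAIComps routes:

```python
# Illustrative only: two entry points using LlamaIndex's public
# embedding API; handler names are hypothetical.
from llama_index.embeddings.text_embeddings_inference import (
    TextEmbeddingsInference,
)

embed_model = TextEmbeddingsInference(
    model_name="BAAI/bge-large-en-v1.5",  # placeholder model
    base_url="http://localhost:8080",     # assumed TEI endpoint
)

def embed_query(query: str) -> list[float]:
    # Query-side embedding via the public method.
    return embed_model.get_query_embedding(query)

def embed_documents(texts: list[str]) -> list[list[float]]:
    # Public batch API: embeds many documents per call instead of
    # issuing one request per document.
    return embed_model.get_text_embedding_batch(texts)
```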