
[New Feature] Add a new route to the dllama API for embedding models #96

Open
testing0mon21 opened this issue Jul 1, 2024 · 5 comments
Comments

@testing0mon21

std::vector<Route> routes = {
    {
        "/v1/chat/completions",
        HttpMethod::METHOD_POST,
        std::bind(&handleCompletionsRequest, std::placeholders::_1, &api)
    },
    {
        "/v1/models",
        HttpMethod::METHOD_GET,
        std::bind(&handleModelsRequest, std::placeholders::_1)
    }
};

In the dllama API on the master branch we have only two routes, /v1/chat/completions and /v1/models, but some models, such as llama3:8b, have embedding functionality. Can you add a new route for /api/embeddings?
@testing0mon21
Author

@b4rtaz what do you think about new route?

@testing0mon21
Author

I mean this, for the dllama API:

Generate Embeddings

POST /api/embeddings

Generate embeddings from a model

@testing0mon21
Author

I hope I explained it clearly :) @b4rtaz

@testing0mon21
Author

What do you think about this proposal, or should I explain it in more detail? @b4rtaz

@b4rtaz
Owner

b4rtaz commented Jul 10, 2024

Hello @testing0mon21. I'm not too familiar with embeddings, but if I see correctly, llama.cpp supports them. This is not a priority for me, but contributions are welcome.
