This tutorial shows you how to build your own local Copilot with the CodeLlama model.
First, make sure you've installed the NVIDIA driver and CUDA Toolkit as described in the Prepare the CUDA environment in AWS G5 instances under Ubuntu 24.04 article.
Next, install Ollama; the Setup Ollama article walks through the steps.
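If you haven't installed it yet, the official install script is the quickest route (review the script first if piping remote code into a shell concerns you):

curl -fsSL https://ollama.com/install.sh | sh

Once Ollama is running, pull the two CodeLlama models: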
ollama run codellama:7b-instruct
ollama run codellama:7b-code
You may have noticed that we installed two models: codellama:7b-code is for auto-complete (it supports fill-in-the-middle), while codellama:7b-instruct is for chat. We will wire them up in Twinny below.
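You can confirm that both models are available locally and that the Ollama server is listening on its default port (11434):

ollama list
curl http://localhost:11434/api/tags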
Install VS Code:
sudo apt install code
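Note that the code package comes from Microsoft's apt repository, not Ubuntu's default archive. If you haven't added it yet, here is a sketch following Microsoft's published instructions (verify against the current VS Code docs):

wget -qO- https://packages.microsoft.com/keys/microsoft.asc | gpg --dearmor > packages.microsoft.gpg
sudo install -D -o root -g root -m 644 packages.microsoft.gpg /etc/apt/keyrings/packages.microsoft.gpg
echo "deb [arch=amd64,arm64,armhf signed-by=/etc/apt/keyrings/packages.microsoft.gpg] https://packages.microsoft.com/repos/code stable main" | sudo tee /etc/apt/sources.list.d/vscode.list
sudo apt update

Then re-run the install command above.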
Alternatively, you can download the .deb package from the VS Code website and install it manually.
Install the Twinny extension: open the Extensions view in VS Code (Ctrl+Shift+X), search for "Twinny", and click Install.
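You can also install it from the command line; this assumes the extension's marketplace ID is still rjmacarthy.twinny, which it is at the time of writing:

code --install-extension rjmacarthy.twinny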
Then configure Twinny's two providers with the following settings:
Auto-complete
Hostname: localhost
Port: 11434
Path: /api/generate
Model Name: codellama:7b-code
FIM Template: codellama
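Before relying on the extension, it's worth checking that the auto-complete endpoint answers. A minimal sanity check (the prompt is just an example):

curl http://localhost:11434/api/generate -d '{"model": "codellama:7b-code", "prompt": "def fibonacci(n):", "stream": false}'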
Chat
Hostname: localhost
Port: 11434
Path: /v1/chat/completions
Model Name: codellama:7b-instruct
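The chat provider talks to Ollama's OpenAI-compatible endpoint, which recent Ollama versions expose. A quick sanity check:

curl http://localhost:11434/v1/chat/completions -H "Content-Type: application/json" -d '{"model": "codellama:7b-instruct", "messages": [{"role": "user", "content": "Explain what a mutex is in one sentence."}]}'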
Now everything is ready. Code complete: Twinny suggests code inline as you type; press Tab to accept a suggestion. Chat: open the Twinny panel from the activity bar and ask questions about your code.