🦙🌲🤏 Cerebras-LoRA

This repository contains code for reproducing the Stanford Alpaca results using low-rank adaptation (LoRA) and Cerebras-GPT 6.7B instead of LLama 7B. We provide an Instruct model of similar quality to text-davinci-003 that can run on a Raspberry Pi (for research), and the code is easily extended to the 13b, 30b, and 65b models.

In addition to the training code, which runs within hours on a single RTX 4090, we publish a script for downloading and inference on the foundation model and LoRA, as well as the resulting LoRA weights themselves. To fine-tune cheaply and efficiently, we use Hugging Face's PEFT as well as Tim Dettmers' bitsandbytes.

Without hyperparameter tuning, the LoRA model produces outputs comparable to the Stanford Alpaca model. (Please see the outputs included below.) Further tuning might be able to achieve better performance; I invite interested users to give it a try and report their results.

Quickstart with 🤗Transformers:

from transformers import AutoTokenizer, AutoModelForCausalLM, pipeline
from peft import PeftModel
tokenizer = AutoTokenizer.from_pretrained("cerebras/Cerebras-GPT-6.7B")
model = AutoModelForCausalLM.from_pretrained("cerebras/Cerebras-GPT-6.7B", torch_dtype=torch.float16, device_map='auto', load_in_8bit=True)
model = PeftModel.from_pretrained(model, "bjoernp/alpaca-cerebras-6.7B", torch_dtype=torch.float16, device_map='auto')
pipe = pipeline("text-generation", model=model, tokenizer=tokenizer)
generated_text = pipe("Tell me about alpacas.", max_length=50, do_sample=False, no_repeat_ngram_size=2)[0]

Local Setup

Install dependencies
```
pip install -r requirements.txt
```
If bitsandbytes doesn't work, install it from source. Windows users can follow these instructions.

Training (`finetune.py`)

This file contains a straightforward application of PEFT to the LLaMA model, as well as some code related to prompt construction and tokenization. PRs adapting this code to support larger models are always welcome.

Inference (`generate.py`)

This file reads the foundation model from the Hugging Face model hub and the LoRA weights from tloen/alpaca-lora-7b, and runs a Gradio interface for inference on a specified input. Users should treat this as example code for the use of the model, and modify it as needed.

Example usage:

python generate.py \
    --load_8bit \
    --base_model 'cerebras/Cerebras-GPT-6.7B' \
    --lora_weights 'bjoernp/alpaca-cerebras-6.7B'

Official weights

The most recent Alpaca-LoRA adapter available at bjoernp/alpaca-cerebras-6.7B was trained on March 30 with the following command:

python finetune.py

Finetuning took ~5.5 hours on a 3090Ti

Docker Setup & Inference

Build the container image:

docker build -t alpaca-lora .

Run the container (you can also use finetune.py and all of its parameters as shown above for training):

docker run --gpus=all --shm-size 64g -p 7860:7860 -v ${HOME}/.cache:/root/.cache --rm alpaca-lora generate.py \
    --load_8bit \
    --base_model 'decapoda-research/llama-7b-hf' \
    --lora_weights 'tloen/alpaca-lora-7b'

Open https://localhost:7860 in the browser

Docker Compose Setup & Inference

(optional) Change desired model and weights under environment in the docker-compose.yml
Build and run the container

docker-compose up -d --build

Open https://localhost:7860 in the browser
See logs:

docker-compose logs -f

Clean everything up:

docker-compose down --volumes --rmi all

Notes

We can likely improve our model performance significantly if we had a better dataset. Consider supporting the LAION Open Assistant effort to produce a high-quality dataset for supervised fine-tuning (or bugging them to release their data).
We're continually fixing bugs and conducting training runs, and the weights on the Hugging Face Hub are being updated accordingly. In particular, those facing issues with response lengths should make sure that they have the latest version of the weights and code.
Users with multiple GPUs should take a look here.
We include the Stanford Alpaca dataset, which was made available under the ODC Attribution License.

Example outputs

Instruction: Tell me about alpacas.

Cerebras-GPT: Alpacas are one of the oldest domesticated animals in the world. They are native to the Andes Mountains in South America and are closely related to llamas and vicunas. Alpacas can be found in a variety of habitats, including the Andes, the Amazon rainforest, and the deserts of North America. Alpacas are herbivores that eat a variety of plants, including grasses, shrubs, and trees. Alpacas can be found in a variety of colors, including white, black, and brown.

Alpaca-LoRA: Alpacas are members of the camelid family and are native to the Andes Mountains of South America. They are known for their soft, luxurious fleece, which is used to make clothing, blankets, and other items. Alpacas are herbivores and graze on grasses and other plants. They are social animals and live in herds of up to 20 individuals.

Stanford Alpaca: Alpacas are small, fluffy animals related to camels and llamas. They are native to Peru and Bolivia, and were first domesticated around 5,000 years ago. They are kept mainly for their fine, soft fleece, which is used to make knitwear and other garments. Alpacas are herd animals and live in small family groups, led by an older male. They are highly social animals and can form strong bonds with their owners.

text-davinci-003: Alpacas are a domesticated species of South American camelid, similar to llamas. They are native to the Andes Mountains in South America and are kept as livestock for their soft, luxurious wool. Alpacas are social animals and live in herds of up to 20 individuals. They are typically kept in small herds of two to five animals, and are relatively easy to care for. Alpacas are herbivores and feed on grass, hay, and other vegetation. They are also known for their gentle and friendly nature, making them popular as pets.

Instruction: Tell me about the president of Mexico in 2019.

Cerebras-GPT: The president of Mexico in 2019 is Andrés Manuel López Obrador (AMLO). AMLO is a former mayor of Mexico City and a member of the Institutional Revolutionary Party (PRI). AMLO was elected president in 2018, defeating the incumbent, Enrique Peña Nieto. AMLO is known for his progressive policies, such as increasing the minimum wage, providing free education and healthcare to all Mexicans, and reducing poverty. AMLO has also been a vocal supporter of the #MeToo movement, calling for an end to sexual harassment in the workplace.

Alpaca-LoRA: The president of Mexico in 2019 was Andrés Manuel López Obrador, who took office on December 1, 2018. He is a member of the National Regeneration Movement (MORENA) political party and is the first left-wing president of Mexico since 1946. He is known for his anti-corruption and anti-neoliberal policies, as well as his commitment to improving the living conditions of the Mexican people.

Stanford Alpaca: In 2019, Andres Manuel Lopez Obrador was sworn in as President of Mexico. He is the country’s 58th president and the first from the left-wing Morena party. He was elected in 2018 on a promise to combat corruption and reduce poverty and has faced criticism for his handling of the economy, crime and the country’s relationship with the United States. He is also known for his ambitious social welfare programs and has been praised for raising the minimum wage and providing aid to low-income families. He is seen as a champion of democracy and human rights in Mexico.

text-davinci-003: The president of Mexico in 2019 is Andrés Manuel López Obrador, commonly known as AMLO. He is a Mexican politician who has been the President of Mexico since December 1, 2018. He is a member of the National Regeneration Movement (MORENA) party and previously served as the Head of Government of Mexico City from 2000 to 2005. He is a left-wing populist and has promised to reduce corruption and inequality in Mexico. He has also promised to increase public spending on social programs and infrastructure projects.

Instruction: Tell me about the king of France in 2019.

Cerebras-GPT: In 2019, the king of France is Emmanuel Macron. He was elected in 2017 and is the youngest French president in history. He is known for his progressive policies and his focus on the environment. He is also known for his efforts to modernize the French economy and for his efforts to diversify the country's economy.

Alpaca-LoRA: The King of France in 2019 is Emmanuel Macron, who was elected in May 2017. He is the youngest president in the history of the Fifth Republic and the first president to be born after World War II. He is also the first president to have never held elected office before.

Stanford Alpaca: In 2019, the King of France was Emmanuel Macron. He was sworn in as President of France on May 14, 2017, and has been leading the country ever since. He is a member of the centrist La République En Marche! Party. He is known for his reformist agenda and has faced a number of protests from both the right and left of the political spectrum. He is the youngest French president in history.

text-davinci-003: The King of France is a largely symbolic role, as France is a republic. The current President of France is Emmanuel Macron, who was elected in 2017.

Instruction: List all Canadian provinces in alphabetical order.

Cerebras-GPT: Alberta, British Columbia, Manitoba, New Brunswick, Newfoundland and Labrador, Nova Scotia, Ontario, Prince Edward Island, Quebec, Saskatchewan, Yukon.

Alpaca-LoRA: Alberta, British Columbia, Manitoba, New Brunswick, Newfoundland and Labrador, Nova Scotia, Ontario, Prince Edward Island, Quebec, Saskatchewan

Stanford Alpaca: Alberta, British Columbia, Manitoba, New Brunswick, Newfoundland and Labrador, Nova Scotia, Ontario, Prince Edward Island, Quebec, Saskatchewan, Northwest Territories, Nunavut, Yukon.

text-davinci-003:

Alberta
British Columbia
Manitoba
New Brunswick
Newfoundland and Labrador
Northwest Territories
Nova Scotia
Nunavut
Ontario
Prince Edward Island
Quebec
Saskatchewan
Yukon

Instruction: Write a Python program that prints the first 10 Fibonacci numbers.

Cerebras-GPT:

def fib(n):
    if n == 0:
        return 0
    elif n == 1:
        return 1
    else:
        return fib(n-1) + fib(n-2)

print(fib(10))

Alpaca-LoRA:

def fibonacci(n):
    if n == 0:
        return 0
    elif n == 1:
        return 1
    else:
        return fibonacci(n-1) + fibonacci(n-2)

print(fibonacci(10))

Stanford Alpaca:

numbers = [0, 1]

for i in range(2, 11):
    numbers.append(numbers[i-2] + numbers[i-1])

print(numbers)
# Output: [0, 1, 1, 2, 3, 5, 8, 13, 21, 34]

text-davinci-003:

# Fibonacci numbers are the numbers in the following integer sequence, called the Fibonacci sequence, and characterized by the fact that every number after the first two is the sum of the two preceding ones.

# Fibonacci sequence: 0, 1, 1, 2, 3, 5, 8, 13, 21, 34, ...

# Program:

# Initialize the first two numbers of the Fibonacci sequence
a = 0
b = 1

# Print the first 10 numbers of the Fibonacci sequence
print(a)
print(b)

for i in range(2,10):
    c = a + b
    a = b
    b = c
    print(c)

Instruction: Write a program that prints the numbers from 1 to 100. But for multiples of three print 'Fizz' instead of the number and for the multiples of five print 'Buzz'. For numbers which are multiples of both three and five print 'FizzBuzz'.

Cerebras-GPT:

#include <stdio.h>

int main()
{
    int i;
    for (i = 1; i <= 100; i++)
    {
        if (i % 3 == 0)
        {
            printf("Fizz\n");
        }
        else if (i % 5 == 0)
        {
            printf("Buzz\n");
        }
        else
        {
            printf("%d\n", i);
        }
    }
    return 0;
}

Alpaca-LoRA:

for i in range(1, 101):
    if i % 3 == 0 and i % 5 == 0:
        print("FizzBuzz")
    elif i % 3 == 0:
        print("Fizz")
    elif i % 5 == 0:
        print("Buzz")
    else:
        print(i)

Stanford Alpaca:

for i in range(1, 101):
    if i % 3 == 0 and i % 5 == 0:
        print('FizzBuzz')
    elif i % 3 == 0:
        print('Fizz')
    elif i % 5 == 0:
        print('Buzz')
    else:
        print(i)

text-davinci-003:

for (let i = 1; i <= 100; i++) {
  if (i % 3 === 0 && i % 5 === 0) {
    console.log("FizzBuzz");
  } else if (i % 3 === 0) {
    console.log("Fizz");
  } else if (i % 5 === 0) {
    console.log("Buzz");
  } else {
    console.log(i);
  }
}

Instruction: Tell me five words that rhyme with 'shock'.

Cerebras-GPT: Five words that rhyme with'shock' are:

Shocked
Shock
Shock
Shock
Shock

Alpaca-LoRA: Flock, lock, rock, stock, and sock.

Stanford Alpaca: Five words that rhyme with shock are: rock, pop, shock, cook, and snock.

text-davinci-003:

Block
Lock
Dock
Clock
Rock

Instruction: Translate the sentence 'I have no mouth but I must scream' into Spanish.

Cerebras-GPT: No tengo la boca pero tengo que gritar.

Alpaca-LoRA: No tengo boca pero tengo que gritar.

Stanford Alpaca: No tengo boca, pero debo gritar.

text-davinci-003: No tengo boca pero debo gritar.

Name		Name	Last commit message	Last commit date
Latest commit History 135 Commits
.github/workflows		.github/workflows
templates		templates
utils		utils
.dockerignore		.dockerignore
.gitignore		.gitignore
DATA_LICENSE		DATA_LICENSE
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
alpaca_data.json		alpaca_data.json
alpaca_data_cleaned.json		alpaca_data_cleaned.json
docker-compose.yml		docker-compose.yml
finetune.py		finetune.py
generate.py		generate.py
lengths.ipynb		lengths.ipynb
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🦙🌲🤏 Cerebras-LoRA

Quickstart with 🤗Transformers:

Local Setup

Training (`finetune.py`)

Inference (`generate.py`)

Official weights

Docker Setup & Inference

Docker Compose Setup & Inference

Notes

Example outputs

About

Releases

Packages

Languages

License

bjoernpl/cerebras-lora

Folders and files

Latest commit

History

Repository files navigation

🦙🌲🤏 Cerebras-LoRA

Quickstart with 🤗Transformers:

Local Setup

Training (finetune.py)

Inference (generate.py)

Official weights

Docker Setup & Inference

Docker Compose Setup & Inference

Notes

Example outputs

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Training (`finetune.py`)

Inference (`generate.py`)

Packages