Ting-Yun Chang, Jesse Thomason, and Robin Jia
Paper: https://arxiv.org/abs/2406.13131
Blog: https://terarachang.github.io/projects/llm-decomp.html
```bash
export HF_TOKEN="YOUR TOKEN"
pip install -r requirements.txt
```
```bash
$ bash scripts/comp_rw.sh
```
- Implementation of model decomposition: `decompose.py`
- Implementation of component reweighting: `train_components.py` (see the sketch below)
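For intuition, here is a minimal PyTorch sketch of the reweighting step. It assumes the decomposition has already cached per-component logits whose sum over the component axis recovers the full model's logits; the tensor names, shapes, and hyperparameters below are illustrative, not the repo's actual interface.

```python
import torch
import torch.nn.functional as F

def train_component_weights(comp_logits, labels, epochs=100, lr=1e-2):
    """Learn one scalar weight per component from a few labeled examples.

    comp_logits: [num_examples, num_components, num_classes]
    """
    num_components = comp_logits.shape[1]
    # Initializing every weight to 1 recovers the original full-model logits.
    weights = torch.ones(num_components, requires_grad=True)
    optimizer = torch.optim.Adam([weights], lr=lr)
    for _ in range(epochs):
        optimizer.zero_grad()
        # Weighted sum over the component axis -> reweighted logits.
        logits = torch.einsum("c,nck->nk", weights, comp_logits)
        loss = F.cross_entropy(logits, labels)
        loss.backward()
        optimizer.step()
    return weights.detach()

# Illustrative usage with random tensors standing in for cached decompositions:
comp_logits = torch.randn(24, 1000, 4)  # 24 examples, 1000 components, 4 choices
labels = torch.randint(0, 4, (24,))
weights = train_component_weights(comp_logits, labels)
```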
```bash
$ bash scripts/standard.sh
```
```bash
$ bash scripts/calibration.sh
```
- Implementation of trainable calibration: `train_calib.py` (see the sketch below)
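As a rough illustration, one common parameterization of trainable calibration is a per-class affine transform of the full model's label logits, fit on a few labeled examples. This is a sketch of the general technique only; `train_calib.py` may parameterize it differently.

```python
import torch
import torch.nn.functional as F

def train_calibration(full_logits, labels, epochs=100, lr=1e-2):
    """full_logits: [num_examples, num_classes] label logits from the full model."""
    num_classes = full_logits.shape[1]
    scale = torch.ones(num_classes, requires_grad=True)   # per-class rescaling
    bias = torch.zeros(num_classes, requires_grad=True)   # per-class offset
    optimizer = torch.optim.Adam([scale, bias], lr=lr)
    for _ in range(epochs):
        optimizer.zero_grad()
        # Affine calibration of the logits, trained with cross-entropy.
        loss = F.cross_entropy(full_logits * scale + bias, labels)
        loss.backward()
        optimizer.step()
    return scale.detach(), bias.detach()
```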
- Our repo supports LLMs in the Llama and Mistral families
- To support new models, please add hooks to the model, following the naming convention of `my_modeling_llama.py`
- If the new model also uses RMSNorm, `decompose.py` is directly applicable. Otherwise, please take care of the layernorms, which can greatly influence model performance (see the sketch after this list)!
- *We do not fully adopt TransformerLens, to avoid numerical issues with Llama-3 and to reduce computation overhead
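On the layernorm point: RMSNorm and LayerNorm are nonlinear in the full residual stream, but once their normalization statistics are computed from the *full* hidden state and frozen, applying them to each component is a linear map, so the per-component logits still sum exactly to the full-model logits. Below is a minimal sketch of this idea; it is our illustration of the technique, not code from `decompose.py`.

```python
import torch

def apply_rmsnorm_per_component(components, gamma, eps=1e-5):
    """components: [num_components, d]; their sum is the full hidden state."""
    full = components.sum(dim=0)
    # One scale factor, computed from the full stream and shared by all parts,
    # so the normalized components still sum to RMSNorm(full).
    rms = torch.sqrt(full.pow(2).mean() + eps)
    return components / rms * gamma

def apply_layernorm_per_component(components, gamma, beta, eps=1e-5):
    """LayerNorm needs extra care: each component sheds its own mean, and the
    additive bias beta belongs to the full stream, not to any single part."""
    full = components.sum(dim=0)
    std = torch.sqrt(full.var(unbiased=False) + eps)
    centered = components - components.mean(dim=-1, keepdim=True)
    out = centered / std * gamma
    out[0] = out[0] + beta  # assign the shared bias to one slot (or track it separately)
    return out
```

With RMSNorm, only the shared scale is needed, which is why `decompose.py` carries over directly; LayerNorm additionally requires the centering and bias handling above, which is the part that needs care when porting to new models.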