LLM Bench is a benchmarking tool for comparing the performance of various large language models (LLMs). It lets you evaluate different LLMs on a specific task or dataset, providing insight into their speed and accuracy. LLM Bench is inspired by PrivateGPT.

To run it you will need:
- Python 3.9 or higher
- PDM package manager
- Pretrained language models in the `models` directory (a quick sanity-check snippet follows this list)
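Before continuing, a quick check can confirm these prerequisites are in place. The snippet below is not part of LLM Bench; it only assumes the `models` directory named above:

```python
# Optional sanity check for the prerequisites above; not part of LLM Bench itself.
import shutil
import sys
from pathlib import Path

assert sys.version_info >= (3, 9), "Python 3.9 or higher is required"
assert shutil.which("pdm"), "PDM package manager not found on PATH"

models = Path("models")
assert models.is_dir() and any(models.iterdir()), "Put at least one pretrained model in models/"
```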
To get started:

- Clone this repository to your local machine.
- Rename `example.env` to `.env` and edit the variables appropriately.
- Install the required dependencies using PDM by running the following command in your terminal:
```
pdm sync
```
- Put any and all files you want to benchmark into the `source_documents` directory. The supported extensions are (see the loader sketch after this list):
  - `.csv`: CSV
  - `.docx`: Word Document
  - `.doc`: Word Document
  - `.enex`: EverNote
  - `.eml`: Email
  - `.epub`: EPub
  - `.html`: HTML File
  - `.md`: Markdown
  - `.msg`: Outlook Message
  - `.odt`: Open Document Text
  - `.pdf`: Portable Document Format (PDF)
  - `.pptx`: PowerPoint Document
  - `.ppt`: PowerPoint Document
  - `.txt`: Text file (UTF-8)
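For context on how these extensions are likely handled: ingestion tools in this family (such as PrivateGPT, which LLM Bench is inspired by) typically dispatch on the file extension via a loader mapping. The sketch below shows a representative subset using LangChain document loaders; the exact classes, arguments, and mapping in LLM Bench's `ingest.py` may differ:

```python
# A sketch of a PrivateGPT-style extension-to-loader mapping (representative
# subset); LLM Bench's actual ingest.py may use different loaders/arguments.
from langchain.document_loaders import (
    CSVLoader,
    EverNoteLoader,
    PDFMinerLoader,
    TextLoader,
    UnstructuredEmailLoader,
    UnstructuredEPubLoader,
    UnstructuredHTMLLoader,
    UnstructuredMarkdownLoader,
    UnstructuredODTLoader,
    UnstructuredPowerPointLoader,
    UnstructuredWordDocumentLoader,
)

LOADER_MAPPING = {
    ".csv": (CSVLoader, {}),
    ".doc": (UnstructuredWordDocumentLoader, {}),
    ".docx": (UnstructuredWordDocumentLoader, {}),
    ".eml": (UnstructuredEmailLoader, {}),
    ".enex": (EverNoteLoader, {}),
    ".epub": (UnstructuredEPubLoader, {}),
    ".html": (UnstructuredHTMLLoader, {}),
    ".md": (UnstructuredMarkdownLoader, {}),
    ".odt": (UnstructuredODTLoader, {}),
    ".pdf": (PDFMinerLoader, {}),
    ".ppt": (UnstructuredPowerPointLoader, {}),
    ".pptx": (UnstructuredPowerPointLoader, {}),
    ".txt": (TextLoader, {"encoding": "utf8"}),
}

def load_single_document(file_path: str):
    """Pick a loader by file extension and return the parsed documents."""
    ext = "." + file_path.rsplit(".", 1)[-1].lower()
    if ext not in LOADER_MAPPING:
        raise ValueError(f"Unsupported file extension: {ext}")
    loader_cls, loader_kwargs = LOADER_MAPPING[ext]
    return loader_cls(file_path, **loader_kwargs).load()
```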
- Run the ingestion script to index the documents by executing the following command in your terminal (a sketch of the ingestion flow follows):

```
python ingest.py
```
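Conceptually, the ingestion step loads the documents, splits them into chunks, embeds the chunks, and persists a vector index for later querying. The sketch below assumes a PrivateGPT-style pipeline (Chroma vector store, HuggingFace embeddings) and hypothetical `.env` variable names (`PERSIST_DIRECTORY`, `EMBEDDINGS_MODEL_NAME`); the real `ingest.py` may differ:

```python
# A rough, self-contained sketch of an ingestion pipeline; only .txt files are
# handled here for brevity (see the loader mapping sketch above for the rest).
import os
from dotenv import load_dotenv
from langchain.document_loaders import TextLoader
from langchain.embeddings import HuggingFaceEmbeddings
from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain.vectorstores import Chroma

load_dotenv()  # reads the .env file created during setup
persist_directory = os.environ.get("PERSIST_DIRECTORY", "db")  # assumed variable name
embeddings_model = os.environ.get("EMBEDDINGS_MODEL_NAME", "all-MiniLM-L6-v2")  # assumed

# Load every .txt document from source_documents.
documents = []
for name in os.listdir("source_documents"):
    if name.endswith(".txt"):
        path = os.path.join("source_documents", name)
        documents.extend(TextLoader(path, encoding="utf8").load())

# Split into overlapping chunks, embed them, and persist the index to disk.
chunks = RecursiveCharacterTextSplitter(chunk_size=500, chunk_overlap=50).split_documents(documents)
db = Chroma.from_documents(
    chunks,
    HuggingFaceEmbeddings(model_name=embeddings_model),
    persist_directory=persist_directory,
)
db.persist()
```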
- Run the main application to benchmark the LLMs by executing the following command in your terminal:

```
python main.py
```
- The application will start and display a Streamlit user interface in your browser.
- Select the desired LLM model type: use the sidebar radio button to choose a specific model type from the available options.
- Select a model: use the dropdown selector in the sidebar to choose a specific model for the selected model type.
- Enter a query: type your query in the provided text input field.
- Press the "Enter" or "Return" key to execute the query and initiate the benchmarking process.
- The application will display the question, answer, and relevant documents for the given query, along with the execution time for the selected LLM model.
- Repeat the process with different queries and models to compare their performance (a sketch of this UI flow follows the list).
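To make the steps above concrete, here is a minimal sketch of how such a Streamlit benchmarking UI might be wired up. The model catalogue and the `answer_query()` helper are hypothetical placeholders, not LLM Bench's actual API:

```python
# A minimal Streamlit sketch of the benchmark flow; model names below are
# hypothetical examples, and answer_query() stands in for the real QA logic.
import time
import streamlit as st

MODELS = {  # hypothetical catalogue of model types and models
    "GPT4All": ["ggml-gpt4all-j-v1.3-groovy"],
    "LlamaCpp": ["ggml-model-q4_0"],
}

model_type = st.sidebar.radio("Model type", list(MODELS.keys()))
model_name = st.sidebar.selectbox("Model", MODELS[model_type])
query = st.text_input("Enter a query:")

if query:  # runs when the user presses Enter/Return in the text input
    start = time.perf_counter()
    # answer, docs = answer_query(model_type, model_name, query)  # hypothetical helper
    answer, docs = "…", []  # placeholder so this sketch runs standalone
    elapsed = time.perf_counter() - start

    st.write(f"**Question:** {query}")
    st.write(f"**Answer:** {answer}")
    st.write(f"Execution time ({model_name}): {elapsed:.2f}s")
    for doc in docs:
        st.write(doc)
```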
Contributions are welcome! If you find any issues or have suggestions for improvement, please feel free to submit a pull request or open an issue in the repository.
This project is licensed under the MIT License.