LLM API

This is a very simple web API to run various LLMs for inference. Bugs included.

Models

python api.py llama2chat --kwargs size=70b --gpus 0 1 2 3

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
.gitignore		.gitignore
README.md		README.md
api.py		api.py
models.py		models.py
requirements.txt		requirements.txt
test_models.py		test_models.py