Skip to content
This repository was archived by the owner on Nov 14, 2025. It is now read-only.

gpauloski/llm-pytorch

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

439 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

LLM Training Scripts

Docs Tests pre-commit.ci status

Tools and training scripts I have developed for building large language models in PyTorch.

This repository provides:

  • data preprocessing scripts,
  • training scripts, and
  • training guides.

This repository is the successor to my old training tools BERT-PyTorch as the old code had a lot of technical debt and was not well tested. Compared to the old repository, this codebase aims to have better code health and maintainability thanks to tests, type checking, linters, documentation, etc.

Install

See the Installation Guide.

Getting Started

See the available Guides.

About

Tools and training scripts for large language models

Topics

Resources

License

Code of conduct

Contributing

Security policy

Stars

Watchers

Forks

Releases

No releases published

Contributors

Languages

Generated from gpauloski/python-template