GTcomplex

High-performance 3D search and alignment of macromolecular complexes
Fast, accurate, scalable – for proteins, RNAs, DNAs

GTcomplex is also available via a web service

Features

Graphics processing unit (GPU) version
CPU/multiprocessing version (to appear later)
Configurable GPU/CPU memory
Utilization of multiple GPUs
Tested on NVIDIA Pascal (GeForce GTX 1080MQ), Volta (V100), Ampere (A100), Ada Lovelace (GeForce RTX 4090), and Blackwell (GeForce RTX 5090) GPU architectures
Same executable for different architectures
Up to 4 orders of magnitude faster than US-align running on 64 cores
More sensitive and accurate than US-align
Correct TM-scores are guaranteed for produced superpositions
Correct RMSDs are guaranteed for produced alignments
Many options for speed-accuracy tradeoff
Support for PDB, PDBx/mmCIF, and gzip (thanks to zlib) formats
Reading (un)compressed structures from TAR archives
Allows searching within directories up to three levels deep
Clustering ability (GPU only)
Cross-platform/portable code

Available Platforms

GTcomplex was tested on and the binaries are provided for the following platforms:

Linux x64
Windows x64

Tested compilers include GCC versions 7.5.0, 8.3.0, and 11.4.0; LLVM/Clang version 10.0.0; and native MSVC compilers.

System requirements (GPU version)

CUDA-enabled GPU(s) with compute capability >=5 (released in 2014)
NVIDIA driver version >=418.87 (>=425.25 for Win64) and CUDA version >=10.1

System requirements (CPU/multiprocessing version)

GLIBC version >=2.16 (Linux)

Installation of pre-compiled binaries

Download or clone the repository:

git clone https://github.com/minmarg/gtcomplex.git

On Linux, run the shell script and follow the instructions:

Linux_installer_GPU/GTcomplex-linux64-installer-GPU.sh

On MS Windows, run the GPU-version installer:

MS_Windows_installer_GPU/GTcomplex-win64-installer.msi

Installation from source code

Installation on Linux

Software requirements

To build and install the GTcomplex software from the source code on Linux, these tools are required to be installed:

CMake version 3.10 or greater
GNU Make version 3.82 or greater
GNU GCC compiler version (7.5) or greater, or LLVM clang compiler version 10 or greater (or another C++ compiler that supports C++14)
the NVIDIA CUDA toolkit version 10.0 or greater (required for GPU version only)

Installation

Run the shell script for the GPU (Linux) version using GCC or LLVM/Clang compilers (takes several minutes to compile):

BUILD_and_INSTALL__GPU__unix.sh

BUILD_and_INSTALL__GPU__unix__Hopper.sh (Hopper architecture; e.g., H100)

BUILD_and_INSTALL__GPU__unix__clang.sh

Installation on MS Windows

Software requirements

To build and install GTcomplex from the source code on MS Windows, these tools are required to be installed:

CMake version 3.10 or greater (free software)
Visual C++ compiler, e.g., Visual Studio Community (free for open source projects; GTcomplex is an open source project)
the NVIDIA CUDA toolkit version 10.0 or greater (free software) (required for GPU version only)

Installation

Run the command (batch) file for the GPU version:

BUILD_and_INSTALL__GPU__win64.cmd

Getting started

Type gtcomplex for a description of the options.

Query structures and/or directories with queries are specified with the option --qrs. Reference structures (to align queries with) and/or their directories to be searched are specified with the option --rfs.

Note that GTcomplex reads .tar archives of compressed and uncompressed structures.

Here are some examples:

gtcomplex -v --qrs=str1.cif.gz --rfs=my_huge_structure_database.tar -o my_output_directory --speed=12 --sort=2

gtcomplex -v --qrs=struct1.pdb --rfs=struct2.pdb,struct3.pdb,struct4.pdb -o my_output_directory

gtcomplex -v --qrs=struct1.pdb,my_struct_directory --rfs=my_ref_directory -o my_output_directory

gtcomplex -v --qrs=str1.pdb.gz,str2.cif.gz --rfs=str3.cif.gz,str4.ent,my_ref_dir -s 0.3 -o mydir

Queries and references are processed in chunks. The maximum total length of queries in one chunk is controlled with the option --dev-queries-total-length-per-chunk. The maximum (minimum) length for a reference chain (as opposed to the total complex length) can be specified with the option --dev-max-length (--dev-min-length). Longer (shorter) chains will be skipped during a search.

The maximum number of query chains is controlled with the --dev-queries-total-length-per-chunk option. The default value is 100; it can be increased to 512. This option calculates the total length across all query chains. There are no constraints on the number of chains in the reference complex, only the available memory may limit the processing of extremely large reference complexes.

Alignment sorting

GTcomplex offers the --sort option to arrange alignment based on various criteria. Users can choose to sort alignments by TM-score, RMSD (root-mean-squared deviation), or the secondary TM-score, 2TM-score, which is calculated over the alignment while excluding unmatched helices. Consequently, the 2TM-score penalizes topological inconsistencies more than the TM-score.

All metrics (TM-scores, RMSDs, etc.) are calculated at both the complex and individual chain levels.

Clustering

The GPU version of GTcomplex allows for clustering (by complete or single linkage) of large datasets. For example,

gtcomplex -v --cls=my_huge_structure_database.tar -o my_output_directory

instructs GTcomplex to cluster the complexes stored in my_huge_structure_database.tar using the default parameters. To obtain the superimposed members of a cluster, run gtcomplex with the first member as the query and all other members as references, using the options --pre-score=0 -s 0 --referenced. This will produce transformation matrices to superimpose each reference complex onto the query.

Usage tips and recommendations

Optimizing performance for large datasets

Leverage fast searching for large data
Use fast searching (--speed=[10-16]) when processing very large datasets to significantly reduce runtime.
Enable cached data for faster disk access
Utilize the -c <cache_directory> option to cache data and speed up reading from disk when working with numerous query structures.

Fine-tuning output and memory management

Sort alignments by TM-score normalized by query length
Sort alignments by the query length-normalized TM-score (--sort=2) to prioritize structural similarities extending across larger portions of query structures.
Generate transformation matrices for reference structures
Use --referenced to generate transformation matrices for reference structures. This allows you to visually inspect all reference structures superimposed on a query in a graphical environment.
Optimize memory usage
Control the memory allocation for GTcomplex using the --dev-mem option. This allows for running multiple instances of GTcomplex simultaneously on a single GPU or CPU.

GTcomplex demo notebook on Google Colab

The GTcomplex_demo1 notebook demonstrates all-against-all alignment of queries from the Ref-2-100 and Viral-C (viral capsids) datasets, completing in approximately 25 seconds and 2 minutes, respectively.

Citation

If you use, reference, or benefit from the GTcomplex software or data, please cite:

Margelevicius, M. GTcomplex: Spatial indexing-powered search and alignment of macromolecular complexes. bioRxiv 2025.12.15.694356 (2025). https://doi.org/10.64898/2025.12.15.694356

@article {Margelevicius2025.12.15.694356,
  author = {Margelevicius, Mindaugas},
  title = {{GTcomplex}: Spatial indexing-powered search and alignment of macromolecular complexes},
  elocation-id = {2025.12.15.694356},
  year = {2025},
  doi = {10.64898/2025.12.15.694356},
  publisher = {Cold Spring Harbor Laboratory},
  URL = {https://www.biorxiv.org/content/early/2025/12/17/2025.12.15.694356},
  eprint = {https://www.biorxiv.org/content/early/2025/12/17/2025.12.15.694356.full.pdf},
  journal = {bioRxiv}
}

Reporting issues

Bug reports, comments, suggestions are welcome.

Contacts

For inquiries, please contact Mindaugas Margelevicius at [email protected].

License

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

Funding

This project was supported by an NVIDIA Academic Grant.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
Linux_installer_GPU		Linux_installer_GPU
MS_Windows_installer_GPU		MS_Windows_installer_GPU
imgs		imgs
out		out
src		src
AUTHORS		AUTHORS
BUILD_and_INSTALL__GPU__unix.sh		BUILD_and_INSTALL__GPU__unix.sh
BUILD_and_INSTALL__GPU__unix__Hopper.sh		BUILD_and_INSTALL__GPU__unix__Hopper.sh
BUILD_and_INSTALL__GPU__unix__clang.sh		BUILD_and_INSTALL__GPU__unix__clang.sh
BUILD_and_INSTALL__GPU__win64.cmd		BUILD_and_INSTALL__GPU__win64.cmd
CITATION.cff		CITATION.cff
ChangeLog		ChangeLog
GTcomplex_demo1.ipynb		GTcomplex_demo1.ipynb
LICENSE.md		LICENSE.md
README.md		README.md
VERSION		VERSION

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

GTcomplex

Features

Available Platforms

System requirements (GPU version)

System requirements (CPU/multiprocessing version)

Installation of pre-compiled binaries

Installation from source code

Installation on Linux

Software requirements

Installation

Installation on MS Windows

Software requirements

Installation

Getting started

Alignment sorting

Clustering

Usage tips and recommendations

Optimizing performance for large datasets

Fine-tuning output and memory management

GTcomplex demo notebook on Google Colab

Citation

Reporting issues

Contacts

License

Funding

About

Uh oh!

Releases 1

Packages

Languages

License

minmarg/gtcomplex

Folders and files

Latest commit

History

Repository files navigation

GTcomplex

Features

Available Platforms

System requirements (GPU version)

System requirements (CPU/multiprocessing version)

Installation of pre-compiled binaries

Installation from source code

Installation on Linux

Software requirements

Installation

Installation on MS Windows

Software requirements

Installation

Getting started

Alignment sorting

Clustering

Usage tips and recommendations

Optimizing performance for large datasets

Fine-tuning output and memory management

GTcomplex demo notebook on Google Colab

Citation

Reporting issues

Contacts

License

Funding

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages