Version 2 2024-11-21, 07:44Version 2 2024-11-21, 07:44
Version 1 2024-11-21, 07:28Version 1 2024-11-21, 07:28
software
posted on 2024-11-21, 07:44authored byPhi Le, Leah UngLeah Ung, Hai Yang, Anwen Huang, Tao He, Peter Bruno, David Oh, Bridget Keenan, Li Zhang
Pep-TCRNet is a novel approach to constructing a prediction model that can evaluate the probability of recognition between a TCR and a peptide amino acid sequence while combining inputs such as TCR sequences, HLA types, and VJ genes.
Pep-TCRNet operates in two key steps:
Feature Engineering: This step processes different types of variables:
TCR and peptide amino acid sequencing data: The model incorporates neural network architectures inspired by language representation models and graph representation model to learn the meaningful embeddings.
Categorical data: Specialized encoding techniques are used to ensure optimal feature representation for HLA types and VJ genes.
Prediction Model: The second step involves training a prediction model to evaluate the likelihood of a TCR recognizing a specific peptide, based on the features generated in the first step.