PhosBoost Phosphosite Predictions tracks for IWGSC Chinese Spring wheat v2, barley Morex v3, and oat Sang
Phosphosites were predicted using PhosBoost (Poretsky et al., Plant Direct, 2023), a machine learning approach that combines gradient-boosted trees with pretrained protein language models to predict protein phosphorylation. A model trained on the complete protein phosphorylation data in the qPTMplants database was used to generate genome-wide phosphosite predictions in plants. Phosphosites were also inferred from qPTMplants phosphosites by sequence similarity, using DIAMOND pairwise sequence alignment. For each gene, phosphosites were predicted for one representative gene model.
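
The similarity-based inference step can be illustrated with a minimal sketch. This is not the PhosBoost or track-generation code. It assumes a gapped pairwise alignment is already available as two equal-length strings (however they were exported from the aligner), and that a known site is transferred only when the aligned query residue is S, T, or Y. The function name `transfer_sites` and all parameters are hypothetical.

```python
def transfer_sites(aligned_query, aligned_subject, subject_sites,
                   query_start=1, subject_start=1, residues="STY"):
    """Map known phosphosites from a subject protein onto a query protein.

    aligned_query / aligned_subject: gapped alignment strings ('-' = gap).
    subject_sites: 1-based positions of known phosphosites in the subject.
    query_start / subject_start: 1-based position of the first aligned residue.
    Returns a list of (query_position, query_residue) tuples.
    """
    if len(aligned_query) != len(aligned_subject):
        raise ValueError("aligned strings must have equal length")

    known = set(subject_sites)
    q_pos = query_start - 1    # last query residue consumed so far
    s_pos = subject_start - 1  # last subject residue consumed so far
    transferred = []

    for q_res, s_res in zip(aligned_query, aligned_subject):
        # Advance each coordinate only when its column holds a residue.
        if q_res != "-":
            q_pos += 1
        if s_res != "-":
            s_pos += 1
        # A column containing a gap cannot carry a site across.
        if q_res == "-" or s_res == "-":
            continue
        # Assumed rule: transfer only onto a phosphorylatable residue.
        if s_pos in known and q_res in residues:
            transferred.append((q_pos, q_res))

    return transferred


if __name__ == "__main__":
    # Toy alignment; subject has known sites at positions 3 and 6.
    print(transfer_sites("MAS-PTKY", "MASGPSK-", {3, 6}))
```

A real pipeline would likely also filter alignments by identity or coverage before transferring sites; those thresholds are not stated here and are left out of the sketch.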