Random Forest model reveals the interaction between N6-methyladenosine modifications and RNA-binding proteins.
Academic Article
Overview
abstract
RNA-binding proteins (RBPs) have critical roles in N6-methyladenosine (m6A) modification process. We designed a Random Forest (RF) model to systematically analyze the interaction among RBPs and m6A modifications by integrating the binding signals from hundreds of RBPs. Accurate prediction of m6A sites demonstrated significant connections between RBP bindings and m6A modifications. The relative importance of different RBPs from the model provided a quantitative metric to evaluate their interactions with m6A modifications. Redundancy analysis showed that several RBPs may have similar binding patterns with m6A sites. The RF model exhibited fairly high prediction accuracy across cell lines, suggesting a conservative RBP interaction network regulates m6A occupancy. Specific RBPs can engage to the corresponding regional m6A sites and deploy distinct regulatory processes, such as cleavage site selection of the alternative polyadenylation (APA). We also integrated histone modifications into our RF model, which demonstrated H3K36me3 and H3K27me3 as determining features for m6A distribution.