Wednesday, 25 January 2017

Predicting Enhancer Activity and Variant Impact using gkm-SVM

ABSTRACT

We participated in the Critical Assessment of Genome Interpretation (CAGI) eQTL challenge to further test computational models of regulatory variant impact and their association with human disease. Our prediction model is based on a discriminative gapped-kmer SVM (gkm-SVM) trained on genome-wide chromatin accessibility data from the cell type of interest. Comparisons with Massively Parallel Reporter Assay (MPRA) data from lymphoblasts show that gkm-SVM is among the most accurate prediction models, even though all other models were trained on the MPRA data and gkm-SVM was not. We also compare against other MPRA datasets and show that gkm-SVM is a reliable predictor of expression, and that deltaSVM is a reliable predictor of variant impact, in K562 cells and mouse retina. Finally, we show that DHS (DNase-I Hypersensitive Sites) and ATAC-seq (Assay for Transposase-Accessible Chromatin using sequencing) data are equally predictive substrates for training gkm-SVM, and that DHS regions flanked by H3K27Ac and H3K4me1 marks are more predictive than DHS regions alone.
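For readers unfamiliar with the two quantities named in the abstract, the following is a minimal sketch of the general idea and not the authors' implementation. It assumes that a gapped k-mer is an l-mer in which only k positions are informative, and that a deltaSVM-style score is the difference between the summed sequence weights of the alternate and reference alleles over a window around the variant. The function names and the `TOY_WEIGHTS` table are hypothetical and the weights are made up; in practice they would come from a trained gkm-SVM.

```python
from itertools import combinations
from typing import Dict, List


def gapped_kmers(lmer: str, k: int) -> List[str]:
    """List the gapped k-mers of one l-mer: keep k positions, mask the rest with '-'."""
    out = []
    for keep in combinations(range(len(lmer)), k):
        out.append("".join(c if i in keep else "-" for i, c in enumerate(lmer)))
    return out


def sequence_score(seq: str, weights: Dict[str, float], k: int) -> float:
    """Sum the weights of all contiguous k-mers in seq (unknown k-mers score 0)."""
    return sum(weights.get(seq[i:i + k], 0.0) for i in range(len(seq) - k + 1))


def delta_score(ref: str, alt: str, weights: Dict[str, float], k: int) -> float:
    """deltaSVM-style score: alternate-allele score minus reference-allele score."""
    if len(ref) != len(alt):
        raise ValueError("ref and alt windows must have the same length")
    return sequence_score(alt, weights, k) - sequence_score(ref, weights, k)


# Hypothetical 3-mer weights, invented for illustration only.
TOY_WEIGHTS = {"ACG": 1.0, "CGT": 0.5, "ATG": -0.5}

if __name__ == "__main__":
    print(gapped_kmers("ACGT", 3))
    # Window of length 2k-1 centred on a G>T substitution, so every k-mer overlaps it.
    print(delta_score("ACGTA", "ACTTA", TOY_WEIGHTS, 3))
```

A negative score in this toy setting means the alternate allele removes k-mers that the weight table treats as favourable, which is the sense in which such a score can be read as predicted variant impact.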



