Statistics, Department of

Department of Statistics: Faculty Publications

Practical Guidelines for the Comprehensive Analysis of ChIP-seq Data

Timothy Bailey, The University of Queensland
Pawel Krajewski, Institute of Plant Genetics
Istvan Ladunga, University of Nebraska at Lincoln
Celine Lefebvre, Cancer Institute Gustave Roussy
Qunhua Li, Penn State University
Tao Liu, University at Buffalo
Pedro Madrigal, Institute of Plant Genetics
Cenny Taslim, Ohio State University
Jie Zhang, Ohio State University

ORCID IDs

Istvan Ladunga

Document Type

Article

Date of this Version

11-13-2013

Citation

Bailey T, Krajewski P, Ladunga I, Lefebvre C, Li Q, et al. (2013) Practical Guidelines for the Comprehensive Analysis of ChIP-seq Data. PLoS Comput Biol 9(11): e1003326. doi:10.1371/journal.pcbi.1003326

Abstract

Mapping the chromosomal locations of transcription factors, nucleosomes, histone modifications, chromatin remodeling enzymes, chaperones, and polymerases is one of the key tasks of modern biology, as evidenced by the Encyclopedia of DNA Elements (ENCODE) Project. To this end, chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) is the standard methodology. Mapping such protein-DNA interactions in vivo using ChIP-seq presents multiple challenges not only in sample preparation and sequencing but also for computational analysis. Here, we present step-by-step guidelines for the computational analysis of ChIP-seq data. We address all the major steps in the analysis of ChIP-seq data: sequencing depth selection, quality checking, mapping, data normalization, assessment of reproducibility, peak calling, differential binding analysis, controlling the false discovery rate, peak annotation, visualization, and motif analysis. At each step in our guidelines we discuss some of the software tools most frequently used. We also highlight the challenges and problems associated with each step in ChIP-seq data analysis. We present a concise workflow for the analysis of ChIP-seq data in Figure 1 that complements and expands on the recommendations of the ENCODE and modENCODE projects. Each step in the workflow is described in detail in the following sections.

Included in

Other Statistics and Probability Commons
