CaltechAUTHORS
  A Caltech Library Service

Occupancy maps of 208 chromatin-associated proteins in one human cell type

Partridge, E. Christopher and Chhetri, Surya B. and Prokop, Jeremy W. and Ramaker, Ryne C. and Jansen, Camden S. and Goh, Say-Tar and Mackiewicz, Mark and Newberry, Kimberly M. and Brandsmeier, Laurel A. and Meadows, Sarah K. and Messer, C. Luke and Hardigan, Andrew A. and Coppola, Candice J. and Dean, Emma C. and Jiang, Shan and Savic, Daniel and Mortazavi, Ali and Wold, Barbara J. and Myers, Richard M. and Mendenhall, Eric M. (2020) Occupancy maps of 208 chromatin-associated proteins in one human cell type. Nature, 583 (7818). pp. 720-728. ISSN 0028-0836. https://resolver.caltech.edu/CaltechAUTHORS:20181128-093527238

[img]
Preview
PDF - Published Version
Creative Commons Attribution.

12Mb
[img] PDF - Submitted Version
Creative Commons Attribution Non-commercial No Derivatives.

2642Kb
[img] PDF - Supplemental Material
Creative Commons Attribution.

2094Kb
[img]
Preview
PDF (Reporting Summary) - Supplemental Material
Creative Commons Attribution.

120Kb
[img] MS Excel (Supplementary Tables 1-6) - Supplemental Material
Creative Commons Attribution.

9Mb
[img] Image (JPEG) (Extended Data Fig. 1: CAP associations with annotated TSSs and IDEAS regions) - Supplemental Material
Creative Commons Attribution.

280Kb
[img] Image (JPEG) (Extended Data Fig. 2: CAP associations with varying CpG and GC content) - Supplemental Material
Creative Commons Attribution.

184Kb
[img] Image (JPEG) (Extended Data Fig. 3: Motif analysis) - Supplemental Material
Creative Commons Attribution.

120Kb
[img] Image (JPEG) (Extended Data Fig. 4: CAPs associated with FOX TFs and motifs) - Supplemental Material
Creative Commons Attribution.

208Kb
[img] Image (JPEG) (Extended Data Fig. 5: Read count correlations between CAPs) - Supplemental Material
Creative Commons Attribution.

366Kb
[img] Image (JPEG) (Extended Data Fig. 6: Motif and peak associations) - Supplemental Material
Creative Commons Attribution.

435Kb
[img] Image (JPEG) (Extended Data Fig. 7: Self-organizing maps) - Supplemental Material
Creative Commons Attribution.

511Kb
[img] Image (JPEG) (Extended Data Fig. 8: GATAD2A analyses) - Supplemental Material
Creative Commons Attribution.

253Kb
[img] Image (JPEG) (Extended Data Fig. 9: Extensive co-associations between CAPs) - Supplemental Material
Creative Commons Attribution.

175Kb
[img] Image (JPEG) (Extended Data Fig. 10: PIQ and SVM analyses in CAP co-associated regions) - Supplemental Material
Creative Commons Attribution.

189Kb
[img] Image (JPEG) (Extended Data Fig. 11: SVM and motif analyses in HOT sites) - Supplemental Material
Creative Commons Attribution.

234Kb

Use this Persistent URL to link to this item: https://resolver.caltech.edu/CaltechAUTHORS:20181128-093527238

Abstract

Transcription factors are DNA-binding proteins that have key roles in gene regulation. Genome-wide occupancy maps of transcriptional regulators are important for understanding gene regulation and its effects on diverse biological processes. However, only a minority of the more than 1,600 transcription factors encoded in the human genome has been assayed. Here we present, as part of the ENCODE (Encyclopedia of DNA Elements) project, data and analyses from chromatin immunoprecipitation followed by high-throughput sequencing (ChIP–seq) experiments using the human HepG2 cell line for 208 chromatin-associated proteins (CAPs). These comprise 171 transcription factors and 37 transcriptional cofactors and chromatin regulator proteins, and represent nearly one-quarter of CAPs expressed in HepG2 cells. The binding profiles of these CAPs form major groups associated predominantly with promoters or enhancers, or with both. We confirm and expand the current catalogue of DNA sequence motifs for transcription factors, and describe motifs that correspond to other transcription factors that are co-enriched with the primary ChIP target. For example, FOX family motifs are enriched in ChIP–seq peaks of 37 other CAPs. We show that motif content and occupancy patterns can distinguish between promoters and enhancers. This catalogue reveals high-occupancy target regions at which many CAPs associate, although each contains motifs for only a minority of the numerous associated transcription factors. These analyses provide a more complete overview of the gene regulatory networks that define this cell type, and demonstrate the usefulness of the large-scale production efforts of the ENCODE Consortium.


Item Type:Article
Related URLs:
URLURL TypeDescription
https://doi.org/10.1038/s41586-020-2023-4DOIArticle
https://rdcu.be/b53tJPublisherFree ReadCube access
https://doi.org/10.1101/464800DOIDiscussion Paper
https://www.addgene.org/crispr/taggingRelated ItemCETCh–seq reagents
https://github.com/chhetribsurya/PartridgeChhetri_etalRelated ItemCode
ORCID:
AuthorORCID
Ramaker, Ryne C.0000-0001-8666-4841
Mortazavi, Ali0000-0002-4259-6362
Wold, Barbara J.0000-0003-3235-8130
Alternate Title:Occupancy patterns of 208 DNA-associated proteins in a single human cell type
Additional Information:© 2020 The Author(s). This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. Received 04 October 2017; Accepted 09 January 2020; Published 29 July 2020. Research reported in this publication was supported by the National Human Genome Research Institute of the National Institutes of Health under Award Number U54HG006998 to R.M.M. and E.M.M. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. This work was also supported by funds from The HudsonAlpha Institute for Biotechnology. We thank R. Nguyen, D. Moore, and M. McEown for their technical efforts in this study; B. S. Roberts and G. M. Cooper for comments; HudsonAlpha’s Genomic Services Laboratory led by S. Levy for the high-throughput sequencing of much of the data used in this paper; and members of the ENCODE Consortium for public deposition of data generated by other Consortium groups. Data availability: Data sets generated from this study are available at the ENCODE portal or at the Gene Expression Omnibus under accession number GSE104247. CETCh–seq reagents are available at https://www.addgene.org/crispr/tagging/. Code availability: All code is available at https://github.com/chhetribsurya/PartridgeChhetri_etal. Author Contributions: These authors contributed equally: E. Christopher Partridge, Surya B. Chhetri, E.C.P., M.M., K.M.N., L.A.B., S.K.M., C.L.M., C.J.C., E.C.D., and D.S. developed the CETCh–seq method and performed ChIP–seq and CETCh–seq experiments and accompanying validations; S.B.C. performed peak calling and mapped TF binding sites; S.B.C. and E.C.P. performed motif analyses, gene expression analyses, IDEAS segmentation analyses, and co-association analyses; J.W.P. and S.B.C. performed GATAD2A analyses and experiments; M.M. performed immunoprecipitation–mass spectrometry analyses and managed the production of ChIP–seq and CETCh–seq experiments; C.S.J., S.J., and A.M. performed SOM analyses; S.B.C. and S.-T.G. performed conservation and co-association analyses; S.B.C., R.C.R., and A.A.H. performed LS-GKM SVM, random forest, PCA, and TF footprint analyses; E.C.P., S.B.C., B.J.W., R.M.M., and E.M.M. conceived and designed the study; R.M.M. and E.M.M. directed the study; E.C.P., S.B.C., and E.M.M. wrote the manuscript with assistance from all authors; and all authors read and approved the manuscript. The authors declare no competing interests.
Funders:
Funding AgencyGrant Number
NIHU54HG006998
HudsonAlpha Institute for BiotechnologyUNSPECIFIED
Subject Keywords:DNA; Epigenomics; Gene regulation; Transcriptomics
Issue or Number:7818
Record Number:CaltechAUTHORS:20181128-093527238
Persistent URL:https://resolver.caltech.edu/CaltechAUTHORS:20181128-093527238
Official Citation:Partridge, E.C., Chhetri, S.B., Prokop, J.W. et al. Occupancy maps of 208 chromatin-associated proteins in one human cell type. Nature 583, 720–728 (2020). https://doi.org/10.1038/s41586-020-2023-4
Usage Policy:No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code:91283
Collection:CaltechAUTHORS
Deposited By: George Porter
Deposited On:28 Nov 2018 19:18
Last Modified:03 Aug 2020 20:22

Repository Staff Only: item control page