TY - JOUR
T1 - Bimodal specificity of TF–DNA recognition in embryonic stem cells
AU - Povolotskii, Michael
AU - Yehezkehely, Maor
AU - Ram, Oren
AU - Lukatsky, David B.
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/5/8
Y1 - 2025/5/8
N2 - Transcription factors (TFs) bind genomic DNA regulating gene expression and developmental programs in embryonic stem cells (ESCs). Even though comprehensive genome-wide molecular maps for TF–DNA binding are experimentally available for key pluripotency-associated TFs, the understanding of molecular design principles responsible for TF–DNA recognition remains incomplete. Here, we show that binding preferences of key pluripotency TFs, such as Pou5f1 (Oct4), Smad1, Otx2, Srf, and Nanog, exhibit bimodality in the local GC-content distribution. Sequence-dependent binding specificity of these TFs is distributed across three major contributions. First, local GC-content is dominant in high-GC-content regions. Second, recognition of specific k-mers is predominant in low-GC-content regions. Third, short tandem repeats (STRs) are highly predictive in both low- and high-GC-content regions. In sharp contrast, the binding preferences of c-Myc are exclusively dominated by local GC-content and STRs in high-GC-content genomic regions. We demonstrate that the transition in the TF–DNA binding landscape upon ESC differentiation is regulated by the concentration of c-Myc, which forms a bivalent c-Myc-Max heterotetramer upon promoter binding, competing with key pluripotency factors such as Smad1. Finally, a direct interaction between c-Myc and key pluripotency factors is not required to achieve this transition.
AB - Transcription factors (TFs) bind genomic DNA regulating gene expression and developmental programs in embryonic stem cells (ESCs). Even though comprehensive genome-wide molecular maps for TF–DNA binding are experimentally available for key pluripotency-associated TFs, the understanding of molecular design principles responsible for TF–DNA recognition remains incomplete. Here, we show that binding preferences of key pluripotency TFs, such as Pou5f1 (Oct4), Smad1, Otx2, Srf, and Nanog, exhibit bimodality in the local GC-content distribution. Sequence-dependent binding specificity of these TFs is distributed across three major contributions. First, local GC-content is dominant in high-GC-content regions. Second, recognition of specific k-mers is predominant in low-GC-content regions. Third, short tandem repeats (STRs) are highly predictive in both low- and high-GC-content regions. In sharp contrast, the binding preferences of c-Myc are exclusively dominated by local GC-content and STRs in high-GC-content genomic regions. We demonstrate that the transition in the TF–DNA binding landscape upon ESC differentiation is regulated by the concentration of c-Myc, which forms a bivalent c-Myc-Max heterotetramer upon promoter binding, competing with key pluripotency factors such as Smad1. Finally, a direct interaction between c-Myc and key pluripotency factors is not required to achieve this transition.
UR - http://www.scopus.com/inward/record.url?scp=105004007150&partnerID=8YFLogxK
U2 - 10.1093/nar/gkaf333
DO - 10.1093/nar/gkaf333
M3 - Article
C2 - 40287827
AN - SCOPUS:105004007150
SN - 0305-1048
VL - 53
JO - Nucleic Acids Research
JF - Nucleic Acids Research
IS - 8
M1 - gkaf333
ER -