Inferring transcription factor complexes from ChIP-seq

0 downloads 0 Views 1MB Size Report
Supplementary Figure 1 Calculation of motif displacements. Supplementary Figure 2 Parameters to binomial distribution. Supplementary Figure 3 Motivation for ...
Inferring transcription factor complexes from ChIP-seq data Tom Whitington, Martin C. Frith, James Johnson and Timothy L. Bailey

Supplementary figures and tables: Supplementary Figure 1

Calculation of motif displacements

Supplementary Figure 2

Parameters to binomial distribution

Supplementary Figure 3

Motivation for motif trimming

Supplementary Figure 4

Distribution of inter-motif distances

Supplementary Figure 5

Distribution of significant results count

Supplementary Figure 6

Distribution of number of significant intervals

Supplementary Table 1

Input datasets and motifs

Supplementary Table 2

Types of evidence

Supplementary Table 3

Table of all results

Supplementary Table 4

Motif database

1

Supplementary Figure 1. Calculation of motif distances. Schematic illustrating how motif displacement is calculated. Two hypothetical example sequences are shown, with primary motif hits highlighted in blue and secondary motif hits highlighted in green. If the secondary motif occurrence is located at the 3’ side of the primary motif occurrence, then the displacement is positive. In this case (as in the first example), the displacement is defined as d = g + 1, where g = the gap size between the motifs (in basepairs). If the secondary motif occurrence is at the 5’ side of the primary motif occurrence, then the displacement is negative. In this case (as in the second example), the displacement is defined as d = −1 × (g + 1).

!"#$%"&' ()*+'

,-.)/0%"&' ()*+'

CCGGAAGT

A

G

T

G

A

AC

TGA

C

G C

T G

1 2 3 4 5

1 2 3 4 5 6 7 8 9

A

ACCGGAAGTCTGTAATGAC! !"#$%&'()(*+,-,./,

,-.)/0%"&' ()*+'

TGA

C T

CCGGAAGT

A

G

G

1 2 3 4 5

G

A

T

AC

1 2 3 4 5 6 7 8 9

A

G C

!"#$%"&' ()*+'

ATGACCTGACCGGAAGT! !"#$%&'()(*+,-,01,

2

Supplementary Figure 2. Parameters to binomial distribution. Schematic explaining parameter q in the binomial distribution of number of sequences exhibiting displacement in a specified interval of size x. The maximum magnitude of the distance between the primary and secondary motifs is given by r. The schematic illustrates why p r = m−w − ws . Since the primary and secondary motif occurrences can occur on the same 2 strand or opposite strands, the fraction of sequences expected to exhibit displacements in a x given interval of width x at random is given by q = 4×r .

!"#

!$# &#

TGA

C T

1 2 3 4 5

G

CCGGAAGT

A

G

T

G

A

AC

1 2 3 4 5 6 7 8 9

A

G C

%#

3

'#

Supplementary Figure 3. Motivation for motif trimming. Schematic illustrating the motivation for trimming low information flanking columns from motifs. Some motifs, such as the two original untrimmed motifs shown in this example, consist of a set of columns with moderate or high information content (the “motif core”), with low information content columns occurring at either or both sides of the motif core. When the original, untrimmed motifs are considered, spacing enrichments cannot be detected involving short distances between the motif cores, as we do not allow motif overlap in our analysis. When low information flanking columns are trimmed from the motifs, spacing enrichments involving small inter-motif distances can be detected.

!"#$#%&'()*+,(-( A

T G

A

A

A G

1 2 3 4 5 6 7 8 9 10

T

A

CT T

A

A

T G

A

A

A G

GTA

1 2 3 4 5 6 7 8 9 10 1 2 3 4 5 6 7 8

C

GTA

1 2 3 4 5 6 7 8

CT

C

!"#$#%&'()*+,(.(

!"#$%&'()*+,-++&)./01)$/2+()3&4+,+$,#*5+) /"#0012()*+,(.(

/"#0012()*+,(-(

A

GTA

T G

A

1 2 3 4 5 6

T

A

1 2 3

CT

C

G

GTA

1 2 3 4 5 6

1 2 3

CT

C

T

A

A

T A

!"#$%&'()*+,-++&)./01)$/2+()#2+)&/-)4+,+$,#*5+)

4

Supplementary Figure 4. Distribution of inter-motif distances. We obtained the single spacing interval with the lowest p-value, from each of the 87 nonredundant spacing patterns that pass a p-value threshold of 0.01 (shown in Supplementary Table 3). We recorded the absolute value of the displacement value of each of those intervals, and made a histogram of these values. The vast majority of significant spacing enrichments have a distance of 1bp, with no gap between the primary and secondary motifs.

20 10 0

Count

30

40

Motif separataion distribution for all significant spacing results (N=87)

0

5

10 Inter−motif distance of strongest interval

5

15

20

Supplmenentary Figure 5. Distribution of significant results counts. Histogram of the number of non-redundant significant secondary motif results, for all 41 analyses.

0

5

Count

10

15

Distribution of number of significant results over all input ChIP−seq datasets (N=41)

0

2

4

6 Number of non−redundant secondary motif results at p < 0.01

6

8

10

12

Supplementary Figure 6. Distribution of number of significant intervals. Histogram of the number of significant intervals considering all 87 results significant at a p-value threshold of 0.01. For each significant secondary motif result, the maximum number of significant intervals is reported, considering same strand and opposite strand displacements.

30 20 10 0

Count

40

50

60

Distribution of number of significant intervals over all significant results (N=87)

0

2

4

6 Number of significant intervals

7

8

10

12

Supplementary Table 1. Input ChIP-seq datasets and primary motifs. Summary of input datasets, sorted first by Author, then by Assembly to resolve ties, then by Tissue, then by Factor. “TF” represents the factor targeted in the ChIP-seq experiment. “Assembly” represents the UCSC genome assembly for the ChIP-seq peak region sequences. “Tissue” represents the tissue in which the ChIP-seq was carried out. “Primary motif” represents the short name of the primary motif employed in the SpaMo analysis (full motif name and sequence logo is shown in Supplementary Table 4). “Number of ChIP-seq peaks” represents the number of peak regions declared by the authors, and used as input to SpaMo.

8

TF

Assembly

Tissue

Author

Primary motif

E2f1

mm8

ESC

[1]

J E2F1

20699

Esrrb

mm8

ESC

[1]

C Esrrb

21647

Klf4

mm8

ESC

[1]

U Klf7 i

10875

Nanog

mm8

ESC

[1]

C Nanog Loh2

10343

Oct4

mm8

ESC

[1]

C Oct4

3761

STAT3

mm8

ESC

[1]

C Stat3

2546

Smad1

mm8

ESC

[1]

U Smad3 i

1126

Sox2

mm8

ESC

[1]

C Oct4

4526

Tcfcp2l1

mm8

ESC

[1]

C Tcfcp2l1

26910

Zfx

mm8

ESC

[1]

C Zfx

10338

cMyc

mm8

ESC

[1]

J Mycn

3422

nMyc

mm8

ESC

[1]

J Mycn

7182

Gata1

mm9

G1EER4

[2]

U Gata6 i

14711

Jund

hg18

Gm12878

[3]

U Jundm2 ii

12958

Max

hg18

Gm12878

[3]

U Max i

6546

cFos

hg18

Gm12878

[3]

C NFYA

3284

cFos

hg18

Gm12878

[3]

U Jundm2 ii

3284

Tcf4

hg18

Hct116

[3]

U Tcf3 i

34522

Srebp1a

hg18

Hepg2

[3]

C Srebp

5461

Srebp2

hg18

Hepg2

[3]

C Srebp

3761

Jund

hg18

K562

[3]

U Jundm2 ii

1500

Max

hg18

K562

[3]

U Max i

10480

Nfe2

hg18

K562

[3]

C Nfe2

5071

cFos

hg18

K562

[3]

C NFYA

18963

cFos

hg18

K562

[3]

C Jundm2 ii

18963

cJun

hg18

K562

[3]

C Jundm2 ii

26920

cMyc

hg18

K562

[3]

J Mycn

15749

Gata1

hg18

K562b

[3]

U Gata6 i

5496

Znf263

hg18

K562b

[3]

C Znf263

27549

NRSF canonical

mm8

ESC

[4]

J REST

1698

NRSF canonical

mm8

NSC

[4]

J REST

692

STAT1

hg18

HeLaStim

[5]

C Stat3

41582

STAT1

hg18

HeLaUnstim

[5]

C Stat3

11004

FoxA2

mm8

Liver

[5]

U Foxa2 i

11475

STAT1

hg18

HeLaStim

[6]

C Stat3

36998

Srebp1

mm9

Liver

[7]

C Srebp

426

GABP

hg18

Jurkat

[8]

U Gabpa i

6442

NRSFmono

hg18

Jurkat

[8]

J REST

2596

NRSFpoly

hg18

Jurkat

[8]

J REST

2960

SRF

hg18

Jurkat

[8]

U Srf i

2429

Scl

mm8

HPC7

[9]

U Ascl2 i

199

9

Number of ChIP-seq peaks

Supplementary Table 2. Types of evidence. The class of evidence supporting the existence of a TF complex is shown, for each reference given in Tables 1, 2 or 3 of Results. “Reference” indicates the reference for the article as shown in the supplementary bibliography. “Type of evidence” indicates the experimental evidence supporting the interaction. “EMSA” indicates electrophoretic mobility shift assay. *: This paper supports an interaction between E2F family members and YY1. The authors do not find evidence for a direct E2F1/YY1 interaction. Therefore, the YY1 spacing enrichment observed in the E2F1 data is potentially due to an interaction between YY1 and other E2F TFs binding at the same locations as E2F1. Reference

PubMed ID

Type of evidence

[10]

14559893

X-ray crystallography

[11]

11846562

X-ray crystallography

[12]

9491997

EMSA

[13]

16272134

EMSA

[14]

9214632

EMSA

[15]

18026136

EMSA

[16]

15525604

EMSA with ortholog

[17]

14757430

Review paper

[18]

16007074

Luciferase assay

[19]

12411495

Y2H *

Immunoprecipitation,

10

Supplementary Table 3. Table of all results. All secondary motif results significant at a p-value threshold of 0.01 are reported. Results are grouped according to the input ChIP-seq dataset. In the first column, the assembly, TF, tissue and reference for the input ChIP-seq dataset is given. For “Primary motif” and “Secondary motif” columns, the seqlogos and summary names are provided. Same strand and opposite strand displacement histograms are shown. The x-axis of each histogram shows the motif displacement value. The y-axis shows the number of sequences that exhibited the given secondary-primary motif displacement value, and is scaled linearly with the origin corresponding to zero. The “Sig. Interval” specifies the displacement value and strand for the single most significant interval, with “Opp.” indicating opposite strand. The p-value of that interval is given. Secondary motifs that are highly similar to the primary motif (considering reverse complement) are highlighted in bold font. Factor

Primary PWM

Secondary PWM

   

Same Disps

Opp Disps

34

hg18 Gata1

GATAA

K562b [3]

U Gata6 i

A T

T

C

G

A C

T

C G

A T

CA TG C Ebox

150

34

0

150

28

hg18 Gata1

GATAA

K562b [3]

U Gata6 i

A T

T

C

G

A C

T

C G

A T

CTGG

T

G A

A TT

C

CAG

C A

T

T

T

G

A

T

G

C

150

J Hand1::Tcfe2a

hg18 Gata1

GATAA

K562b [3]

U Gata6 i

A T

T

G

A C

T

C G

A T

A

CA G

CC

CC

ACTGACGGG

G

AC T A

CG T

A

GT CG

A AT AGGA

C T

T

A

T

TG

150

U Myf6 ii

0

150

hg18 Gata1

GATAA

K562b [3]

U Gata6 i

A T

T

G

A C

T

C CAGCAGG

A

C

T

G

G

A T

A

TA

C

T

G

T

TG T

T

U Zic2 ii

150

0

150

hg18 Gata1

GATAA

K562b [3]

U Gata6 i

A T

T

G

A C

T

C G

A T

T C GGAA T GCT

C

G T A

A

A

T

G

T

G A

A

T G AGC C

C

C Stat3

150

0

150

T

AA T

G

hg18 Nfe2 K562 [3]

C TA

TG T AG C

AC

C

G

T CA

A

C

TC

C Nfe2

T

A

A

T

C

G

A

TGA TCA

G

TG

GA CT

C

C A

G

C

TC

A

U Jundm2 ii

11

C T

G

150

-9, Same 2.76e-16

150

0

150

-5, Same 7.02e-13

150

0

150

-7, Opp. 1.78e-04

150

0

150

-6, Opp. 3.59e-03

16

0

150

13

C

150

16

16

C

0

19

16

C

150

28

19

C

Sig. Interval

150

0

150

-2, Same 7.22e-03

13

0

150

150

0

150

+2, Opp. 1.08e-05

        8

GC T TG GA TGTAGT C

C

G

hg18 Srebp1a

T

A

A

T

A CG C T TA

CG

T

G

Hepg2 [3]

G

A

G

A

T

A AC

A

G

T

T

C

C

A

C

G

A

T

C

CGCGCG

CG T

C

C

T AAG

G

C Srebp

CGG

GA

G C

A

T

T CG

A

CT

C

A T GG A A

T G

C

A

150

U Rsc30

8

0

150

28

T

T

T

hg18 Tcf4 Hct116 [3]

T AAA

AC G

G AC

G

G

AAT

T

T

A

AGG

TCAC C

G

T

T

U Tcf3 i

TGA TCA G

A

C

G

TG

GA CT

C

C A

TC

A

C T

G

150

U Jundm2 ii

G T

G

AC

G A

AC

T

GT

hg18 cFos Gm12878 [3]

AT G C

C CCTCG

C

A

C

A

T

T

G

T CC A

CCAAT

AG C G

AG

A

G

C

T

C NFYA

A

T

G

C A

A

CT

150

J NFYA

0

150

G T

G

AC

G A

AC

T

GT

hg18 cFos Gm12878 [3]

A TG C

C CCTCG

C

A

C

A

T

T

G

T CC A

GG

C

TA

G T

C

C NFYA

CTGTCA

AC

A

GG C

AT

CA

C

A

A

A

TC

G

G G

150

U Pknox2

0

150

G T

G

AC

G A

AC

T

GT

hg18 cFos Gm12878 [3]

AT G C

C CCTCG

C

A

C

A

T

T

G

T CC A

GG G

A C TG C

A G

A TT

C NFYA

T

T

C

GA

A T

C

T

G

GT

C

G G C

TT

C C

A

C

G

C

150

U Sp4 ii

0

150

G T

G

AC

G A

AC

hg18 cFos Gm12878 [3]

AT G C

C CCTCG T

GT

C

A

C

A

T

T

G

T CC A

C NFYA

T

A G

A

A

C

T

T

GT A

TG C

C C

C

A

CACGTG

G C

T

TC

AC T

C

A

A

G T TG A

150

U Cbf1 b

0

150

A

TGA TCA G

G

TG

GA

C

AC

hg18 cFos K562 [3]

C

TC

C

A

T

G

U Jundm2 ii

A TG

GGAA

A

CC

A

A G T

T

AT

T

J SPIB

150

0

150

A

TGA TCA

G

TG

GA

C

AC

hg18 cFos K562 [3]

G

C

TC

C

A

U Jundm2 ii

T

G

A

TGA TCA

G

TG

GA CT

C

C A

G

C

TC

A

U Jundm2 ii

12

C T

G

150

0

150

-1, Same 3.12e-04

150

0

150

-17, Opp. 2.87e-23

150

0

150

+2, Same 4.80e-20

150

0

150

-4, Opp. 3.11e-06

150

0

150

+6, Opp. 2.02e-05

41

0

150

21

CT

150

10

41

CT

3.58e-08

18

10

T CC G T T GG

-1, Opp.

21

18

T CC G T T GG

150

34

21

T CC G T T GG

0

28

34

T CC G T T GG

150

150

0

150

1, Opp. 9.24e-14

21

0

150

150

0

150

-1, Same. 6.6e-06

  ! "$# % &(' ) *+

21

A

TGA TCA G

G

TG

GA CT

C

AC

hg18 cFos K562 [3]

C

TC

C

A

T

G

U Jundm2 ii

TGA

A

G

C T

T A

C

G

150

J NFE2L1::MafG

21

0

150

25

G T

G

AC

G A

AC

T

GT

hg18 cFos K562 [3]

AT G C

C CCTCG

T CC G T T GG

C

A

C

A

T

T

G

T CC A

T

A

GT

A

A

C

T

G

T

C NFYA

A

TG C

C

C

C

A

CACGTG

G C

T

TC

AC T

C

A

A

G T TG A

150

U Cbf1 b

G T

G

AC

G A

AC

T

GT

hg18 cFos K562 [3]

AT G C

C CCTCG

C

A

C

A

T

T

G

T CC A

CCAAT

AG C G

AG

A

G

C

T

C NFYA

A

T

G

C A

A

C

T

150

J NFYA

0

150

G T

G

AC

G

A

AC

T

GT

hg18 cFos K562 [3]

AT G C

C CCTCG

C

A

C

A

T

T

G

T CC A

CTGTCA

AC

A

GG

C

G T

GG TA

C

C

C NFYA

AT

CA

C

A

A

A

TC

G

G G

150

U Pknox2

0

150

G T

G

AC

G A

AC

T

GT

hg18 cFos K562 [3]

AT G C

C CCTCG

C

A

C

A

T

T

G

T CC A

GG G

A C TG C

A G

A TT

G G C

C

C NFYA

GA

A

T

T

C

T

G

GT

CT

TT

C C

A

C

G

C

150

U Sp4 ii

0

150

G T

G

AC

G A

AC

T

GT

hg18 cFos K562 [3]

AT G C ATGC AAT T

C CCTCG

C

A

C

A

T

T A

G

T CC A

C NFYA

C Octamer b

150

0

150

G T

G

AC

G A

AC

hg18 cFos K562 [3]

AT G C

C CCTCG T

GT

C

A

C

A

T

T

G

T CC A

C NFYA

A

TGACGTCA

GG T

C

C

C

A

T

T

TA

A

A

T

CC A T

G

G G

150

U Jundm2 i

0

150

A

TGA TCA

G

TG

GA

C

AC

hg18 cJun K562 [3]

G

C

TC

C

A

U Jundm2 ii

T

G

A TG

GGAA

A

CC

A

A G T

T

AT

J SPIB

13

T

150

0

150

+6, Opp. 8.62e-21

150

0

150

-17, Opp. 6.71e-20

150

0

150

+2, Same 6.55e-18

150

0

150

-4, Opp. 1.59e-06

150

0

150

-4, Opp. 3.69e-05

10

0

150

53

CT

150

7

10

T CC G T T GG

6.6e-03

24

7

T CC G T T GG

-5, Same.

24

24

T CC G T T GG

150

36

24

T CC G T T GG

0

25

36

T CC G T T GG

150

150

0

150

-9, Same 5.88e-04

53

0

150

150

0

150

+1, Opp. 3.49e-16

,./ 021 3 465 7 8:9 ;

27

A

TGA TCA G

G

TG

GA CT

C

AC

hg18 cJun K562 [3]

A

C

C

TC

A

T

U Jundm2 ii

TGA TCA G

C

G

TG

GA CT

G

C

C A

TC

A

C T

G

U Jundm2 ii

150

27

0

150

17

A

TGA TCA G

G

TG

GA CT

C

AC

hg18 cJun K562 [3]

C

C

TC

A

T

G

U Jundm2 ii

CACGTGA

T

C

G

A

CG

T

TA

G

A

C

T

T

150

U Bhlhb2 i

A

TGA TCA G

G

TG

GA

C

AC

hg18 cJun K562 [3]

C

C

TC

A

T

G

TCA

CG

G

AATGT CG

T

T

T

G C

A

T

A

C

C

150

U Jundm2 ii

J Pax2

0

150

A

TGA TCA G

G

TG

GA

C

AC

hg18 Jund Gm12878 [3]

C

C

TC

A

T

G

CGAAAC

A

TT T

C

C

G

A

C

U Jundm2 ii

A

T

T

G A

A

150

U Irf4 i

0

150

A

TGA TCA

G

TG

GA

C

AC

hg18 Jund Gm12878 [3]

G

C

C

TC

A

T

G

GGAA

A TG

A

CC

A

A G T

U Jundm2 ii

T

AT

T

J SPIB i

150

0

150

C A

TTAC

G

C

hg18 cMyc K562 [3]

J Mycn

J bZIP910

150

0

150

CCGGAAGT

A

G

TG

hg18 GABP Jurkat [8]

A

AC

T

A

CGCA

GA

CT

AG

U Gabpa i

T

AA T

GC T

C

T G

150

U Fhl1

0

150

CCGGAAGT

A

G

TG

hg18 GABP Jurkat [8]

A

AC

T

U Gabpa i

A

CGCCCCC

C A

C A T

T

A T

T

ATC

U Sp4 i

14

A

G

150

0

150

-1, Opp. 3.68e-05

150

0

150

-5, Opp. 2.78e-03

150

0

150

+1, Same. 2.02e-16

150

0

150

+1, Same. 3.76e-08

150

0

150

-1, Same 6.30e-05

39

0

150

47

A C

150

12

39

A C

5.78e-08

27

12

CACGTG TGACGT

-1, Same.

28

27

CT

150

29

28

CT

0

17

29

CT

150

150

0

150

+1, Same 7.95e-28

47

0

150

150

0

150

+1, Same 1.43e-21

= ? @BA C DFE G HJI K

18

CCGGAAGT

A

G

AC

T

TG

A C

hg18 GABP Jurkat [8]

A

A

TGACGT

C A

G

U Gabpa i

J bZIP910

150

18

0

150

33

CCGGAAGT

A

G

AC

T

TG

A C

hg18 GABP Jurkat [8]

A

A

U Gabpa i

GGAA

C

CA G A T

G A T

C

150

J Eip74EF

CCGGAAGT

A

G

AC

T

TG

hg18 GABP Jurkat [8]

A

A

CG

TC

G

A

CGGA

G

GT

C

TC

CG

T

A

T A CC

G

A

C

G

150

U Gabpa i

U Hal9

0

150

CCGGAAGT

A

G

AC

T

TG

hg18 GABP Jurkat [8]

A

A

GG TTCC T

G

CT TG A TCA

T

A

C

U Gabpa i

AAA T A

G

C

J REL

150

0

150

A T

TA AA T A T A

TT

G

C

G

G C

C

T

A

C

A

C T

T

U Srf i

G T

A

C G T

A

hg18 SRF Jurkat [8]

TCC

A

T C

G

G

150

J ETS1

0

150

T C GGAA

G T A

hg18 STAT1

T GCT

A

HeLaStim [5]

A

T

G

T

G A

A

T

G AGC C

C

C Stat3

CCAT

G A

T

T

J YY1

150

0

150

T C GGAA

G T A

hg18 STAT1

T GCT

A

HeLaStim [5]

A

T

G

T

G A

A

G AGC T

C

C

C Stat3

CACGTG

G A

TA

G C

T

J USF1

150

0

150

T C GGAA

G T A

hg18 STAT1 HeLaStim [5]

T GCT

A

A

T

G

T

G A

A

C Stat3

G AGC

TGA

A

T

G

C

C

C

T A

C T

G

J NFE2L1::MafG

15

150

0

150

+14, Same 2.54e-09

150

0

150

-1, Opp. 9.60e-04

150

0

150

+1, Opp. 1.16e-03

150

0

150

-7, Opp. 3.99e-04

150

0

150

+1, Opp. 9.70e-10

36

0

150

47

C

150

68

36

C

6.71e-15

12

68

C

+2, Opp.

19

12

CC TATA GG

150

24

19

A C

0

33

24

A C

150

150

0

150

-1, Same 2.85e-09

47

0

150

150

0

150

+1, Same 5.47e-09

LNM O PRQ S TVU W XZY [

26

T C GGAA

C

G T A

hg18 STAT1

T GCT

A

HeLaStim [5]

A

T

G

T

G A

A

G AGC T

C

C

A

A

G

CA A T T G

T C G

A

C Stat3

GATCTAC

G

C

C

G

T T

T

A

A

A

AG

C

T G

T

C

G

150

U Gat3

26

0

150

46

T C GGAA

C

G T A

hg18 STAT1

T GCT

A

HeLaStim [5]

A

T

G

T

G A

A

T

G AGC C

GAA

G T

ACT

G

TG

A

C Stat3

ATCA T

G

TG

G

T

C

C

T

C

C

G

A

A

U Hdx

150

T C GGAA

G T A

hg18 STAT1

T GCT

A

HeLaStim [5]

A

T

G

T

G A

A

T

G AGC C

C

C

ACGT A

T

CT AGT

AA

C Stat3

T G

J TGA1A

150

0

150

T C GGAA

G T A

hg18 STAT1

T GCT

A

HeLaStim [5]

A

T

G

T

G A

A

G AGC T

C

C

C Stat3

CC GG

G

C

TGA

C

T

CCG G

CA

T

A

G

GT

AC T

G

150

U Tcfap2e i

0

150

T C GGAA

G T A

hg18 STAT1

T GCT

A

HeLaStim [5]

A

T

G

T

G A

A

G AGC T

C

C

CCCCCGGGGGG

G

AA AGGTC T T T TC T A A A AT TT G

GT

A

AC A

C

T AC

CA

150

C Stat3

U Zic3 i

0

150

T C GGAA

G T A

hg18 STAT1

T GCT

A

HeLaStim [5]

A

T

G

T

G A

A

T

G AGC C

C

C Stat3

A

TGA TCA G

C

G

TG

GA CT

C

C A

TC

A

C T

G

150

U Jundm2 ii

0

150

T C GGAA

G T A

hg18 STAT1

T GCT

A

HeLaStim [5]

A

T

G

T

G A

A

T

G AGC C

C

CC CGCCC C

A

G

AA T

C

T

TG

C Stat3

A

C

A

T

TA

T

150

U Klf7 i

0

150

T C GGAA

G T A

hg18 STAT1 HeLaStim [5]

T GCT

A

A

T

G

T

G A

A

C Stat3

G AGC T

C

C

C AG GGC

CA

C

GT G

G T AA AG T

C

T

A

C

AT

GG AT

T

A

AC

T GGC

CT AT AG

C CTCF

16

T

C

TG A

150

0

150

+7, Opp. 4.20e-07

150

0

150

-1, Same 6.91e-07

150

0

150

-1, Opp. 9.60e-06

150

0

150

-1, Opp. 1.05e-05

150

0

150

-2, Same 2.03e-05

47

0

150

43

C

150

36

47

C

1.83e-08

44

36

C

+2, Opp.

48

44

C

150

38

48

C

0

46

38

C

150

150

0

150

+1, Opp. 2.32e-05

43

0

150

150

0

150

-3, Opp. 8.20e-05

\^] _ `ba c dfe g hji k

98

T C GGAA

C

G T A

hg18 STAT1

T GCT

A

HeLaStim [6]

A

T

G

T

G A

A

T

G AGC C

CCAT

G A

T

T

C

C Stat3

J YY1

150

98

0

150

35

T C GGAA

C

G T A

hg18 STAT1

T GCT

A

HeLaStim [6]

A

T

G

T

G A

A

G AGC T

C

C

C Stat3

CACGTGA

T

C

G

A

CG

T

TA

G

A

C

T

T

150

U Bhlhb2 i

T C GGAA

G T A

hg18 STAT1

T GCT

A

HeLaStim [6]

A

T

G

T

G A

A

G AGC

TGA

A

T

G

C

C

C

C Stat3

C T

T A

G

150

J NFE2L1::MafG

0

150

T C GGAA

G T A

hg18 STAT1

T GCT

A

HeLaStim [6]

A

T

G

T

G A

A

T

G AGC C

GAA

G T

ACT

G

TG

A

C Stat3

ATCA T

G

TG

T

G

T

C

C

C

C

G

A

A

U Hdx

150

0

150

T C GGAA

G T A

hg18 STAT1

T GCT

A

HeLaStim [6]

A

T

G

T

G A

A

T C GGAA

T

C

C

T A

G AGC

C

T GCT

G

C Stat3

A

A

T

T

G AGC

G

T

G A

A

C

C

C Stat3

150

0

150

T C GGAA

G T A

hg18 STAT1

T GCT

A

HeLaStim [6]

A

T

G

T

G A

A

G AGC T

C

C

A

TGACGTCA

G

CC

C Stat3

T

T

TA

A

A

T

G

CCA A

GG

T

T

C

150

U Atf1 i

0

150

T C GGAA

G T A

hg18 STAT1

T GCT

A

HeLaStim [6]

A

T

G

T

G A

A

T G AGC C

C

C Stat3

A

TGA TCA G

C

G

TG

GA CT

C

C A

TC

A

C T

G

150

U Jundm2 ii

0

150

T C GGAA

G T A

hg18 STAT1 HeLaStim [6]

T GCT

A

A

T

G

T

G A

A

C Stat3

T

G AGC C

C

T AA T T

G T

A

A

GGAA

C A

C

G CC

C

C

A

TC AA TC C

C

G

GT T

A

G T

C G

150

U Sfpi1 ii

17

0

150

-1, Opp. 2.35e-16

150

0

150

+1, Same 1.03e-14

150

0

150

+7, Opp. 7.89e-12

150

0

150

+10, Same 9.44e-08

150

0

150

+1, Same 1.23e-07

31

0

150

35

C

150

21

31

C

1.52e-29

52

21

C

+1, Opp.

53

52

C

150

54

53

C

0

35

54

C

150

150

0

150

+2, Opp. 8.51e-05

35

0

150

150

0

150

-8, Opp. 1.55e-04

lnm o prq s tvu w xzy {

32

T C GGAA T GCT

C

G T A

hg18 STAT1

A

A

HeLaStim [6]

T

G AGC T

G

T

G A

A

C

C

C

ACGT A

T

CT AGT

AA

C Stat3

T G

J TGA1A

150

32

0

150

33

T C GGAA T GCT

C

G T A

hg18 STAT1

A

A

HeLaStim [6]

T

T

G AGC

G

T

G A

A

C

C

TC AGGAA

TT

G

C

TCA AC A AG G

C Stat3

T

T T TC T T T AA A C

C

G

G

C

T

C

150

U Bcl6b i

T C GGAA T GCT

G T A

hg18 STAT1

A

A

HeLaStim [6]

T

T

G AGC

G

T

G A

A

C

C

ATTCA

C

T

AG TTCAG C

C

C

G G

G

G

CT A

G A

TA

150

C Stat3

U Sox17 ii

0

150

T C GGAA T GCT

G T A

hg18 STAT1

A

A

HeLaUnstim [5]

T

T

G AGC

G

T

G A

A

C

C

GAT GAAT A

T GG

C

T CT

G

T

T

G

G

G

A C

A

A

150

C Stat3

J En1

0

150

T C GGAA T GCT

G T A

hg18 STAT1

A

A

HeLaUnstim [5]

T

G AGC T

G

T

G A

A

C

C

A

A

G

CA A T T G

T G C

A

GATCTAC

G

C

C

G

T T

A

T

A

C Stat3

U Gat3

Logo

Logo

A

C

AG

T G

T

C

G

150

0

150

TTT GCGC CCAT TTT GCGC GCGCGC G

ESC [1]

CC

G

Position

ESC [1]

A T

Position

CEQLOGO 03.12.09 11:44

J E2F1

J YY1

Logo

Logo

G

mm8 E2f1

T

G

CEQLOGO 03.12.09 11:44

CC

G

Position

G A

C

ATATATG AT

C

G TG CA T C AT A T

J E2F1

A G T T G A

T A

G

C

C G

C

Position

CEQLOGO 03.12.09 11:44

CEQLOGO 03.12.09 11:44

U Zfp161 i

150

0

150

0

150

AGGTCA

A

G

G A

mm8 Esrrb ESC [1]

GGG

C Esrrb

T

AGGTCA

A

C

G

G A

GGG T

C Esrrb

18

150

0

150

+11, Opp. 1.75e-04

150

0

150

+1, Opp. 6.45e-03

150

0

150

+3, Same 2.38e-03

150

0

150

+2, Opp. 3.76e-03

150

0

150

-1, Same 9.39e-04

24

0

150

101

C

150

27

24

150

1.63e-04

10

27

mm8 E2f1

-1, Same

18

10

C

150

31

18

C

0

33

31

C

150

150

0

150

+1, Opp. 5.68e-03

101

0

150

150

0

150

-4, Same 4.23e-56

|~}  €‚ ƒ „†… ‡ ˆŠ‰ ‹

47

AGGTCA

A

C

G

G A

mm8 Esrrb ESC [1]

GGG T

C Esrrb

CC G

T

GC

G

A

GT T

T AA

T

C

G

AT

TA

G

TC

G

T

G

C

C

A

A

C

150

U Tcfap2a ii

47

0

150

45

AGGTCA

A

C

G

G A

mm8 Esrrb ESC [1]

GGG T

CCCC

CC

G

T

T

T T ATAGG G T A TAG A AG

A

A

G

G

T

C

150

C Esrrb

U Bcl6b ii

CC CGCCC C

A

G

AA T

T

TG

mm8 Klf4 ESC [1]

A

C

A

T

TA

T

U Klf7 i

CCCCC

C

CA

C

A

G AG TT

A

A

A

T

AC T

T

150

U Zfp740 i

0

150

CC CGCCC C

A

G

AA T

T

TG

mm8 Klf4 ESC [1]

A

T

TA

T

CC GG

G

C

A

U Klf7 i

A T

T

A

G

C

TGA

C

C T

C

G

TCCGA GT C

A

GT

AC T

G

150

U Tcfap2c i

0

150

CC CGCCC C

A

G

AA T

T

TG

mm8 Klf4 ESC [1]

A

C

A

T

TA

T

U Klf7 i

GGGGA AT

G

T

J MZF1 1-4

150

0

150

CC CGCCC C

A

G

AA T

T

TG

mm8 Klf4 ESC [1]

A

C

A

T

TA

T

CCC GGGG

CC

CA A A T

TT

G

GG

T TC AGGT CT T A T CA AA T

G

GT

G G

A

AT

C

A

C

A

C

C

A

150

U Klf7 i

U Zic2 i

0

150

CC CGCCC C

A

G

AA T

T

TG

mm8 Klf4 ESC [1]

A

C

A

T

TA

T

A

T G

T

T

C

U Klf7 i

C

G

G

GGAA AA GA

AA AA AG AGGG

GG

CC TC TC

G T

C

T

C

T

C Klf4 a

150

0

150

mm8 NRSFcanon

C T GT CC G T C T G

ESC JohnsonRory

J REST

GG T TC AC

A

A

CG T

CA

G

T

AA

G

T

T

C

G

AC

G A

T CG C TC A

G A

T

C

A

GCGCGC

G

TAC

CG TA

A

C

G

T

T GC

A

C A

G

U E2F2 i

19

T

G

150

0

150

-7, Opp. 2.11e-03

150

0

150

+1, Same 4.35e-07

150

0

150

+1, Opp. 3.62e-05

150

0

150

+1, Opp. 7.28e-04

150

0

150

+1, Same 1.05e-03

34

0

150

8

CT

150

29

34

C

1.02e-04

35

29

C

-1, Opp.

32

35

C

150

36

32

C

0

45

36

C

150

150

0

150

+3, Opp. 9.91e-03

8

0

150

150

0

150

+1, Same 7.89e-03

ŒŽ  ’‘ “ ”–• — ˜š™ ›

12

A T

T

C

AA T

GA

mm8 Nanog ESC [1]

CA T T C

GC

G

GA

CC

TC

A G A

G A

G T

C Nanog Loh2

ACA

A AACA C T

TC

G

G TAG C

C

T

C

TT

AG

G

G

C

C G

U Foxk1 ii

150

12

0

150

16

C

CA

T

TT ATGCAAA GT

T TCGCA

A

G

G

C

AA

T

G

T GT T

CA

G

C

TG

ACAAT

A

T

GC

C

C

C

C

G

G

G

C C G A TT

C

mm8 Sox2 ESC [1]

A

T TG G

A

G

A

C Oct4

U Sry ii

150

T

ESC [1]

C

A

G

AC TCGCGA T TCT

AT

mm8 Tcfcp2l1

T

G

C

C A

AAT

C

AG T

T

GA A

A

TG

C Tcfcp2l1

C T

T

G

C

A

G

ACTCGCGA T TC T C

AT

A

AA T

C

AG T

T

GA A

A

T

G

C Tcfcp2l1

150

0

150

T

ESC [1]

C

A

G

AC TCGCGA T TCT

AT

mm8 Tcfcp2l1

T

G

C

C A

AAT

C

AG T

T

GA A

A

TG

C Tcfcp2l1

CTGG

T

G A

A TT

C

CAG

C

TA T

T

G

A

T

G

C

150

J Hand1::Tcfe2a

0

150

T

ESC [1]

C

A

G

AC TCGCGA T TCT

AT

mm8 Tcfcp2l1

T

G

C

C A

AAT

AG T

T

GA A

C A

TG

GGAA

A TG

A

CC

A

A G T

C Tcfcp2l1

T

AT

T

J SPIB

150

0

150

G

C T

mm8 Tcfcp2l1 ESC [1]

T

C

A

G

AC TCGCGA T TCT

AT

C A

AAT

C

AG T

T

GA A

A

TG

G A

GATTA

G

C

C A

C G

C

A

C

C Tcfcp2l1

GGGT

T

T

T ACG

C

G

C

A

T

C

U Tcf1 ii

150

0

150

T

ESC [1]

C

A

G

AC TCGCGA T TCT

AT

mm8 Tcfcp2l1

T

G

C

C A

AAT

C

AG T

T

GA A

A

TG

C Tcfcp2l1

CAGCTG

CAG T

A

G

G C

C

A

A

C

CG

T

TA

T

G

A

A

U Ascl2 i

150

0

150

T

ESC [1]

C

A

G

AC TCGCGA T TCT

AT

mm8 Tcfcp2l1

T

G

C

C A

AAT

AG T

T

GA A

C Tcfcp2l1

C A

TG

GT

CTA TA

A T

T

G C

G

G

A

G

U Hmbox1

20

A

T

T

G

G

C

150

0

150

-5, Same 1.33e-04

150

0

150

+5, Opp. 9.24e-14

150

0

150

-5, Opp. 1.02e-12

150

0

150

-1, Same 9.55e-09

150

0

150

-3, Same 3.16e-06

52

0

150

25

CGC CG

150

33

52

CGC CG

4.75e-03

58

33

CGC CG

-9, Opp.

70

58

CGC CG

150

72

70

CGC CG

0

16

72

CGC CG CGC CG

150

150

0

150

+2, Opp. 4.53e-06

25

0

150

150

0

150

-3, Same 7.07e-06

œž Ÿ  ¢¡ £ ¤¦¥ § ¨ª© «

23

CGC CG T

ESC [1]

C

A

G

AC TCGCGA T TCT

AT

mm8 Tcfcp2l1

T

G

C

C A

AAT

AG T

T

GA A

C A

TG

GATCTA

TA

CG G

C Tcfcp2l1

A

TA

T TC

CG

CA A T

T

G CA

G

G C

150

U Gat4

23

0

150

44

CGC CG G

C T

mm8 Tcfcp2l1 ESC [1]

T

C

A

G

AC TCGCGA T TCT

AT

C A

AAT

C

AG T

T

GA A

A

TG

AAC G

GT

A

TCAT

CT

G

C

A

C Tcfcp2l1

J ovo

150

T

ESC [1]

C

A

G

AC TCGCGA T TCT

AT

mm8 Tcfcp2l1

T

G

C

C A

AAT

C

AG T

T

GA A

A

TG

C Tcfcp2l1

ACTG

C

A

G

T AGA AT TCG CA A A T TG

G C

T

G

CC T T

CT TG

C

C

A

CTG A A

G

G

C

150

U Zbtb3 ii

0

150

T

ESC [1]

C

A

G

AC TCGCGA T TCT

AT

mm8 Tcfcp2l1

T

G

C

C A

AAT

C

AG T

T

GA A

A

TG

CGCCC

TA

A T

AC

G G

C Tcfcp2l1

T

T

A T

U Klf7 ii

150

0

150

T

ESC [1]

C

A

G

AC TCGCGA T TCT

AT

mm8 Tcfcp2l1

T

G

C

C A

AAT

AG T

T

GA A

C A

TG

AACTG C

CC T T

A

G A

C Tcfcp2l1

C

A

A

T TG

C

A

A

AT

150

U Myb ii

0

150

G

C T

mm8 Tcfcp2l1 ESC [1]

T

C

A

G

AC TCGCGA T TCT

AT

C A

AAT

C

AG T

T

GA A

A

TG

A

A

GCT T

G

T

TCCGG

G

T TC

C

C Tcfcp2l1

T

AA

TC

A

AG

G

C

150

U Spdef i

0

150

TTAC C

mm8 nMyc ESC [1]

CGCGCG

CG T

C

C

T AAG

GA

G C

A

G

J Mycn

T

CGG

CT

A

T CG

C

A T GG A A

C

T G A

150

U Rsc30

0

150

mm9 Gata1

GATAA

G1EER4 [2]

U Gata6 i

A T

T

G

A C

T

C G

A T

CAGCTG

CAG T

A

G

G C C

A

A

CG

T

TA

A

U Ascl2 i

21

C T

G A

150

0

150

+1, Same 3.17e-05

150

0

150

-5, Same 2.22e-04

150

0

150

-5, Opp. 1.15e-03

150

0

150

-3, Opp. 4.85e-03

150

0

150

-3, Opp. 5.37e-03

21

0

150

35

C

150

29

21

CACGTG

1.57e-05

29

29

CGC CG

-2, Opp.

26

29

CGC CG

150

49

26

CGC CG

0

44

49

CGC CG

150

150

0

150

-1, Opp. 3.85e-03

35

0

150

150

0

150

-8, Same 1.32e-10

¬®­ ¯

34

mm9 Gata1

GATAA

G1EER4 [2]

U Gata6 i

A T

C

T

G

A C

T

C G

A T

CTGG

T

G A

A TT

C

CAG

C

TA T

T

G

A

T

G

C

150

J Hand1::Tcfe2a

34

0

150

24

mm9 Gata1

GATAA

G1EER4 [2]

U Gata6 i

A T

C

T

G

A C

T

C CAGCAGG

A

C

T

G

G

A T

A

A

G

G

TA

TC

TGT T

U Zic1 ii

22

T

150

150

0

150

-5, Same 4.73e-08

24

0

150

150

0

150

-3, Opp. 5.78e-04

Supplementary Table 4. Motif database. Summary name, full name, and Sequence logos for all 645 motifs employed in our anlayses. “Summary Name” indicates a short name for the motif employed in the main article. “J” indicates the motif was derived from the JASPAR [20] database. “U” indicates the motif was derived from the Uniprobe [21] database. “C” indicates the motif is a custom-generated motif. Custom generated motifs were created by running MEME [22] on publicly available ChIP-seq datasets (“C CTCF”, “C Esrrb”, “C Klf4”, “C Nfe2”, “C Oct4”, “C Srebp”, “C Stat3”, “C Tcfcp2l1”, “C Zfx”, and “C NFYA”), or by converting consensus sequences taken from the literature into position weight matrices using the iupac2meme tool available in the MEME Suite of tools (http://meme.nbcr.net). Summary Name

Full Name

Seqlogo

C AG GGC

CA

C

GT G

G T AA AG T

C

C CTCF

CTCF

C E-Box

E-Box From JASPAR

T

A

C

AT

GG AT

A

A

T CT

AC

T GGC

T AG

T

C

TG A

CAGGTG GG CA TG AAGGTCA GAT GGGAAT T A

C ETS

ETS factors like fli1

C Ebox

Ebox

C Esrrb

Esrrb

C Gata1

Gata1 Merika et al

C

G

G A

T A

C Ikaros

Ikaros

23

GGG T

CC C CCC G GGG G G AAC G TGCTGA TCA TGCTGA TCA CA T T C CA T T C C ATTA TGCT AGTCA A

A

C KLF-A

KLF-A

T A

G C

C

A

G

C

G

GGAA AA GA

AA A A AG A GGG

GG

T G

CC TC TC

T

T

C

C Klf4 a

KLF4 Shields and Yang 1998

Klf4

G T

T

C

C

T

C GG T T

A

G TA

C Klf4 b

C

CC

T A T TG

T

T

C

C

G

A

A

A

T C C Myb

C NF-E2 a

C NF-E2 b

Myb

NF-E2 Consensus

NF-E2 or AP-1

T C

G C

T C

T C

G C

T C

A

GC

GA

GA

C A

CC T

C Nanog Loh1

Nanog Loh et al

T

A

CC T

C Nanog Loh2

Nanog Loh

T

G A

A G A

G A

A G A

GC

GA

GA

TC

T

GA

C A

G T

TC

T

GA

G T

GGG G CAC C

C Nanog Mitsui

Nanog-Mitsui et al

G

T

AA T

G

C

C Nfe2

Nfe2

24

AC

C

G C

T CA

A

C

T

TC T

A

A

C

G

TTTT AAAA C Oct-Sox

Oct3 Sox2

C

T A

TT ATGCAAA

T TCGCA

A

C Oct4

T A

GT

CA

T

G

GCAT ACA G

G

C

AA

T

G

T GT T

CA

TG

T

G

C

C

C

G

A

Oct4

TT T

AT CGAT AT GCATAAT C

G

C

C

A

C Octamer a

Extended Octamer Loh et al

C

T

G

T

G

GGA

A T

G

C

C

TG

ATGC AAT G T G GA T G A G T T C GG A T A

C Octamer b

Octamer

CTC C

G

T

A

C Srebp

Srebp

A T

A CG C T TA

CG

T

G

G

C Stat3

Stat3

T

T

A AC

C

A

C

G

A

T

Tcfcp2l1

C G T

T G AGC

A

C

C

G

C A

C

A

AAT

G

A

C

AG T

T

GGCCT

A

Zfx

T

AC TCGCGA T TCT GA A

A

TG

GG

G

G CTC

G

C A

T

A TA

C A

G AG AGA G A

GA GT G G G C

C

A T T C AC A A

C Znf263

A

C

G A

C

T AG A

TC

G T

NFYA

25

G

AC

AT G C

C CCTCG G A

AC

T

GT

C

A

C

A

T

G

G

C

Znf263

T CC G T T GG

C NFYA

C

A

T

G

T

G A

T

A

G

T

T

C

CGC CG AT

C Zfx

A

A

C

C Tcfcp2l1

G

A

T GCT

C T A

G

A

T

T

G

T CC A

CC A G A

A

TT

G

J AG

MA0005 Agamous MADS

CCA

T

C

MA0001 AGL3 MADS

T

T

G

AT T

G

AATATA

G

TA

G G C

G

AG

J AGL3

TA

T TA T

T

G

T

T C G C A

AGAT

G

C

C

C

J ARR10

MA0121 ARR10 TRP-CLUSTER

T

A

TA AT

T

C

MA0007 Ar NUCLEAR

J Arnt

MA0004 Arnt bHLH

C

G

T

A

C

T

A GA

GT A

GG

C

J Ar

TT

A

AT ATT G A C GT C CACGTG GCGTG CA

G

MA0110 ATHB5 HOMEO-ZIP

T

A

G

CG

J ATHB-5

T

C

C

C

CC TT

C

T

TC CT A

A

T

G CC AA

G

A

A T

C

C

T

G CA A

G

G

A T

A

G

T

C

TTT

A

G

J Arnt::Ahr

MA0006 Arnt-Ahr bHLH

A

C AC

G

T G

C

MA0133 BRCA1 -

G

T

C T

J CREB1

CA

T

MA0018 CREB1 bZIP

T

T

GT A

C

MA0102 Cebpa bZIP

26

T

T

CGA

T

AG

J Cebpa

A

A

A

T

G

T

C

A

A

TGACG CAA

GCG

G

T

G

T

J BRCA1

C

A

GACG

A

G C

C A

T

C

G

T AT AT T

G A

A A

G

C

CG

C CCG G C C

J Cf2 II

MA0015 CF2-II ZN-FINGER

A

TGCAAT

G T

MA0019 Ddit3-Cebpa bZIP

J Dof2

MA0020 Dof2 ZN-FINGER

T

C

C

C

C

T CAGG

A

CT

C

J Ddit3::Cebpa

A

C

TCG

A

G

T

AAAG AAAG TTT GCGC TCC

C A

T G

CG TC A

G

J Dof3

MA0021 Dof3 ZN-FINGER

G

CC

J E2F1

MA0024 E2F1 E2F TDP

G

T

ACA T

G

TG

J ELF5

MA0136 ELF5 ETS

T

G A

C

C

C

GAA

CG

C

AA

J ELK1

G A

C

T

T TG

G

AG T

MA0028 ELK1 ETS

CCGGAA AGGTC G CC

GT

A

TAC

G T C

J ELK4

MA0076 ELK4 ETS

CA

AC

CC G TT

C

GG

A A

J ESR1

MA0112 ESR1 NUCLEAR

T

A

GG

CG

TAGAT

G GA A C

TA

T

CA

G

C

TCC

C T

A

T

A

T C

G

A

J ETS1

MA0098 ETS1 ETS

27

G T

A

C G T

GGAA ACGTGG

C

CA G A T

J Eip74EF

MA0026 Eip74EF ETS

AT

G A T

C

C G

T

T

A

J EmBP-1

MA0128 EMBP1 bZIP

CC

GAT GAAT A

T GG

J En1

G

G

G

A

GA A GATAA

A

J Evi1

MA0029 Evi1 ZN-FINGER

J FOXC1

MA0032 FOXC1 FORKHEAD

J FOXD1

MA0031 FOXD1 FORKHEAD

C

A C GA T

A

T

A

J FOXL1

MA0033 FOXL1 FORKHEAD

A

G

A

T

T

AA T

G

AG

GA

C

A

ATA

C

G CT

T

C G

C

G TT AG

A

G

C

T

T

A

G

G C

G

MA0042 FOXI1 FORKHEAD

T G

G T CA

C

TA C

T

G

J FOXI1

C

C

GTA GTAAACA GTA ACA TTTT T C A

MA0030 FOXF2 FORKHEAD

G

A

C

A

CT G

T

T GC

GCC

A

T

TGA T A

G A

C

J Fos

T

T

A C

A CT

AT

G C

J FOXF2

C

T CT

G

MA0027 En1 HOMEO

MA0099 Fos bZIP

28

G

CA

CC

CCTA

T

C

A

A

TA T TT T T

J Foxa2

MA0047 Foxa2 FORKHEAD

ATAT

G A T

J Foxd3

MA0041 Foxd3 FORKHEAD

J Foxq1

MA0040 Foxq1 FORKHEAD

G C A

CC

TGT T TTT

C

C

C

AC

GGT C

G

T AC T C

C G G

A

G

A

C A

GTTTA CGGAAG GAT AT T

T

T

AA

G

AC

G

C T

GG CA

A

MA0062 GABPA ETS

J GATA2

MA0036 GATA2 ZN-FINGER

J GATA3

MA0037 GATA3 ZN-FINGER

T

GA

GCG

C

C

J GABPA

C

ACG G ATA

A

T

A T

G

C

C

G

GAT

A G T A T

G

T

G

C G

AAC G C

C T

MA0034 GAMYB TRP-CLUSTER

J Gata1

MA0035 Gata1 ZN-FINGER

J Gfi

MA0038 Gfi ZN-FINGER

J HAT5

MA0008 Athb-1 HOMEO-ZIP

C

C

AAA

G

G

AC

J Gamyb

C

T

A

A

G

T

GAT AATC AATTATT C

G

T A

AC

A

T

C

G

CACA

AA

C T

T

G

C

A

29

TG

AC

G

G

T

C

T

G

GA

TT CG AA

GA

G T

J HLF

A ACT

G

MA0043 HLF bZIP

T

T

GTTG

A

G G

C

CGC

T

CTC

T GG

A

C

G

T

TC T

ACG C TC AC

J HMG-1

MA0044 HMG-1 HMG

A A T

G

J HMG-I/Y

A AA

GGG

TC

G

G

C

A

TT

A

AA

A

MA0046 HNF1A HOMEO

A

C

T

C

T

G

A GC

T

T C

A T

A

AT

G

C

A

G

G

C

T T GGC

G

G A TG A

T

T

C

TC

G

MA0092 Hand1-Tcfe2a bHLH

A TT

C

CAG T

G

A

G

T

C

CT AAA GA A GA A A G GA A

T

C T

A A

MA0109 RUSH1-alfa ZN-FINGER

G A

A

MA0050 IRF1 TRP-CLUSTER

C

G

CA T

MA0051 IRF2 TRP-CLUSTER

T

G

A T

G

T C

G

CGT

A

AA

C A

T C

G CC A T

G

MA0039 Klf4 ZN-FINGER

30

A

T

G

AA

C TA TA

G

G

TA

G

C

C

T

G

G

AAG

AGG GGGAA

T

A

CGC

C

GA

G T

CTCTCCC

J Klf4

C

TC TA

T A AGG AC

CTGG

T

C

TA T

J IRF2

A

G

MA0114 HNF4A NUCLEAR

A

J IRF1

C

A

GA T

GT

G

A

G

J Hltf

A

C

G CAAAG

GT T

C C

J Hand1::Tcfe2a

AA

C G A A CGT CGT G

T

C

GTTAAT TT

T A AA T

J HNF4A

G

MA0045 HMG-IY HMG

G J HNF1A

T G

T

A

T C

C

TAATT A AT A AT T A CACGTG CTATTT TAG ACGTCCG AAAG CACGTGG GGGGA A

A

T

TC

J Lhx3 a

MA0134 Lhx3 HOMEO

CGG

AT

A

TG

T

C AC

C

A

GC T

J Lhx3 b

MA0135 Lhx3 HOMEO

A

TT

G

T TC GA

C

A

G

C A

CT

J MAX

MA0058 MAX bHLH-ZIP

T

A

J MEF2A

MA0052 MEF2A MADS

AT

MA0131 MIZF ZN-FINGER

J MNB1A

MA0053 MNB1A ZN-FINGER

AT

A

AGT

C

ATCTGT T

A

G

C

J MIZF

AA

C C

A

G

C T A

A

A

T

G

G

C

GCA

J MYC::MAX

MA0059 MYC-MAX bHLH-ZIP

J MZF1 1-4

MA0056 MZF1 1-4 ZN-FINGER

T C A

A

T AC

T A TC

AT

G

T

AGGGG

T G

T

AC

J MZF1 5-13

MA0057 MZF1 5-13 ZN-FINGER

A T

MA0118 Macho-1 ZN-FINGER

31

G

A

GG G

G

J Macho-1

A

G A TG

CT

T

G T

C

T

T

C

T

C

T G

A

C

A

G

G

CTGA

CTCCG

T TT G AG ATAA

J Mafb

MA0117 Mafb bZIP

C GTTG

GG A A T

C

J Myb

MA0100 Myb TRP-CLUSTER

J Mycn

MA0104 Mycn bHLH-ZIP

A

G

AA

CTG

A

C A

T

CACGTG CA C G G GGG T CC ATGA TTA GTAA GGGG CCCC CCAAT CAGCTG TTAC C

C

AG G T CT

A A GA

AGA

CG

G

J Myf

MA0055 Myf bHLH

G

A

GA

T

T

CA

J NF-kappaB

MA0061 NF-kappaB REL

J NFE2L1::MafG

MA0089 TCF11-MafG bZIP

C

G

A C TG AT

A

G

A

C T

T A

C

G

T

J NFIL3

MA0025 NFIL3 bZIP

C

C

T

A

GT

A

T

GG

ATT

A

J NFKB1

MA0105 NFKB1 REL

GACT GG

C

AG C G

AG

A

G

C

T

J NFYA

MA0060 NFYA CAAT-BOX

G

C

A

A

T

C

CT

J NHLH1

MA0048 NHLH1 bHLH

32

G

C

A

A

T

CG

G AA

T G A

T

CG T G

C

AAA

T

T AA

A

TACTTA

T G C

TAAG

C

J NKX3-1

MA0124 NKX3-1 HOMEO

G TCA AG TCA A TGA CTTTG AG

A AG

CG TT

C

J NR1H2::RXRA

MA0115 NR1H2-RXRA NUCLEAR

T

CCAACC

A

J NR3C1

G

T

T

T G A

AC

G

A GT C

GGAAC T AT

G C

C

AA

T

AA CG T

A

MA0017 NR2F1 NUCLEAR

G

C

C

GAG

J NR2F1

C

T G

T

T

C TG T CC T A A G A

GA T TG GC A GA CT A

G

C

G

A

T

MA0113 NR3C1 NUCLEAR

AA AGTG TAATT AAAG ATCAATCA T

T

T

GTG

AC

AT

CA

G

J Nkx2-5

MA0063 Nkx2-5 HOMEO

J Nkx3-2

MA0122 Bapx1 HOMEO

C

TA

A

G

GG

C

CT

A T

G

J Nobox

MA0125 Nobox HOMEO

A

GT

G A

G

T

C

C

A

CC

T

G

T

A

C T

G

A

J PBF

MA0064 PBF ZN-FINGER

GC CA

J PBX1

MA0070 PBX1 HOMEO

A

G

J PEND

MA0127 PEND bZIP

33

T

GTG AG CC A ACA

G

T

G

A

T

TTCTTAT

C

T

C

A

T CG

CTG

A

G

G

G

T

C

G

C T A

J PPARG

MA0066 PPARG NUCLEAR

T A GT C A T A C A T

A

G

T

C

T

T

A

MA0065 PPARG-RXRA NUCLEAR

G

C

TCA

CG

CG

T

T

G C

T A

T

A

C

MA0067 Pax2 PAIRED

G

A

AA A T A

T A GC T T TA

C

MA0068 Pax4 PAIRED-HOMEO

G

G

C

GA

GCG

C

C

T CC A

A

A T

G

C

CA

GT T

T ACC

G

GG

C

TA

AA

C

C

A

T

G

A

C

T

CG

C

A

G

G

MA0014 Pax5 PAIRED

MA0069 Pax6 PAIRED

A

T

T

A C C C C CT CA C A TA T

TTA

T

A GT

G

CAGTGA A

T G

AC

G A T

C

A

T

T

T

G

T

T

G

A

TCCG

G

G

G

AT

A

A

C

T

G

C

G

GT

G

A

G

T A

G

A CA

TAG TAT

TCCC

A

C

T

A

GG C

TG

C

CC C

G

G

T AT

C

A

CCA T

G

C

G CC AC A

J Pax6

G

G

AATGT

J Pax5

C

AT

T CA G

G

G

J Pax4

CC A

T

G

T CA

C

J Pax2

T

G TC A A G TC A

GAGA

C

J PPARG::RXRA

T

T

CC

T AG A A

A

T

G

G

GG C

CACACG

A T

A

TAAT AATTA T

C

G

T

G

G

C

A

J Pdx1

MA0132 Pdx1 HOMEO

J Prrx2

MA0075 Prrx2 HOMEO

J REL

MA0101 REL REL

G C

G

T

GG TTCC T

G

CT TG A TCA

A

T

C

TT

MA0107 RELA REL

34

AAA T A

G

C

GGA TTTCC

G

C

J RELA

G

GA

G

A

C

C

C T G T CC G T C T G AGGTCA A TAGGTCA C CC C A A C A CC C TG GGT GGTCA G GTTC GG C

CG T

T T AC

CT A

J REST

MA0138 REST ZN-FINGER

CA

A

T A TA

G

G A

T

AA

T

G T

G

C

AC

T G C TCC A

G A

T

C

G

CA AT

G

G

G C

C

J RORA 1

MA0071 RORA 1 NUCLEAR

A T

T

AT

T

A

GGC

J RORA 2

MA0072 RORA 2 NUCLEAR

G

G

C T

A C CA AC

A A A C CC A

A

T

J RREB1

MA0073 RREB1 ZN-FINGER

T

A

J RUNX1

MA0002 RUNX1 RUNT

TC

C

J RXRA::VDR

MA0074 RXRA-VDR NUCLEAR

TGC

C

T

G

T

T

T

T

C

A

G A

A

G

A TC

MA0077 SOX9 HMG

CGG

J SPI1

MA0080 SPI1 ETS

T

G

T A

TT G T AC

TA

C A

MA0079 SP1 ZN-FINGER

AA

C T

GG G T GGAA GGAA GG

CT

J SP1

G

GC

A

T

CC C C G

J SOX9

TA

CCG

CAAT

A

A T

T A

T

T

G

AA

C

A

AA T A

GA

T

T

AT

G C C

CC

C

A A AC A AG

G

C A

G

C

A TA

A

T

C

C T

A TG

T

A

CC T

J SPIB

MA0081 SPIB ETS

35

T

A

A G

AT

T

CC TATA GG ACAA

GC A

T

AT

A

G G A

C

J SRF

MA0083 SRF MADS

T

T

A

A

C

A

T

AA

T

AA T

T

A

TT

G

G C

J SRY

MA0084 SRY HMG

MA0137 STAT1 Stat

G G

A GA AC ATTGT AACAAT GA A

GA

C

J STAT1

G

A

T

T AC

C

G

CC

GA

G

C G

C T

GT

A TC

J Sox17

MA0078 Sox17 HMG

J Sox5

MA0087 Sox5 HMG

C

T G G T

C

AGGGTA CAGC

GCAT T

J Spz1

A

TT

GA

GTG GAA TAGGTGT AA A CA CTG GA

T

C J Su(H)

GGC TG A

G A

MA0111 Spz1 bHLH-ZIP

MA0085 Su H IPT TIG

CC A

T G A

C

G

JT

MA0009 T T-BOX

A

T

MA0091 TAL1-TCF3 bHLH

A

T

A

36

T

T

T

C

T

C A

G TC

C

ATAAA

G

MA0108 TBP TATA-box

T

G

GA

G

C

J TBP

C

T

C

G

J TAL1::TCF3

G

CT

C

TC

A

T

A T

G

G A

G

C

T

T

CAT CC GCC C T

A

G

T

G

A

J TEAD1

MA0090 TEAD1 TEA

TC AG G T C

A

G

GAG

A C

C

T

TT

C T

G

J TFAP2A

MA0003 TFAP2A AP2

J TGA1A

MA0129 TGA1a bZIP

C

AC

G G C

C TG CA GC AA G ACATG C GGGCATG CACGTG T

ACGT A

CT AGT

AA

T G

CCA

TGGGT T C

T G

A

A

T

C

A

J TLX1::NFIC

MA0119 TLX1-NFIC HOMEO CAAT

C

G A

J TP53

MA0106 TP53 P53

J USF1

MA0093 USF1 bHLH-ZIP

G CT

AAG

T

T

T T AA

AGT A A

T

C

T CG AA

G A

TA

G C

T

TAAT

TG C

C

A

G

T

G

J Ubx

MA0094 Ubx HOMEO

J YY1

MA0095 YY1 ZN-FINGER

J ZEB1

MA0103 ZEB1 ZN-FINGER

CCAT

G A

T

T

ACCT

C

TG

G

G

G

G T

A

T

A

C

CCAC

T

A

A C C G

J ZNF354C

MA0130 ZNF354C ZN-FINGER

37

G

C AG

GCGCACA CTTGA GTGTGCA A

C

J Zfp423

CG GC TGACGT G TGACGTG C A ACA CTA AAC A T AAA A GGG TTT GGG TTCC GT

C

ACC

CCTC TG

CG

MA0123 ABI4 AP2

T

TGG

G A

A

J abi4

T

C

T

A

MA0116 Roaz ZN-FINGER

AA

C A

G

J bZIP910

MA0096 bZIP910 bZIP

A

T A

J bZIP911

MA0097 bZIP911 bZIP

GTC

G

C

TA

T A A TG A T G

G CC T AG

J br Z1

MA0010 Broad-complex 1 ZN-FINGER

J br Z2

MA0011 Broad-complex 2 ZN-FINGER

T

T

A

T

AC

G

C A

T

T

MA0012 Broad-complex 3 ZN-FINGER

TATGAAG

C A

J br Z4

MA0013 Broad-complex 4 ZN-FINGER

G

T

C

T

MA0022 dl 1 REL

G

T

T C

J dl 2

MA0023 Dl 2 REL

38

C

A

T

T

T

G

C

A A

GCT T T T

G

C

G

A

T

CT TTGGA

J dl 1

AA

T G

G

J br Z3

G

GA

ATG

AC

G

A

T

T T

TC

C

C

AT

A

AA T

G C

A

G

GG

T G

C

A T

A

A

T

T AA

CC

CC

CT

T

G C A

G

AA

C

A

CAT

G

G AC

C

A

G

T

J hb

T

MA0049 hb ZN-FINGER

T

AAAA

CTG

T

TG

G

GA

T

C

AC

G

A

MA0120 id1 ZN-FINGER

T

A

AAC GTTT T

MA0054 MYB

C

T

C

A

C

AA

A TG

G G

C

J myb.Ph3

T

TCCA

GG

C

T

G

C

T

AA

A

GC

T C TG C TC T

GG CC

J id1

A

CC

A

C

J ovo

MA0126 ovo ZN-FINGER

AAC G

J sna

MA0086 sna ZN-FINGER

CAGGTG

GT

A T

CA CT T G

C

A

AA

T

G T

A T

J squamosa

GA

CA AATATGA

C

A TGTT T

TA

C

A T

C

GGGGTCA

A

C

T

G

C AC

CG

MA0082 SQUA MADS

C

CG

J usp

MA0016 usp NUCLEAR

G

T

T

TC

G

J znf143

MA0088 Staf ZN-FINGER

U Aft1

GR09 Aft1

A

TC

A

GC A

TTT

G A

C A

T

A

A

A T

39

A

C T

G

C

G

C TGC

C

T

G T A TC A GA C G

C

G

A

G

C

A

T

T

TG A

TA T A

T C

A AG

C

G

C

A

T

Cell08 Alx3 3418

TCA

G

C

G

CA TC C

A

G T

TGCACCC

G A

G

C

U Alx3

G

A CT A T

TC C

TC

GA

T

CA

T

T A

A C

GCT AT

G

T

C

A

G A

G

T

CG A

CC

GA T

U Alx4

A

C

TAATTA

C

G

T

A

T

Cell08 Alx4 1744

A

G

G

T AT A

T A T AA G

CC

AG

T

GA

U Arid3a i

Arid3a pwm primary

U Arid3a ii

Arid3a pwm secondary

G T AC A

A TC

G T

C GT TC C

AT

CA

T

C G

G C G G G C AT A

T

A

AT

A TA T

G C A

TT

U Arid5a i

Arid5a pwm primary

AT

A

T

A T

C

ATACG

A

G CG

T

Arid5a pwm secondary

G

T

T

A

T

C

T A

G

A

G

CT

TC

G C

A

A

G

GR09 Aro80

T

A CG A

GT T

Cell08 Arx 1738

T

U Ascl2 ii

Ascl2 pwm secondary

A

TA

CCC

CCGG GR09 Asg1

40

C

CC GA

A

T

A T

T

A T

C G

C T

G

A

A

CCC T

T ATG A

GC

G

C

A

G

GAAG

A

T

T

C

GG

C A T

CG

T

G TG T

A

ATA

CAGCTG

C

G

T

G

GT

C

A

Ascl2 pwm primary

C

C

G

A

G

G C

U Ascl2 i

CG

TA T AA

CC

C

CAG

U Asg1

G

C A

CTA T

C

U Arx

T

G

C

A G

U Aro80

A

G

CG TCGCG AA

C

U Arid5a ii

G

CA

A

C

C

TT

AA

G

G G

T

G GT

CCGG

T A T A TGGCA T

C

CT

A T CGA G

T

G

A

C

A

A

T

C

A

TGACGTCA

G

CC

U Atf1 i

Atf1 pwm primary

A

T

TA

T

A

A

AA T

A

G

T

C

G

C

CACTTA

C

A AG

GT

C

T

A

G

Cell08 Bapx1 2343

TT

AA

A

T

G

T

T

A

Cell08 Barhl1 2590

A

A

T

C

C

T

G

G

A

A T TA

CA

C

T

T

A

C TG

A

G

G

GC T AC C

U Barhl1

T

G

T

C

AA

CC T

T

A

G

A

Cell08 Barx1 2877

AAT T

C

C

G

Cell08 Barx2 3447

AATT

TT

G

GC

C T

GT

G

GR09 Bas1

U Bbx i

Bbx pwm primary

C

T

C

A

TA

C

A T

C

T

T CG

T

AC

G

G

A

C

T G

A

A

C

T

A

T

A

A CG T

A

GA

C

C

T

AT

G

T

A

TA T T T

T G C

G

C

C

T

C

G

G

C

G

C

C

C

A

G

G

GTTAACA

G CA T

AGCG T A AC

C

GT

C

41

G

TA

GC C

TTC GAA

G

T

C

Bbx pwm secondary

AT A

C GAGTCA

A AG

AA A

G

G

U Bbx ii

T

C

TG

G

A G CA G A

U Bas1

G C

C

C

C

A

A

G

G

CG

G

T AT

A

G

CT

U Barx2

C

G

C

TC

G G

U Barx1

T

T

A

C

Cell08 Barhl2 3868

T

G

GC T GT

TTT

G

A

G

A CA A T T A A

AG AC

C

U Barhl2

T C

Atf1 pwm secondary

C

U Bapx1

T

CT ACG TG T

U Atf1 ii

G

A

TGACG

G

CA T T TGG CG ACCA T

T

CCA

GG

T

C T

G G

C

TT

TC AGGAA G

Bcl6b pwm primary

T

G

C

C

CCCC

CC T

C

C

G

U Bcl6b i

T

T T TC T T TA A A

T CA ACA AG G

G

T

T

T A TAGG G T AT AG A AG

A

T

C

U Bcl6b ii

Bcl6b pwm secondary

U Bhlhb2 i

Bhlhb2 pwm primary

A

G

G

CACGTGA

T

C

G A

A

C

C CG G

T A T GA A

T

GC

CC

A

A G

GAG

T A

T

G

U Bhlhb2 ii

C

Bhlhb2 pwm secondary

T

C

G

AG T

T

U Bsx

A

Cell08 Bsx 3483

A

U Cart1 a

AATT C

TA

C GG A

T

U Cbf1 a

GR09 Cbf1

CT T

T

C

T

T

C

NBT06 Cbf1

42

T

A

C A

T

G

A

C

A

T

T

G

CA C G T G T

A

TA

CCG T

C

T

G

G C

C

AA

C

T

GTAA AA

AA

Cell08 Cdx1 2245

A

C

A

A

GT

GT

TG

C

U Cdx1

G

G

C

C

G

TA

A

A

C

T

TC

G

C

G

T

AA

C

T

G

TT

C

G

A

AGG

TAATTA

AG A

U Cbf1 b

G

CACGTG

G

A

T

TAATTAA

C

G

A

A

C

Cell08 Cart1 1275

C

T

C

U Cart1 b

T

C

C

Cell08 Cart1 0997

T

G

G

G

C

A

T

T

A

G

G

TC

A

T

C TCC

C

T

G

C

CG

T

TA

G

TT

TC

ACT

C

T T

AG

GA

A T

T

TG

C

C

G

A

C

A T AAA

G

TA GAC C A T

U Cdx2

Cell08 Cdx2 4272

C

CAC T T A

GT T A

U Ceh-22

NBT06 Ceh-22

G

U Cep3

G GC TC A

T

C

A

T

A

A

CC

T CC G

A T

A

CA

T

C

C

GR09 Cep3

T A

C

A

G

C

T

C

PNAS08 Cgd2 3490

G T

Cell08 Cphx 3484

T

T

AATC T

AG

C

C

T

C

A

G

C

G

AG

C

A

T

G

G

T

TC

C

G

T

C T

A

A

GC

A

G T

T

T

TC

C

GG

A

T

G CG

A

Cell08 Crx 3485

G

C G

T

C

T C T GA T A A

G

T

A

C

G

A

C

C

U Crx

C

A

C

U Cphx

T

G

GT

G

A

GR09 Cha4

G

AA

GT

A

GCATGC T C GC TGAT ATCA GGAT A ATGTGTCA

C

U Cha4

T

A

A

C T U Cgd2

A

T

GG

T TC

G

T

AT

G

C

GT C

C

G

G

T

CG A

G

T

TCGGAA

T GC

A

T

A

T TC

A T GG

A

G

C

T

G

C C

CA

G C AT T

GG

C

T

A

T

A

T

A

T

AT

G

U Cup9

GR09 Cup9

U Cutl1

Cell08 Cutl1 3494

AAC

C

A T

Cell08 Dbx1 3486

43

A

GT

A TT

A TTA

AT

A

C

G

A

C

AT A

ATT

A AG CC GCC T

T CCGGG G

T

G

AC

CG

C

T GAAC

G

U Dbx1

C

G

TA T TA

G

C

C

T

AT AT A

TG

A C GT

A

C

C

G C

G C

A

AT

TA

C

TA CG C

TCGA

U Dbx2

Cell08 Dbx2 3487

U Dlx1

Cell08 Dlx1 1741

G

AA

G A

T

AT T

TA

GG

A

G

G

A

T

A

A T

U Dlx2

Cell08 Dlx2 2273

A

AA T T A TAATTA TA T A AATT GGATTA G

CT

Cell08 Dlx4 3488

C

A

GCC

G

A

TCT

G T

T

T

A

U Dobox5

Cell08 Dobox5 3493

G A

C

T

C

CCT T

GT

U Duxl

Cell08 Duxl 1286

44

A

C

T

T

TT

C A

T

A

T

C

GA A

A

T

A

C G

C

T

GG A

C

G

G

GCT

T

T

C G TTA T

A

T

A

C G T AG A

AT

GG

C

C

GA

C

Cell08 Dobox4 3956

T

G

AG TACCC

T AT T

U Dobox4

C

A GGATTAA A CA TCAA

T

AG G

T G

G

GA

TG

A

C G

G

C

AA

GT

C A T

G

C

G

C

C C

Cell08 Dmbx1 2277

T

A

C

AA

U Dmbx1

A

G

C

GG

Cell08 Dlx5 3419

C

T

CC T

U Dlx5

A

A

A

A

T

A

GT

C

A

G

G

GCA

T

U Dlx4

C

T A AT C

A

TC

Cell08 Dlx3 1030

T

A

TCG C T

G C

T

G

U Dlx3

C

G

T

C

G C

T

T A AT T A

C

G CC GT T

T

G

A

G

GT

C

T

T

TG

C

GCGCGC

A

G

AC TG

CT A

T GC

CA

C

A

G

U E2F2 i

T

E2F2 pwm primary

G

T

CG

C

AAA

G

U E2F2 ii

E2F2 pwm secondary

U E2F3 i

E2F3 pwm primary

TG AC

CG

T

E2F3 pwm secondary

T

A T

A

G

GGG CGGA

C

G

A

C

G

GR09 Ecm22

A

GC TA T C

T

CG C

G

A

C

A

G

Egr1 pwm primary

A

A

A

G

GC T

C

CC

A TA T

T

G

CG CCC C

C U Egr1 i

C

GTGC

AT

C

C T

U Ecm22

T

C

A

GG CCA

TAC

A

A

A

T

T

C

TC

A

T

G

A

A

T

G

T

G

GC TC

C

U E2F3 ii

C

A

GCGCGC

AG

G

A

GTG T

AT

T

CT A

A

A

G

GG CCA

C

T

T

T

T

G

A

A

TTT

CGG A

C

GG

GA TG GA GA

AG TC

A

A

T

U Egr1 ii

Egr1 pwm secondary

C G A

C

Ehf pwm primary

U Ehf ii

Ehf pwm secondary

AG T

A

G

A T

C

Elf3 pwm primary

45

C

GT

TA

A

T

A T

A

G

GT CG TT

AC

G

GGAA

GGA

C

C

A

A

C

A

A

T

U Elf3 i

A

C

C

G C

GGAA

AAT

T

T

G

G

GA

C

TTCC A

A

T

A

G

A

G

G

C

TT T

A

C

C

A T

C

U Ehf i

T

T

CA

C

A

T

A

G

GT A

CA

C

T

A C

G

G

A

A

G

TC

G

T

T

A

U Elf3 ii

AA

AAA TAAT

CA T T T T

G C

A T

CC T

G

T

A

C

A T

C A GCG A

A

C

G

T

T AA T A

C

G

A G

U Emx2

Cell08 Emx2 3420

U En1

Cell08 En1 3123

C

G

GCT A

GC T

TA

T

C

A

TG

A AC

GG TC

C

C

C

T

G

TG

Cell08 En2 0952

A

GA TT

TA

G

AA

CC

CT

C

T TG

AC

T

C

Eomes pwm primary

T

G

GAC

T

A

CA CA

G

T

G

G

TG

T

A

AT

C

G

G GC

CA

G

ATT A TA T A GGTGTGA

AA

CC GT

U Eomes i

T A

A

G G G

G

Elf3 pwm secondary

C

U En2

C

T TA CC CCC CCC C

GG

G

AGGTGT G C

A

C

GA TT

C

U Eomes ii

Eomes pwm secondary

A

AAGGTCA AGGGGTCA T AA T TA T AA T TA T

G

G G

U Esrra ii

Esrra pwm secondary

A

C

G

A

T

G

T

G A

T

A

Esrra pwm primary

C

A

C

T

C U Esrra i

TT

G

C

G

C

C

AT

C

A

G

A

T

T

A

GT

C

G

C

G

A

C

U Esx1

Cell08 Esx1 3124

C

A T

AG

G

U Evx1

Cell08 Evx1 3952

46

G

A

C

C

C

T

C

TC

A

A

C G

T

T

G

T

G

C

G

A

TC

T GC

C

GA

G

T

TA AT A GACGCA GTAAACAA GTAAACAA GTAAA A GC

T

A

T

C

GG

U Evx2

Cell08 Evx2 2645

U Fkh1

GR09 Fkh1

A

AA

C A T G T

U Fkh2

GR09 Fkh2

A

AC G

G G

T

C

G

A

C

C

AA T T

G

A

U Foxa2 i

Foxa2 pwm primary

U Foxa2 ii

Foxa2 pwm secondary

AA T

C

A T

CG C

T

C

T

A T

CG G

CT A

A

TT

CG G C

A

TAAC TT

G

T

T

C

ACCC

AA

G

T G

G

C

T

A

C

GC T

T

A AT T TT

G

CC

T

C

C

AG

GR09 Fhl1

C

T GGG GGT T

C

T

A

CT

U Fhl1

G

C

CC A AG

AA

G CTG G CT A TC TT G G ACG

C

G C

CT

C

Foxj1 pwm primary

AT

C

C

A

U Foxj1 i

T

TAAACA A

G A

C

T

C

G

C

T

T TT

G

G C

A AACA C

TC

T GTGCAG

G G

T

CAC

G

T

U Foxj1 ii

T

T

C

T

C

A

Foxj3 pwm primary

A

G T

Foxj3 pwm secondary

47

C

T

A T CC

T

AG

C T

T

T

C

G

A T

CC G

G

CA AACA

C

C

U Foxj3 ii

C

TAAACAA

GA U Foxj3 i

G

T

G C

A

Foxj1 pwm secondary

T

A T

A T

C

G

A

T

T AA A

GA CA

ATT

A

G

GGA

A

C

C C

U Foxk1 i

Foxk1 pwm primary

C

Foxk1 pwm secondary

T

G

A

Foxl1 pwm primary

A T

T A

AT T

A

G

U Foxl1 ii

T

T

Foxl1 pwm secondary

T

C

U Gabpa ii

Gabpa pwm secondary

T

T

TCC

AC

AG

G GA

T

T

A

G

G

T

G

G

A

T

G

G

GC T

A

CG G

A

G

C

C

G

T T

48

AC T

A

CA A T T

GR09 Gat4

G

A

C

C

C T GG

U Gat4

A

CG C G CTTATC AGATCTAC TAGATCTA

GR09 Gal4

GR09 Gat3

T

G

T

U Gat3

A

G

CC C C TGA T AA T

G C

A

C

GR09 Gat1

A TC

AC

T

A

GG

U Gat1

A

G

C

T

A

CC T

U Gal4

T

G T C C

G

TG

Gabpa pwm primary

C

G

CCGGAAGT

A

G

A C

U Gabpa i

T

A

TT

CG

AACA

CG

CC T

A

C

C

C G

C

CAAT T

G

G

TT

AG

G

G

TAA CAA

GC

C

G

T

C

AATGA

A T TTA

C G

C

C

G

C

TC

G T AG C

U Foxl1 i

A

TT

CG

A AACA

ACA

G C

U Foxk1 ii

T

T

A

T

A

A

AG

C

T G

T

C

G

T TC

TA

CG

CA A T

T

CA

G

G

C G

A

GATAAG

A T

GG TT

U Gata3 i

Gata3 pwm primary

A T

T

C

A

C

GAT

T

U Gata5 i

Gata5 pwm primary

T

TA

G

GATAA

C

A

T

AT

Gata5 pwm secondary

Gata6 pwm secondary

GATAA T

G

A C

T

G TA C

TC

AT

G AG T

TA

CGCGA

GC GA

C

A

T

C

G A

Gcm1 pwm primary

C

A

A

G

G C

G

A

T

G

C T

G

G

T

G

T

C G

G

T

A

T

G

T

G

CCC

A

CA

A

TG

GT

AA

AC

C

49

GA

A

T

Gcm1 pwm secondary

CT

A

C

C

C

U Gcm1 ii

CA GC

CC

T

U Gcm1 i

T

G

G

G

T

Cell08 Gbx2 3110

G

T

CT T AG T A

GC

GT

C

C

U Gbx2

C

A

AA T A CTAATTAG CCCGCAT ATAGGGG CT

A AC

G

CG T T

Cell08 Gbx1 2883

G

CA TG

G C

U Gbx1

A

C

G

A T

U Gata6 ii

T

A

AT

C

Gata6 pwm primary

A

AC

TT CGA

G G AC

T G

U Gata6 i

G

CG

T

AGATATC

GCT C

U Gata5 ii

G

T

CA G

G A T

CA

AC

G C

G C G

Gata3 pwm secondary

ATC

TC

C

CT TA

U Gata3 ii

A

GGT

C

A

A

CC G

T

G

GA

AC

G

T

C

C

C

CT

T

TG

TGA TCA ACCCCC G

A

G

U Gcn4

GR09 Gcn4

C

T

C

T

A

C

G

C

T

A

Glis2 pwm primary

U Glis2 ii

Glis2 pwm secondary

G

A

TTA

TC

CG

U Glis2 i

T

C

A

G

T

A

TA T T A

AA

G

C

T

G T

C

AAA

T T TC

GGAGG

A

CTGA T ACCGT T

ACCCC

T

U Gln3

C

T GT A

GR09 Gln3

T

AA T

G

Gm397 pwm primary

U Gm397 ii

Gm397 pwm secondary

A

A

C

C

C

A TC T

G

G

T G

T

CG

C A

A

G

A TA

T

C

Gmeb1 pwm primary

CC

CG

C A

T

T

G

C A T GT

G

G

AA

T

C

C

CT CT

C G GC T AT

T GG

AA

C

TCGA T CGA A

G

G A

T A

Gmeb1 pwm secondary

TAATCC T AA T TA T

T

CT

G

C

AC A A

C T CA T

Cell08 Gsc 2327

G A T C

U Gsh2

GT T

G C

G

A

AA

U Gsc

T AC

TCCA

A TG

GG

G T

U Gmeb1 ii

T

C

G

G A A A T

C

TG A

GCACACAC

C

U Gmeb1 i

A

G

GTGCACA

C G

U Gm397 i

T TATC

G

T

A

G

C

G

AC TCC

T

C

G

A

C

T GG

A

G C

G C G

Cell08 Gsh2 3990

50

C

T

T

A

GC T

G

A

C T

TA

G A

GC

TT

CA

TG

A

G

A

G T

C

A

A T

G

U Gsm1

GR09 Gsm1

GR09 Gzf3

U Hal9

GR09 Hal9

T

C

AG

CCT

T

GA

A

T

AT

A

G

G A

CG A

T

A

G

TT

A

G

T

G

T

CCC

TT

U Hbp1 ii

Hbp1 pwm secondary

U Hdx

Cell08 Hdx 3845

ATTGT

A

GAA ACT

T

TG

A

Hic1 pwm primary

T

Hic1 pwm secondary

U Hlx1

Cell08 Hlx1 2350

T

A

C

A

C

A

CT

C

G

A

G

A

T AA T

C TG

A

G

C

T T A T TA A

GGAAG

CA

AC

A

AT

A

CT

T

G

G

T

TGCCC

G A

U Hic1 ii

T

G

G

C

G

TGCCA C

G

U Hic1 i

ATCA

GT

G

C T

CC

G

G G

G

G

G

AA

C

G G

CT

A

G

C

TT TA TG CGA A CA A GA

C

GC A

T

C

A

G

T A CC

G

T AATGAA C

C

CGGA

C

TC

A

Hbp1 pwm primary

A

GT

C

G

GT

C

U Hbp1 i

T

CC A

T

TC

G

CG

C

C G

G

G C

U Gzf3

C GATA G

A CT CGGA

TT

GT

GACT TCCC

G G

A

G

G

TAATTA

CC GC G T

C

A

U Hlxb9

Cell08 Hlxb9 3422

51

C

GTG

CGA

G

G

CT

T

A

C

T

U Hmbox1

Cell08 Hmbox1 2674

G

T

G

T

Cell08 Hmx1 3423

U Hmx2

Cell08 Hmx2 3424

A

A

A

A

GT C

G

T

CT

TC

Cell08 Hmx3 3490

T

A

T

GC

CT T T

A

GC

TA C TGT A

A

A

AA

C

C

A

TC

U Homez

Cell08 Homez 1063

GC

G

C

A

A

T

A

T

A

C C

G C

U Hoxa1

Cell08 Hoxa1 3425

T

GCTCA A

C T

C

C

U Hoxa10

Cell08 Hoxa10 2318

A T

U Hoxa11

Cell08 Hoxa11 2218

52

A

A

G

C

G

A TA AA A

T

A

TT

T T T AG CA

C

CG

G

C

A

A

T

A T

C A

G

TC

TC T A A

C

T

G

T

A T TC

GG

G GA

A

CG

A C

G

G

CCGT

T CC

A

T

T

AT

CC

G

G

G

A

A

A

G

G

T AGC

C GG AAT

A

G

T C

G

TC A T

A

GC

A

G

AA TC

T

G

T

G

AT

G

T

C

A

C

CG

T G

G

T

G

T

G

GT A

CGC

G

C

G

T

C

Hnf4a pwm secondary

T

G TCA AAAGTCCA ATCGTTT TA T A T

U Hnf4a ii

T

AAA

C GG G C

G C

A

GG G

A

Hnf4a pwm primary

T

C

A

CA A T A

GC

CA

U Hnf4a i

A

TG

G

T CTG

CAAT AA

G

GA CT

U Hmx3

GC

T

A GG

A

GC C C

A

G

CAATTAA

AG

A GC CT

U Hmx1

GT

CTA TA

A

G C

A T

C G

T

T

CTCGTAAA GCT A

U Hoxa13

Cell08 Hoxa13 3126

U Hoxa2

Cell08 Hoxa2 3079

U Hoxa3

Cell08 Hoxa3 2783

A GG AC

G T

A

C

TT

C

C

G

A

A

T

T

TA AT A

CC

A

AT

G

TA A T TA

CC

T

G

G

CA

A

G

T

C

Hoxa3 pwm primary

Hoxa3 pwm secondary

U Hoxa4

Cell08 Hoxa4 3426

T

A

C A

C

G

GCA G

T C A TC T

U Hoxa5

Cell08 Hoxa5 3415

U Hoxa6

Cell08 Hoxa6 1040

U Hoxa7 a

Cell08 Hoxa7 2668

C

T

T

T

C

A

G

A

G T

GC A

53

T

CT

G A

A

AT T

A

A

A

GA C T GA

T

A

G

C

C

C A

A

T

G

G

G

T

C

CTG

G

CC

TA A T A T

A

CT A A T

GA T

AG

CT C AC

G

Cell08 Hoxa7 3750

A

G C

G

G

T

AT

C

U Hoxa7 b

C

TT

GA CC

CC A AC T AGT

T

C

TA AT A

G

T TTG

CA

GG

TA T A

T CT

G C

AC

GGG

TA AT T A

C

G

G

TA AT T A

CC

A AGG

GCA T

C

A

C

C

T

CC

G

A

G

G

T

G

C

TC A G A CT T GG

T

T TC

A C T GC A

A

A

G

GGC

A T TA

C

AA A AG

GC

AC

CG

T AA T T A

CC

C

T

U Hoxa3 ii

C

G

G

T

C

A

T

A

C

CA CA G

G

C

C

GT T C T

GG

C G

C

T

U Hoxa3 i

T TA

G G

GT

A

C

T

A

C

T

T

A GGG

GTC C G A

C

A

G

T

C

A

A

GT

C

U Hoxa9

ATAA A

GCC T

G TA G

CAC T

A

AT

Cell08 Hoxa9 2622

A

T TA TA

C TG

T G A

Cell08 Hoxb13 3479

TT

T

G T

A

Cell08 Hoxb3 1720

T

C

GC

G

AA

A

C

T

Cell08 Hoxb4 2627

U Hoxb5

Cell08 Hoxb5 3122

U Hoxb6

Cell08 Hoxb6 3428

T

G

A

TT

G

C

C

T AA T A

CC

TA AT A T

C

A

A

G

A

CC T A

T

U Hoxb7

Cell08 Hoxb7 3953

A T

U Hoxb8

T

C

T

A

T

Cell08 Hoxb9 3413

AT

54

T

C

GGA T

TA

A

T

G

TG

GT

A

CCA G

AC

TA

TC

C

G TCA

T

G

G

T

TT

T

G

TCGTAAA

G

Cell08 Hoxc10 2779

A

G

CC

U Hoxc10

T

A T AAAA

C

A

A

A

C

AT A

C T

U Hoxb9

A

CG

A

G

G

C T

T

A

T

G

AG C TA

G

A

TC

GA G

C

A

T AA T

TC

Cell08 Hoxb8 3780

T

A

GCA T A

C

AG

TA TAC

A

G

T G C

T

C

C

T GC

C

CA

TA A T TA

C

T

G

GG

GC

T AT

G

GGT

CC

AT A TG G

CC

C

GGG

G

AG

C T

A

T

TAATTA

G

C

C

U Hoxb4

T

A

G

C

TC

TG A AA CCC

G

C

U Hoxb3

AT

GG

C CC T G

G

ATAAAA

CCTA U Hoxb13

TT

A

A

T

T

T AA GTCGTAAA CTC TAAA TA T A GTCG A

AA

GT A TG

A

A

G

CC

C C

U Hoxc11

Cell08 Hoxc11 3718

AG

GA

A

C

CC

C

T

T

U Hoxc12

Cell08 Hoxc12 3480

A T

T

T

T

C

A

U Hoxc13

Cell08 Hoxc13 3127

A

C A

A

T

T

T

AA

A

T

C

A

U Hoxc4

Cell08 Hoxc4 3491

U Hoxc5

Cell08 Hoxc5 2630

U Hoxc6

Cell08 Hoxc6 3954

T

C

T

A

GCCC

C A

T

T

T

TC T

G

T TA

CC

T

G

G

G

T A

A A

T

TC

A

T

A

GC T

GG

T

AA

GA T

GC

A

T

T

G C

U Hoxd10

Cell08 Hoxd10 2368

55

G

CA

A

AT

T

C

CT

G

GA T G

G

TA

C

CGA

A A AA

TCT

T

C

TG

A

A

T

T A TA

AC GTCC

C

C

T

TC

C

A

AT A A

A

Cell08 Hoxd1 3448

T

A

AG A

A

U Hoxd1

G

C

GTC T

GAC

Cell08 Hoxc9 2367

A

C

T

G

T

C

U Hoxc9

T

A

AGT

T AA T T A

CC

Cell08 Hoxc8 3429

T

G

G

U Hoxc8

T

C GA

TA

G C

T

G

AT

GGC

G

T

AA C

A

C

T

G

TAATTA

GCT

C

A

T

GC

T

T AA T A

AA T

GG C

A

T

GGC

G

A

C

C

T

A

CC

GGCGC T

T

G

A

A

G

G

A C

T

T

GC

TCA T

TC T AA GTCGTAAA ATAAAA TAATTA G GA A

C

C

U Hoxd11

Cell08 Hoxd11 3873

U Hoxd12

Cell08 Hoxd12 3481

A

C

A

T

A

T

CTA

C

T

T

G

A

U Hoxd13

Cell08 Hoxd13 2356

TG

T

GT

C

A

T

U Hoxd3

Cell08 Hoxd3 1742

TA

A CC T AG T

GT

AC

T

C GC

U Hoxd8

Cell08 Hoxd8 2644

U IRC900814 i

IRC900814 pwm primary

T

T A

GA

T

AA TCGT G

T

T

T

U Ipf1

Cell08 Ipf1 3815

T

A

U Irf3 i

T

C

C

GC

GGCT

AA T

T

A

CT

AG

C

GA

C

TT

G

G

C A

C

A

T

G

C

A

CT

A

C

G

Irf3 pwm primary

AA GG

AG ATT A T

C

G

CCT

T

T CG

T AA C GTG G T G

C C

G

C

U Irf3 ii

T

T TGC

AG

C

T

G

A T

G

AA A A C

G A

G

GT A

TAATTA

G C

T

C

T

A

CC

C

T

AA

G

C

C

GG A

T TA

G

A

C

T

C

C TC

T

C

IRC900814 pwm secondary

T

C

G

AG

GGC

U IRC900814 ii

GG

TCAACA

A A TCA T T AG G

G

C

T

G

GG

G

GT

A

T

T

ACGACAAA

C A

C

T

AT AAT

G C

G

C

T

CC

T AG CA A A GC

T

T

T

CT

G

T

A

A

C

A

C

Irf3 pwm secondary

56

AT

CGAAAC

A

C

TTT C

G

T

A

A

C

U Irf4 i

Irf4 pwm primary

T

A

TCTCG

T AC

AAA C

G

C

G G A

G

U Irf4 ii

T

C

Irf4 pwm secondary

TA T

A

C

U Irf5 i

Irf5 pwm primary

A

T

GAGA

AT

G

T

T

A

Irf6 pwm secondary

AA

T

T

AA T

A A

T

A

G

GG C

Cell08 Irx2 0900

T

A

A C A T GT A T

C

T

Cell08 Irx3 0920

T

U Irx3 b

Cell08 Irx3 2226

T

T

A

T

U Irx4

Cell08 Irx4 2242

57

G

T

C

T

T

A

T

A

T

A

A

C

T

AA

A T

G G

G

C

T

T

T

AA

C

G

G

C

ACATGT

T T TA AA

C

G C

G

C

G

G

G

A

T G

C

C

A

TA

T

A A

C

ACATGTA

C

A

A

A

G C

G

G

TA

TA

G

CA

CT

G

A

ACATGTA

G

U Irx3 a

AA

TT

A G

G

U Irx2

T

GT

C

AGG T

G C A

G

C

A

G

C

TC CG

C A T

CT A

A

C T

T

T

TC

U Irf6 ii

T

CGAAAC

A

G C

Irf6 pwm primary

GC

TAG C

G

Irf5 pwm secondary

U Irf6 i

T

G

A

C

U Irf5 ii

C

T

A

C

T

G C

T

GAAAC

AC

C A TGT

G

A

GT

C

A

A A TG T AG

C

T

G

AT

C

C

AAA

CT T

T GG

G

C

T

A

ACATGT T

G

U Irx5

Cell08 Irx5 2385

T

C

T

A

A

TA T

C

C

U Irx6

ACATGTA T

C

Cell08 Irx6 2623

A

T

T A AA

G

T

U Isgf3g i

A

C

C

G

Isgf3g pwm primary

A

AA

T

U Isgf3g ii

A

A

C

C

A

C

A

T

A G G

ATCTCACA C

A

GG T

Jundm2 pwm primary

A

A T

T

C

T

TA

C

TC

C

AC

C

A

G

AA T

C

T

TG

Klf7 pwm primary

A T

A

Klf7 pwm secondary

58

G

A

T

CC A T

G

G G

A T

C

A

T

G

C T

TA

CGCCC

TA

AC

G G

A

G

A

G

TG

GA CT

Jundm2 pwm secondary

C

G

C

A

TT

AATT G TGACGTCA TGA TCA CC C CCC C

C

A

G

C A

A

Cell08 Isl2 3430

Cell08 Isx 3445

TTA

C A

G G

G

T

U Klf7 ii

TT

A G CT

T

G

A

U Klf7 i

G

C

CT

U Jundm2 ii

A

A

C

A CC

C

U Jundm2 i

C T

G

T

C T TG

T

T

G

U Isx

G C

Isgf3g pwm secondary

C

U Isl2

A

T

TA

CA

C

C

T

A G C

GGT AG

G C

T

G

T

C

A T TGGT T

A A

G

G

G

T

A T

G

C

GAAAC A

AA C T

T

T

G

C

G C

GGT A T A C

A

G

G

G

AT

T A A AC

T

A T

T

T

T

AG C

C

C

A

A

A

G

U Lbx2

T

CC

AA T T A T

Cell08 Lbx2 3869

G

C

A

U Lef1 ii

Lef1 pwm secondary

A

A

T

U Lhx1

Cell08 Lhx1 2240

T

A

T

CT A

AA T T T

T GC

C

G

C

T

G

A AC

Cell08 Lhx3 3431

A

C T

U Lhx4

Cell08 Lhx4 1719

U Lhx5

Cell08 Lhx5 2279

A

C

T

A

G

G

C

GC T

C

T

CG AC A

TC

G

T

U Lhx6 a

Cell08 Lhx6 2272

59

C

CG

A

T

T

G

CCGT GA A GCA C T T

T

G

AC

A

AT

GGT A

CC

A

C G

G

G

T

T

T A TAG

CC

T

GT A

G

G

AT T TA A T TA A T AA T TA A

T

A

A

T

G

GA

CGCG

C

A

G

A

A

G G

AC TC

CG

U Lhx3

A

AT

C

T

T

G

TC

C

CC

TC

A GT T AC

G C

GGA TC

T A

G A A AC

Cell08 Lhx2 0953

T

T

C

G GT

U Lhx2

G

T

A

AA

C

G

G

T

G

C

AC

G T

C

A

C

A

CA

T

AC GCG

GA

G

C

T

G

A

CG T TA A T TA A T AA T T A TTAATTAA T

G

T

T G AGTG A

C

C

T

AAT

ATC T

GR09 Leu3

A

A

C

U Leu3

C

G

G

Lef1 pwm primary

T

CTTTGATC

C

C TT TG

U Lef1 i

A

GG

A

G C

C

A

T

T

A

C

G G

A

CGGT

T

GGC T

C

T

A

A

A T

GC G

C

TC

AA

A

A

C

GT

U Lhx6 b

AT

T A TAG

TCG

G

CG

G

Cell08 Lhx6 3432

AT

C

CGA T

CTA TAGC A

CG

G T

U Lhx8

Cell08 Lhx8 2247

U Lhx9

Cell08 Lhx9 3492

T AC A

CG G

C

A

T

U Lmx1a

T

A

TT

A AC G

U Lmx1b

Cell08 Lmx1b 3433

U Lys14

GR09 Lys14

C A T

A

T

GT

A

TT A TA

G

GA T GCA C

C

U Mafb i

Mafb pwm primary

C G

C

A

T

AA

GT T A T

T AG T

TA

C G

C

T A

A

A

G

G

A

A

TG

C

C

GT

A

CT

C G

AT

GG

G

C

G C

G

G

Cell08 Lmx1a 2238

T TG C

AA

C CA

G

T AC T

C

C

T

TGGT

G

C

AC

A

A

T

A

A

AT T AA T A T T AA T T AA AAATTCCG TGCTGAC

TT

T

A

T

C

T A TA

A TC

CGC CC G G

CG

T

G

C

G

G

C

T

T

T

G T

C

A

A

C

T

T

AG

T

A GA

T

T

A

C

G

A

G

T

C

GCA AAA

TT

A

TGT T T

GGC

T TA

CCT ACG CG G C

G T

U Mafb ii

Mafb pwm secondary

U Mafk i

Mafk pwm primary

C

AA A

A TT T G

G

T

G

G C

C

T

A

AATT T CA

U Mafk ii

Mafk pwm secondary

60

T

CA

C

G

A

G A

G CCC

T A GG

G

CA T

A

T

A

G

G

G

T TGCT

G

TC

T

C

C

AAAA TGCA

AG

G

CT T

G

TGCTGAC

A TC A

C G

C

C

G

AAA

TT

C

ACAAT G

A

A T

A C G

G C T

U Matalpha2

GR09 Matalpha2

U Max i

Max pwm primary

T

CACGTG

A

C CG G A T

TA

AG

T

T

T

T

Max pwm secondary

A T

ACGCGT

G A

AT

T

T

G

C

U Mbp1

GR09 Mbp1

TT

CC

T GA A

C A

A

C

G

U Mcm1

GT

CA

GG

T

G

T

T

A T AT

GGA AT

CC

T

AG A

C GCG

AA

G

CC G

AT

T

T

C

A CTGTCAA C

G G

C T

Cell08 Meis1 2335

A

T

T TA

G

A

C

C

AT TGTGGCG A GAACA GCG G TA T A

G T

A

C

C

Cell08 Meox1 2310

G

C

G

G

C

T

A

T

T

U Mga1

GR09 Mga1

C G

A

A

A

C

GTC A

TGT C

G

G

G

TA

C

A T

AAT

A T GGC TAT

GG

G

GR09 Mig1

61

C

TT

T

A

GR09 Met32

G

CA T G

C

A

G

U Met32

A T

CC C G

A

G C T

U Mig1

A

A

C

G

G

G T

T

C

T

AT

C

A

GCC

A G

AA T

C

A

G

AT

G

A

U Meox1

T TA

ACC

GR09 Mcm1

G

U Meis1

T

A

C

G

C

G

C

CT CG C

G

A

A

T

G

G

CT

CA G

G C

U Max ii

T

A

T AC

CC G

T

A

GTGA A

T

T

T

A

C G

C C GCA C C GCA A CTGTCA CTGTCA TA A T TA CATA AA T T A CGTGTGCA TA

CG

GT A

TA

T

A

U Mig2

GR09 Mig2

C

GR09 Mig3

A

GT A G A CCC

G

C

GA

U Mig3

T

T

T

AC

CT T

AC

A

G

AC GC

GT

A

T

GT

A

A

T

CC

G

T

U Mrg1

Cell08 Mrg1 2246

T

T

A

A

U Mrg2

Cell08 Mrg2 2302

U Msx1

Cell08 Msx1 3031

C

A

AC

G

G

G T

T

T

G

C

T

T AC

G

C

A

C

C

A CT

U Msx2

Cell08 Msx2 3449

C

T

T

A

GA

U Msx3

Cell08 Msx3 3206

U Mtf1 i

Mtf1 pwm primary

U Mtf1 ii

Mtf1 pwm secondary

T

AACCT

T

A

T

C

TA

T

A

A

A

A

G T

A

AC

C

T

A

G

T

T

T

AAA

A

G

T

A

G G G

T

C

G

C

CCGTT

AA

GG

G

T TA

62

G

TT C T AGC TCCA

CT

Myb pwm primary

C

C

A

A

G C

U Myb i

T

A

GC G G

CT

G

A

T

G

C

C

T

A C

C

G

A

A

G

GA

C

A

CC T T

A

C

GC T G T

GGG T

G

C

G

A

G

A

T

TA

A

A

TT

CT

C

G

G

GC

C

C

C

A

G

CG

T

T

GA A

G

C

A

G

C

T

GT A

G

A

CG T

AACTG C AACCGTT AACTG C CA TG

CC T T

A

U Myb ii

Myb pwm secondary

A

GG T

Mybl1 pwm primary

U Mybl1 ii

Mybl1 pwm secondary

TT

A

A

A T

CAG

C

T

AT

G

A

G

C

AA G C

T

C

T

U Myf6 i

Myf6 pwm primary

T

A

T

T C AC

CAG

AC T A

CG T

U Myf6 ii

G

CC A

C

GT CG

T

CC

T

T

C AC A A A A

GCA

GA T

A

T

TG

A

G

GR09 Ndt80

T

TG

GA

G

TA

T

T A TA TA

G

A

T A TA TA TG

C

ACT

C

TA

C

C

G

CTA

C G

G C

C

U Nhp6a

AA T AGGA T

A

Myf6 pwm secondary

C

U Ndt80

T

G

C

A

G

ACTGACGGG

G

T

C

G

GG

A

C

A

C

T

AT

CG

G

CC

A

T TG

A

A

T

U Mybl1 i

C

C

A

G

G

GR09 Nhp6a

T

AAT

TA

T

TAT ATA

AT A

TA

TA TAT A T AG

C T CC GC GGG CC C GC A

G

U Nhp6b

T

TG

T

C

G

Cell08 Nkx1-1 3856

AC T

Cell08 Nkx1-2 3214

63

T

C

GT

C

U Nkx1-2

C

A

G

AA T A

CCC

G

CA T AC A G A

G

U Nkx1-1

CG

G G T

C C C

GR09 Nhp6b

A

TA

T GGCG

G

AA T TA

A

C T

GGT

CT A G

A

G

C

T

A

C

T A

G

C

C

G

A T

A

CT G T

C

A

CACTTGA

C

A AG G

G

CT

U Nkx2-2

TT

A

Cell08 Nkx2-2 2823

TC

A

U Nkx2-3

Cell08 Nkx2-3 3435

U Nkx2-4

Cell08 Nkx2-4 3074

A

GC

G

CC A

G A

A

Cell08 Nkx2-5 3436

T

GT

T

A

T TT

A CA T

GA C A

U Nkx2-9

G

G

AA

T AC

A TT T

A

G

CG

C

TT

T

G

AA

C

C

C

GA

GC CG

C

T

U Nkx3-1

Cell08 Nkx3-1 2923

A

G A

T

U Nkx3-1 i

Nkx3-1 pwm primary

U Nkx3-1 ii

Nkx3-1 pwm secondary

T

A

TT

C

G

G

G

CTGA

C

G

TA

T

CTC

G

A

G C

ACTTA

CC

A

G C

G

C A ACC

G

A A T A T T AA

GG

A

C

AGTACTT A

G T

Cell08 Nkx2-9 3082

T

A AT

TGACC

GT

C

A

G

G

C

C

GCC A

GC

C

G

A

Cell08 Nkx2-6 3437

A

C

T TA

A

T

G

G

G

G T

G

C

G

CT

A

CA TC

A

G

G T

AC T A

TT

G

T

U Nkx2-6

A

A

CAG

C

G

T CCACT A A CCACT A

G C

U Nkx2-5

CG

C

T T

A

G

CC

GC

G

AG

G

A

GT

CT

AA TA T T

T TG

C TC

CA

A

A T

CTG C

GTAC

A

A

TT

GG

CC

TCG A

CT AA T TAG

TA

AC

AC

G

G G

G

C

U Nkx6-1

Cell08 Nkx6-1 2825

64

AT

AT T T A

CT T

T

A

GG

C T

TG

TC

C A

T

G A

G

A T

C

AT

U Nkx6-3

T AA T T A

TAC

Cell08 Nkx6-3 3446

T

GG

C

C

G

G

A

G

A

T

A

GGTCA GGGTCA AGGGTCC TA CCG GGGATTA GGGAT A GGATTA GGGATTAA T GGAT AA T AAA

GG

T

G

U Nr2f2 i

Nr2f2 pwm primary

G

T

TG

C

C C T

GA

G

G

T

T

U Nr2f2 ii

Nr2f2 pwm secondary

C A T

U Nrg1

GR09 Nrg1

A

AT

G

TA

A

A

T

C

G

T

U Oaf1

GR09 Oaf1

U Obox1

Cell08 Obox1 3970

G

T

T

T

G

GA A

A

G

A

T

T

AC

T

U Obox5 a

Cell08 Obox5 2284

A

A

TA TA A

A

A

A T

U Obox5 b

Cell08 Obox5 3963

T

65

G

G

AC

T

T

T

T TA A

GT

AG

GA T

A

T

G

G

T

AC TA

C

G

G

CC T

C

TGT G

C

C

G

G

A

TC

A

G

T

TA

GT

C

G

ATT

Cell08 Obox3 3439

T

C

A

GAA

C

U Obox3

G

G

T

T

C

Cell08 Obox2 3438

T

C

A

AA T

CC

U Obox2

T

C

G

AAA

G

CC T

A

C

C

G

G

C

G

TT

TT

T

C

AA

C

A

A

C

G

G

G

AA

A

GT

C

U Obox6

A

G

A

T

C

A T

NBT06 Oct-1

A

C

A

CT

T

C

TA

C

Cell08 Og2x 3719

G

T

TA

T

C

GA

C

Osr1 pwm primary

G

CTAC

G A

T

A

T

G

CTA

TAG

CTA

T

C

G

G A

T

C

G A

T

C C

G

Osr1 pwm secondary

C GTAGC

AA

G

C T

G

U Osr2 i

A T

G

CAGTAGC

A

C

U Osr1 ii

T

A

G

G

G

C

T A

TG

AA T T A

TC

G

T A

T

T TGCA A

C

CA

T

G

U Osr1 i

AA T

A

C

C

U Og2x

T

Cell08 Obox6 3440

T

U Oct-1

G

G

CG

G

GGAT A

C A

TC

T G

Osr2 pwm primary

G

C T AC

G

A

G

T

TA

TCGC AT C

C

T

A

T

A

A

T

G T AG

CTA T G A

CC

A

G

U Osr2 ii

Osr2 pwm secondary

A

T T A T TA A

GC

C

U Otp

Cell08 Otp 3496

G

A

C

T

U Otx1

AA

Cell08 Otx1 2325

C

A

Cell08 Otx2 3441

66

AT

TCT A T GA

C

G

G

GGATTAA A

C

U Otx2

T

A C

A

A

G

A

GA C

G T

G G

GGATTA

GG C

T

GC

C

T

GC

C

A

A

TT

G A

CATGCA GGTGCAC TA T A

TG

CA

U PF14

PNAS08 PF14 0633

G

T

G

A

AA GCG TG

C A

U PFF0200c

T

C

PNAS08 PFF0200c

T A

C

A T

G

Cell08 Pax4 3989

T

T

T AC

CGC G

GA

A T

U Pax6

C

A

T

AAT T

G

C

GR09 Pbf1

C

GA TT

TCATCG

C

G

T

G

C

A

CT

C A TG

T TG

CATCG

C

GR09 Pbf2

A

A TG

C

A

A

TGT A

ATCAA

C

CTCCAG T

T AGT

G

G

A

T

G

T

A

A

G A

C T

GR09 Phd1

67

GG A

T

AA

A

CG

A

G T

T

CA

A T

G

G

A

A

C

C

GC

A

T

TGCAT A

G

C

A

T

A G

G

C

G

C

T GC

A

T

GC T C

G

C

G

G

TTC GAA

AAT

G CC

T

TA

C T

C

G

TC G G

AA

C

GG

GT

U Phd1

C

G

G

T

A

GR09 Pdr1

TT

AA

C

TA

G

U Pdr1

A

TGGG

G C

TC

Cell08 Pbx1 3203

A

GT C

C

A

G

U Pbx1

G

AATTA

C

U Pbf2

C

C

TAG

T

AA

U Pbf1

T

A

T A

A

C TCG

Cell08 Pax7 3783

AC

G

Cell08 Pax6 3838

U Pax7

T

G

G

C

T

A

C

G

C

A

G

C

G

U Pax4

T

GA A

TTA

A

C GA T

T G

G

G

C

T

T A A

C

A

G

T A

A C

C

C

GG C

G

G

T

A T

AT T

U Pho2

T AA T TA

A T TA AT

C

G

G

A

CA

C

T

CACGTG

G

C

GR09 Pho4

C

TA A T TA

T AC

G

A

T

Cell08 Phox2a 3947

G

T

G

C GAC

G

U Phox2b

Cell08 Phox2b 3948

G

G

G

T

C T

A

A

GGATTA

G

A

GAA T

Cell08 Pitx1 2312

U Pitx2

Cell08 Pitx2 2274

T

A

T A

T

Cell08 Pitx3 3497

G AT A A CTGTCA CTGTCA C

GG T

G

A

C

T

C G

T AA

T

T

T

T

A

Cell08 Pknox1 2364

A

AC

T

A

A

G

TA

C

Cell08 Pknox2 3077

T

A

A

C

A

GC

C

G

G C A GT

G

AC

GG

C

GG C

A

A

C

CG

G

U Pknox2

TG

C

G

G

GCA

T

TG

C

C T CA

U Pknox1

C

TT

C

A

C

U Pitx3

A

GGATTAA

G A

A

T

A

A

U Plagl1 i

Plagl1 pwm primary

68

GC

T

A

T

TA

CC G G

AT

CA

TC

G

G G

GG CG CC

C

T

TGA

C

TC

U Pitx1

T

G

T

T AA T T AA

AAC

C T

T

GAC

C

C

G

T

TA

G

A

U Phox2a

C

G

T

G

TC

GR09 Pho2

CA

U Pho4

G

TG GA

C

G

C

A

T

C

AT T

A

G

GGGGTAC

G

G

C

AA TT

C

U Plagl1 ii

Plagl1 pwm secondary

U Pou1f1

Cell08 Pou1f1 3818

U Pou2f1

Cell08 Pou2f1 3081

T

AA

A

C

G

G

T

A

T

A

Cell08 Pou2f3 3986

U Pou3f1

Cell08 Pou3f1 3819

T

A

G

GT

A

TGAC C

C

TA

A

GTGGA T

C

T

T

CG

C

G

A

ATGCAAAT ATGCAAAT TT

C

C

TT

G

T

AT

T

G

U Pou2f3

T

T

G C

C G

C

C

Cell08 Pou2f2 3748

A

T AA T T A

CA

U Pou2f2

T

T

T T AA T T A

GC T A T C

A GA T

C

A

AA

GC

T

T

T

T

T

T TA A T TA A

A

G

T

A

G

A

G C

G

C

AA T

U Pou3f2

Cell08 Pou3f2 2824

A T

A

C

T

G

G

U Pou3f3

Cell08 Pou3f3 3235

U Pou3f4

Cell08 Pou3f4 3773

TA

A

C

ATT

G G

T TA

G A

Cell08 Pou4f3 2791

69

A

A

T

TT

A

A

G

T

A

A

A

T

G

G

C

C

T AA T T A

AT

GT

T

G A

AC

G C

C C

G

C

T

A

C

AT A T A

CT

U Pou4f3

A

A

C

A

TGCATA

AA T

GC C

C

T

AC C T

G G

T

T

C

A

CT

G

A

C

G

C

A

T

G G

T

T AA T T A

C

TC

GT

G C

T

GCG

A TC A A T TGA TC

C

T

T

A

GTC

C T

T TC A

G A

A

C A

TAAT AG TAATGAG T TA A T AA TA T A TA T A GT

A T

C

U Pou6f1 a

Cell08 Pou6f1 1731

A

T

A

A

U Pou6f1 b

Cell08 Pou6f1 3733

U Prop1

Cell08 Prop1 3949

T

C

G

TC

A

T

U Put3

GR09 Put3

A

T

TC

T

AG

T

C

A

A

G

NBT06 Rap1

U Rara i

Rara pwm primary

C

GCAA C

T

T

T

AG A

A

G

C

T

T

A

C

C

T

C

CT

C

TA

G

C

C

AG

A

TA

G

T

C T

GA

T

G

GGGTCA

CG TC

T

A

C

A

TG

CC

GG

TTA A

T

C

G

TA

70

G

T

G TCA

A

Rara pwm secondary

T

A A AG T

GG

T

G

GCT

CC C GT T G

AAA G

G

U Rara ii

CCG

A

GG

A

G

C

C

A

CC ACACCC

TA CG T

T A AC T TA

G

U Rap1 b

C

GGTGTAC

G GR09 Rap1

G

TC GG

A T

GG CA

T

T

C

AC

G

GA

A

T

U Rap1 a

A

GC

G A

C

T

G

T

T

G

C

C

A G

C

T

C A

A

T GC

G

C

C

A

G

Cell08 Prrx2 3072

T

T

G

A

C

G

U Prrx2

C

A

T

AC

A GT

Cell08 Prrx1 3442

A

G

GT

U Prrx1

T

G

A CC

TT

G

C

C

T

A

T

T

C

T

G A CC

T

T

TT

A

G

T

G

GT

C

AT GCGGA A

TG A CC T

C

A

G

G

AA

G C

U Rax

Cell08 Rax 3443

C T

T AG

C

TC

GA T

A

T

G

T

C

G

U Rdr1

GR09 Rdr1

U Rds1

GR09 Rds1

G

TA

A

TC

A

C

G

C

G

G C

G

C

C

T

C GC G A CCGA C TAGCAAC

TG CA

G

C

A

C

G

CG

G

G

G AT

A

C

C

C

G

C

A

C

A

A T

GGT T TC

T

C

C G C

T

G

G

C

G

U Rds2

GR09 Rds2

U Rfx3 i

Rfx3 pwm primary

T

C A

CG C

G

A

T

CT

C G

G

T

Rfx3 pwm secondary

A

C TGGATT C GA A

T CC

A

G

C

T

T

A

A GT

C

G

G

A

G

G

G A AG

C

A TG

C T

Rfx4 pwm primary

TG

C TAGCAAC

C

U Rfx4 i

A

C

A

A

T

T

T

T

TG

GC A

U Rfx3 ii

G

G

G

C

C

T

A

A T

C T GTTAC A

G A

A

G C T

G

A

A

U Rfx4 ii

Rfx4 pwm secondary

U Rfxdc2 i

Rfxdc2 pwm primary

G

C TAGCA G

CG

A

T

C

G

A

Rfxdc2 pwm secondary

71

T

AC

G

A

T

A

G AC T

C

C

G

C

G

TGA ATGT

C

TT A

U Rfxdc2 ii

T

G

C

A

T

G

C

T

G

A

A

CGGA AA AA

C T

A

T

U Rgt1

GR09 Rgt1

T

G

T

T

T

Cell08 Rhox11 2205

A

T

A

C

T

G

A CC A

C

A T

C

T

GR09 Rpn4

T GC A

A

A

TA

T

AC

GT

A C

T

T

A

GR09 Rtg3

T

A T

C

C

T

C

G

C

T

A

CACGTG A

G

G

TA

G

AT

C

T

T

C

A

A

T A

G

G T

A

C

C

TCG

A TGG A A

C

T G

C

CA

C

T

T

G

TC

A

T

T

TA

CG

C

G

T

A

C

TGACCCC

CA

U Rxra i

C

G C

A

CGG

CT

G

AT

G C

T

C

GA

G C

A

GGA

A

CGCG

CG

GG GC TA

CC

G

T

T A AG

G

U Rtg3

G

TA

A

CGCGCG

CG GR09 Rsc30

T

T

A

GR09 Rsc3

U Rsc30

A

A GGG

G

C

G

T

T

T

T

C

C G

G

GC A

C

T

CA

U Rsc3

AT

GC

GT

CGCCACCC

C

A

C

C

G T

T

CCGG

C

C

A

TT

TT G

AA T AA

G G C

U Rpn4

C

G

C

T AA T TA

A

A

TT

C

G

T

G

Cell08 Rhox6 4251

GR09 Rph1

T

A

AC

G

U Rph1

T

G

T

AA G

U Rhox6

T

A

A

GCTGTAA

C

U Rhox11 b

A

AC

A

G

C

G

Cell08 Rhox11 1765

TT

TC

GCTGT A

AA

A

U Rhox11 a

G

C

A G C

T

CT

G

G

C

C

G

Rxra pwm primary

72

A

TT

G

GA

C

T

U Rxra ii

C

A

TA

GA G

A

A

G

GR09 Sfl1

A

A

G T

A

A

TG

C

G

A

T

T

GCT G

C

G

A

AG A

G

GR09 Sfp1

T

Sfpi1 pwm primary

T AA T T A

G T

GGAA T G

A

T

C

T

C

A

A

G

GGAA

TC AA TC

C A

C

G CC

C

C

C

C

GT T

A

G

A

G T

C G

Sfpi1 pwm secondary

T

TAATTA

A

C

C

Cell08 Shox2 2641

T

T

C T

G

G

G C TA

A

C

C

G

T

A

T

G

T

A

TCCGGA

C C T A G

C

GR09 Sip4

U Six1

Cell08 Six1 0935

U Six2

Cell08 Six2 2307

C

A

T

T

A

A TGA T GA

C

A

C

G

A

T

TA

TA

T

C

Cell08 Six3 1732

73

T

GG

T

C

C

A

GC T

C GG

T C

ATCA GGGTATCA GGGTATCA

GGT

G

G A AC

C

C

T

A

GG

A

GG

A

GG

GC AC T

G A

U Six3

T

ACCA TC

CA G C

C

T T AG

U Sip4

A T

C

G

C

AA A T T T T

C T GG

AAGA

U Shox2

A T

G

C

A

GGCCA

U Sfpi1 ii

T

GT C T A T TC CT

C

TG

C

T

U Sfpi1 i

G C

A

AA A

GCC T G

U Sfp1

C

CA

Rxra pwm secondary

C

U Sfl1

C

C

T

C

C

G

T

AG T

G G A T TATGTA

C

AGA

GA T

GT T

C A

T

T A C

T

G A

C

C

GT

C T

A

G

C

G

C

A

T

T

A

T

A

C G

T A

G C

T

G A C

T

G C A

A T

TGACACC T

C C

T

C

A

G

A

G

U Six4

Cell08 Six4 2860

A T

U Six6

Cell08 Six6 2267

U Six6 i

Six6 pwm primary

T

GGTA CA

G

T

A

A

CC

GG

T

A

A

T

T TG C

C

CA

A

TATCA

C

TG

U Six6 ii

CA

T

G

U Skn7

GR09 Skn7

U Smad3 i

Smad3 pwm primary

G

C

C

A

T

C

C

GCC

TG

T A GC

GA T

G

C

C

T

CC A

C

T

G

G

C

T

Six6 pwm secondary

G A

TA

CG

G C CAC

C

T

T

AT A T

G

G

C A

G

GG

TA

T

A

A

TT

CC

A

GGG

C

T

T

A

A

A

TC

A

A T GC T T A C GT G

AG

A

C

GT

G

G

C

A

AGAC

CC

A

T TC

G

G

C

G

A

T

CGCC

CC

C TG T

A

C

C

GAGATAT

A

U Smad3 ii

Smad3 pwm secondary

A CC

T

T T AC

GG C

U Smp1

A

A

C

G

T

G

T

T

AT

A TAAA

CC T AGG

T AT A T

G

G

G C

G

C

AT

C

G

A

A T

G T

T

C

C

C

G

ACAAAG

G

T AAA

AG

TCT

Sox11 pwm primary

TT

G

C

74

T

AAT

A

G C

C

T

Sox11 pwm secondary

G

C

C

AA GT

G

U Sox11 ii

T

G

GR09 Smp1

G

U Sox11 i

G

T

AT

G

C A

G

TA T

C A

C

G

U Sox12 i

Sox12 pwm primary

ATTGTT T

A

T

A

A CCC

A

AA

T

C

A

G

G

A AAA

G

G

AG C

A

G

C GCCT TGTC A AG GC TT G

T

C

U Sox12 ii

C C C TG T T

G

T

Sox12 pwm secondary

A T

A AAT

AC

GG

T GA

T

C

C

U Sox13 i

A

T

TT

GA

Sox13 pwm primary

A

TA

T

T T TA

A AC C

G

CG

A

G

C

G

C

GGGTGGG

T T TA ACT T

C A

A

U Sox13 ii

T

C

AC

C

G

C

Sox13 pwm secondary

A

ATTATAAT AGCTT

T TA

C

CG

G

U Sox14 i

Sox14 pwm primary

U Sox14 ii

Sox14 pwm secondary

ACAAT

AC

G G A

A

GG

C C

T

G

T

G

A

T

C

ACAAT

A

TT

AC

Sox15 pwm primary

AA

T

T

A

G

A

T ACGGC

A

G

G

T T AC T T T A A G G

C C T

A

G

U Sox15 ii

Sox15 pwm secondary

U Sox17 i

Sox17 pwm primary

C C

A

TT

C

G

AAC

AG

GT TC

T

A

C

TG ATT

GCC C

T

A

GG TT

A

G

C

CA TG

U Sox15 i

A

A TCT A A

C

C G

AA T

GT

T

T

A

T A A TA

GC G G

A

C

C

C

G

ATTCA

C

T

AG T TCAG C C

G

G G

C A T

G A

U Sox17 ii

Sox17 pwm secondary

75

C G

TA

A

ATTGTT

C TT

U Sox18 i

Sox18 pwm primary

U Sox18 ii

Sox18 pwm secondary

U Sox1 i

Sox1 pwm primary

T

AAA

A

C

A AC

C

C

G

G

T

AT

TGA

G

TCA

CGTG CACG TA T GTG

A

AC

C

C G

ATT A T C

A

A

T TAAGT TA

C

CC

C C G C A G

T A A

G

G G

C

T

TTGTT

A

CTGCC CGGC A ACTA

U Sox1 ii

Sox1 pwm secondary

CG

A

C

G

G G G C

T

A

T

ATTATAAT TA

T AGCT TAA TA

G

C

C G G

C

G

U Sox21 i

Sox21 pwm primary

U Sox21 ii

Sox21 pwm secondary

G C

C

ATTGT

A

A

C A C

G

G T

C

T

U Sox30 i

Sox30 pwm primary

U Sox30 ii

Sox30 pwm secondary

G

ACAAT

A

G

T

C

A

T

C

TA

AT AT AT

CACCGGTG GT ACC

TG

A

GC TG

CA

G

G

C

ACAAAG

A

G

TAA

G

C

Sox4 pwm primary

C

T

ATTGT

A

G

G

Sox4 pwm secondary

76

G

T

AAC

C A

TT

U Sox4 ii

C A

G

G

T

G

U Sox4 i

G

G

A TA

T

TA

CC

TC

G CAC GT

T TC

CG

C

A

C

G

A AAT

AC

GG A T

T C

C

U Sox5 i

Sox5 pwm primary

C

T

A

TAATT

A

TAGCTAC

C

G

GT AGCGA C

U Sox5 ii

Sox5 pwm secondary

U Sox7 i

Sox7 pwm primary

A

ACA T

AG

G

T TT

C

GT

A

G A T GC

U Sox7 ii

T

C

G

C

GC G

T

AA T

A

T

A

AA

C

T

T

G

G

G

TG

CC G

C

C A

ATTGT

T

T

U Sox8 i

Sox8 pwm primary

U Sox8 ii

Sox8 pwm secondary

AA

ATTCA T

G

A

C

T

C

A

T

T

G

C

A

G CG

C T

A

AA

C

AA

G

T

G

C

C

G

CGCCCCC

77

T

GG G

ATC A

G

A C TG C

C

Sp4 pwm secondary

A T

A G

A TT

G G

U Sp4 ii

T

A

Sp4 pwm primary

C

T

G

G

C C A T

TA

CC

G

TG C AC

C

G

G

T

T

G

TTT

A ACA A

Sp100 pwm secondary

G

C T

C

Sp100 pwm primary

T

T

T

G

CT

C

C

G

A

C

A

C

A

CG A

T

AAA

G G

A

G

TG

TGGA

A

C G C G

T

C

CTCAC

C

U Sp4 i

C

A

G

A

A G

U Sp100 ii

T

C

Sox7 pwm secondary

C

U Sp100 i

A T

T

T

C

CC G

T

A

A

T

A T T GT

CTCT A

TA

CG A

GA

AT

T

G

A

G

G

C

T

AG

T

A

C T

GA

T

T

G

GT

T

C

T TA

C C

C

C

G

A

A

GCT T

G

T

TCCGG

G

TT

C

U Spdef i

Spdef pwm primary

A

A

C

T

T

G

U Spdef ii

Spdef pwm secondary

U Spt15

GR09 Spt15

A T

G

CC

C

AA

A T A T A TA

T

CGG

T

GT T TA

C

A TG

G

AT

A

C

G

TCCTA

A TG

G

C

G C

C

C

GTG C A

T

A

T

A

GR09 Srd1

A

G

C

A

T

C G

A T

TA AA T A T A

TT

G

U Srf ii

Srf pwm secondary

U Sry i

Sry pwm primary

U Sry ii

Sry pwm secondary

G C

C

G

AAA

T

C

A

CC TATA GG Srf pwm primary

G

C

GATCTAC A

U Srf i

T

TA

G

A

G

U Srd1

T

AA

TC

A

AG

C

A

C

T

T

A

G

C

AAAA

A

AGCCCTCCC

GT

GGTCT T T

T

G G G

CT T

C

ATTATAAT T

AAG CT T

A

ACAAT

A

GC

T G AGTG

A

C

G C

G

C

C

G AT T

C

G T

U Stb3

C

A

A

G

A

AAT

AAT

C

T TCT T

C

T

A

GGG

GR09 Stb3

T T T TCA

AA

G

CC

C

G

A

G

GG

CT

TG

AA

G

C

G

GC CC CTA

GA A

C T

U Stp2

GR09 Stp2

78

T

T

A

GG

T

G T

C

AC G

GTGAC

C

A

A

A

GT

T

A

U Stp4

T

C

T

G

A

A

C

G CC

T

A

GGCTG A

CC

GT

A

G

A

T

G

GR09 Stp4

AAATTAATTT T

T AT AA

A

AACTCCGA

GT T

GR09 Sut2

T

A

GR09 Tbf1

T

A

G

TA

T A TA T

A

C

CT C T

T

C G

Tbp pwm primary

GT

A

G

TT G

A AAAC CT

G

C

G

C

A T

T C T GT

G

C

G

A

G

C

C

A

A

A

G

Tbp pwm secondary

CG ATC G

C

G

C A T

GR09 Tbs1

TA

U Tcf1 i

Tcf1 pwm primary

T

T

G A

C A

T

T

T

T

A

A

CG A

C

TG T

G

TA A

C

G

GG

A

T

AG

C

C

T

GATTA

G

G

T

T

G

T

C A

C G

C

C

C

A

CA

C

A

G

G

G

AA A

GT C

A

C

T

A

G

A

G

T

G

79

A

A

C

Tcf1 pwm secondary

C

C

A

GT C

T

G

G

T A AC T T A AC

GT

T

Cell08 Tcf1 2666

A

G C

G

U Tcf1

T

CG

T

C

U Tcf1 ii

G

G

C

A TA TA TA TA

CGGA T

U Tbs1

T

C

T

A

C

U Tbp ii

G

AACCCTAA

G

A

G

T

C

C

U Tbp i

GA

G

T

CT

U Tbf1

T A

C

GR09 Sum1

C

U Sut2

T

G

G

U Sum1

A

CC

GGGT

T

T ACG

C A

C

C

TAAC

GT TA A T

A

CC

U Tcf2

Cell08 Tcf2 0913

T

T

Tcf3 pwm primary

U Tcf3 ii

Tcf3 pwm secondary

T

C

G

G

TA

A

AGG

TCAC C

G

TG

T

T

T

AAA

AC

G

T

C

C

T

T

TG T

G

T AAA

AC G

G AC T

G C

Tcf7 pwm secondary

T

T CTG C C G C

C

U Tcf7 ii

T

T

G

A

Tcf7 pwm primary

C

C

T AAA

AAT

T

G C

C

C

G

AA AA T

G

U Tcf7 i

G

CT A

AC G

G AC

G

U Tcf3 i

T

T

A

G

AT

T

G

T

A AGG A

CCC

G

T

T

T

TATTA AAA

AT

AGG

TCC

G CCAC

CT T

CG G CT

CT TGAT

TC

TT CG

C

G

G

A

A

U Tcf7l2 i

T

G G

Tcf7l2 pwm primary

A T

CT

C

AT

T

AGA

TCAAT

Tcf7l2 pwm secondary

A CG

T

T

U Tcfap2a ii

Tcfap2a pwm secondary

C

G

CC G

T AA

T

C G

A

C

T

G

G

AT

TA

T

G

TC

G

CC GG

G CT AGA CA

C

C

AA

80

AT

C C

A

T

A

G

GC

G

A

Tcfap2b pwm primary

C

C

T

GT T

T

G

TGA

TCCGA

C

U Tcfap2b i

T

CC GG

G

Tcfap2a pwm primary

T

A

C

A

C U Tcfap2a i

T

G AGTG A

C

C

U Tcf7l2 ii

C

A

GT

T

CG GC

G

C

A

G

C

T

G T

C

TT

GCC

T

AC

U Tcfap2b ii

GGC

C GAA

AA GAC AG T

C

TC T

TG

T

T

TA

A

AG TT

G

C G

Tcfap2b pwm secondary

C GG

G

A T

T

A

G

U Tcfap2c i

Tcfap2c pwm primary

U Tcfap2c ii

Tcfap2c pwm secondary

C

TGA

C

C T

C

G

TCCGA GTC

A

GT

AC T

G

GCC GGC

A

CAAG

AA TT

C

A

G

Tcfap2e pwm primary

U Tcfap2e ii

Tcfap2e pwm secondary

U Tcfe2a i

Tcfe2a pwm primary

G

A TTT T

C

AGG

C

CC GG

G

C

TGA

CCG

C

U Tcfap2e i

T T TA

CT AG

G

G

T

C

CA

T

A

G

GT AC

G

T

AAAAAA

GT TGGTC AGGCCCT

A

G

C

T

T

CT TG G C

CAGGTG CAGATG T CG A T A A C ACATTC TGACAG A

G

C

CC

C

T

G

C

A

A

GT

TG

A

CC

G

T

A

C

U Tcfe2a ii

Tcfe2a pwm secondary

T GT

A

C

C

A

G

G

A

G

C

C

C TG C

U Tea1

GR09 Tea1

T

AT

GA

U Tec1

GR09 Tec1

A

A

A

A T T

TG

A

G

G

C

U Tgif1

Cell08 Tgif1 2342

81

C

C

T

T AT

C

G

C T

C

G

G

TT

CC

G

C

G

GT

C

G

A TC

G

A

C T

TC

A

A

A

A

A T

CT

T

GG

A

G

T

C

A

A

T

T

G

T

Cell08 Tgif2 3451

A T

C

U Titf1

Cell08 Titf1 1722

U Tlx2

Cell08 Tlx2 3498

T A

A

TT

ATA T

CC

G

A

U Ume6

GR09 Ume6

U Uncx4

Cell08 Uncx4

GCA

G

C

G

T

A

CAT T

C

C

C

T

CG

T

U Vax2

Cell08 Vax2 3500

C

A AC

GGC

AC

GT

T A

T

C

A

TA A T TA

CC

T

U Vsx1

Cell08 Vsx1 1728

82

T

T

T

C

G

A

C

T

G

G

G

T

C

A

C

G

A

T

C

GT

A

C

G

A T

C

GT T CG T

G

CA

A

A

A

GG

T T GCC

A

C

C

G A

G

G

TA T A

G CC

AA

GC AC

G

T

A

C

C

C

GG

C G

CG

C

T

T

TA AT A

CC

TG

T

ATG A

G

C

TC

GT

A

C

G

C

G

G

A

AGG

AC

Cell08 Vax1 3499

T

C

T

A

T

T

G

U Vax1

G

GC A

C

TGA

C

GR09 Usv1

TT

AA

G C

G

C

T A

AC

G

U Usv1

T

A

C

G

G

T

GGC C

GC G A T T A T TA A CCCCTGAA A

C

T

C

C

C GT

C

T

T

T

A

T A TG

CA G CT

GA

G

GR09 Tye7

A

TT

AA

G G

CG

T

A

G

G

TTA

A

C

GTCC

AA ATA

G

C

C

C

A

CA TC C

C

G G

CACGTGA

T

G

C

U Tye7

C

A

C

G

A

C

TT GA C T

C T GA

A

CT T A

CCA

GGG C

A

G

T

TA

C

GT

C

C

G

U Tgif2

A CTGTCAA G

A

AT C

A

GT A

G

G

T

T

C A

G

C G T

T

TCGA

CT

C

C

G

A

U Xbp1

GR09 Xbp1

GA

A

T

G

G

T A

AC

GG

G

C

A

C A

T

TTACGTAA

G

C A

TA

G C

C

G

U Yap1

GR09 Yap1

G

GCA

TC T T AG T C

C

A

A

A

C

G

U Yap6

T

G

C

A

T

GT

T

C

AT

G

GR09 Ybr239c

G

T A T CA

T

G C

A

A

U Ydr520c

GR09 Ydr520c

U Yer130c

GR09 Yer130c

G

AAT

G

T GT

C

A

C

GR09 Ygr067c

A

U Yll054c

GR09 Yll054c

A

G

A

G

G

G A

T

T

G

83

T

G T

C

T

A

A T

C

CCC C

G

C

A GA

AC

CG

C

C

GT

A

GR09 Yml081w

AA

G

A

T

T

C

CGGA

C

A

T

T

C C

A

T

T C

A

G

A

C

C C C

CG

TC

CT

G

GA

C

GGT A T

T

GR09 Ykl222c

TA

GT

GT CT G

A

A

GC

U Ykl222c

T

GCC

C GA

T

A

A

G

C

C

A

AG

C

A

TA

C T

C

C CA T AG T

TGG

C

T AG

A

GGGT GC A

A

CGGAGATA ATAGGGG GGG A CGGA AT

T

C

U Yml081w

A

CCG

CGGT

TTCCGGAAC

A G

U Ygr067c

C TA

G

A

GR09 Yap6

C

U Ybr239c

T

A

TTA GT A

G C

C

A

C

G

AT

A T

T

C

TCCGA TA T A GTAAATC C GT A A A TC

ATC TA

G

A T

G

U Ynr063w

GR09 Ynr063w

T

AC T

U Yox1

A

C

G C

T

T

G

GR09 Yox1

TA

A

GR09 Ypr013c

T

T

T

A

GR09 Ypr015c

A

GR09 Yrm1

A

A

A

C

C

G

G

C A G T

T

C A

C

T

T

T

A

CC T

G

CA T

C

G

A

G T

T

CT

A

G

A T

C

GG

C

C

TC

A

G

T

GT

C G T

A

G

CG T A AC

GA

G

CC A

T

CGGA ATAA A

G A

G

GT

CT

GR09 Yrr1

C

T

C G

T

C

A

G

GTTCTAGA

G T

U Zbtb12 i

Zbtb12 pwm primary

U Zbtb12 ii

Zbtb12 pwm secondary

CA

T

TCGA

T

C

A

C

A

TA

C T

CA

T

C

T

A

G G

G

T

A

C

GG

T

84

TC T

G

CACTGCA A

Zbtb3 pwm primary

A

TAGAAC

GGAGG

T

C

C

CA A A

C

U Zbtb3 i

CG

G

CA A

CG A A A T A C

A

A A T CG

C

T TA TA A GA T GG G C

G

T CCG

AAT T

G

T

A

T

C

C

CAT

G

G T

G

G

G

G T

A

U Yrr1

A

TC

T

T

U Yrm1

T

GA

C

C

A A

G

GR09 Ypr196w

C

T A

G

C

C

GG

C

U Ypr196w

A

T GA

G

G

TC

U Ypr015c

C

C

A

GA C

U Ypr013c

T

G

C

GC

G

T

T

C

ACTG

A

G

T AGAA T TCG CA A A T TG

C

C

A

U Zbtb3 ii

CCT T

CT TG

G C

G

C TGA A

G T

G

C

Zbtb3 pwm secondary

CCCCC GACCACC

C

AA

G A

A

TA T TA A TCCT

G

C

T

G

G

U Zbtb7b i

Zbtb7b pwm primary

U Zbtb7b ii

Zbtb7b pwm secondary

A

AC

C

GTC

G T

A

G

AC

C

TT T T TC A

T

G

G

Zfp105 pwm primary

T

G

T

C

Zfp105 pwm secondary

U Zfp128 i

Zfp128 pwm primary

A

A

CC G G

T

C CGGGG

A

G

U Zfp105 ii

G

TC A T A A

GC

CT

AGT

TC GTC

G C

U Zfp105 i

A

T

T

A

A

T

T

A

C

G

C

TAC

GGCG

T TC

G

C G

C

T

T

GT A A

T

A AA AA

CC A

T

T

CC GT A

G A

TA T A T A TA

G T

AGAGT

A

U Zfp128 ii

T

AGAG

C G

C

G

Zfp128 pwm secondary

A T

GCGCGC

G AC

G

U Zfp161 i

Zfp161 pwm primary

U Zfp161 ii

Zfp161 pwm secondary

A

C

T

A

AT

TTT

TA

CG A

A

A

G

T

C

G

GT T

G

T

C

G

AA TT

CG

ATGTAC AAT

AG C C

G

85

TA T

G

T

A

T

A A ACA

Zfp187 pwm primary

A

CGCGCA GC

GC C

T

C

G

G

TT A

C

U Zfp187 i

A

A

C

C

CT

CT

CTC TC G G

C

T

C

GT

A

G

T

G A

A

CCT GTCC

G C AA T

C

U Zfp187 ii

T

Zfp187 pwm secondary

TT

A

A

C

C

Zfp281 pwm primary

U Zfp281 ii

Zfp281 pwm secondary

A

A

GA

G T

A

A T

A

A

T

T

A

A

T

AT

TA A

CT G C

G

C

G

TGGGATG

G

A

G

A

T

A

C

G C

T

T

T

G

G

Zfp410 pwm primary

C

A

CCC C

T A

C

AC

U Zfp410 i

A

A A TG T

G

T AG

T

C

T

CC

A

G A TT T

GT

A

CCCCC

AT

GA

T

A

C

T

G TA TAT A

GT

G

U Zfp281 i

CA

G

TT

C

C

CC

C AA T

CT A

G

CCGCCCC TT

TA

G AG T

AG G T C G G A ATA

U Zfp410 ii

Zfp410 pwm secondary

AGTGCTC

A

C

T AG

U Zfp691 i

Zfp691 pwm primary

A

A

A

G

T

AT

G C

C

U Zfp691 ii

GACTCC

T

T TGG

G

T

C

G

G

Zfp691 pwm secondary

TA

CCCCC

C

CA

C

A

G AG

Zfp740 pwm primary

U Zfp740 ii

Zfp740 pwm secondary

AA

TT

T

T AC A A C

CCC

CC

A TT

GGG T

A

AC

A

A

TT

U Zfp740 i

AC

G G

A

C

C C

G

T

A

T

T

CG

T

A

G

A G GTC A CGG T AT TT

T

G

G

G A

CCCC GGGGG CG

C

G

A A AGGCCT T T TC TT

T

G

G

G

U Zic1 i

Zic1 pwm primary

86

T

A

ATA T

C A

A

C

AC

T

AA

C

C CAGCAGG

A T

A

G

Zic1 pwm secondary

TC

T

TT

U Zic2 i

T

G

GG

T CA AA T A

A

AT

C

C

A

C

A

C

Zic2 pwm primary

C CAGCAGG

A T

A

G

T

G

U Zic2 ii

T

T TC AGGTC T T A

G

GT

G G

TGT

CCC GGGG

CC

CAA A

TA

A

G

G

U Zic1 ii

Zic2 pwm secondary

TA

C

T

TG T

T

CCCCCGGGGGG

G

A A AGGTCT T T TC T A A AA T TT G

GT

U Zic3 i

A

A

A

TA

G

A

G T

G

T

Zic3 pwm secondary

A

TA

C

C

G

A GA

A

T

T

C

C T

G A

T

A

CT

T

CA

A

C

TGTG

C

A

G

G

GCACA AA C

AA

TGAA

C

C

Zscan4 pwm secondary

T

GTGCACA

T

T

A TG

T

A

G C

U Zscan4 ii

T

CCCC

ACA

G

NBT06 Zif268

Zscan4 pwm primary

TGT

CG CA G A

CT T

T

G C

U Zscan4 i

CA

C CAGCA G

A

U Zif268

C

T AC

Zic3 pwm primary

G

U Zic3 ii

C

T

T

G

A

T TC T C

A

G

G

References [1] Chen, X., Xu, H., Yuan, P., Fang, F., Huss, M., Vega, V. B., Wong, E., Orlov, Y. L., Zhang, W., Jiang, J., Loh, Y.-H., Yeo, H. C., Yeo, Z. X., Narang, V., Govindarajan, K. R., Leong, B., Shahab, A., Ruan, Y., Bourque, G., Sung, W.-K., Clarke, N. D., Wei, C.-L., and Ng, H.-H. Integration of external signaling pathways with the core transcriptional network in embryonic stem cells. Cell 133(6), 1106–1117 (2008).

87

[2] Cheng, Y., Wu, W., Kumar, S. A., Yu, D., Deng, W., Tripic, T., King, D. C., Chen, K.-B., Zhang, Y., Drautz, D., Giardine, B., Schuster, S. C., Miller, W., Chiaromonte, F., Zhang, Y., Blobel, G. A., Weiss, M. J., and Hardison, R. C. Erythroid GATA1 function revealed by genome-wide analysis of transcription factor occupancy, histone modifications, and mRNA expression. Genome Res 19(12), 2172–2184 (2009). [3] ENCODE Project Consortium. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature 447(7146), 799–816 (2007). [4] Johnson, R., Teh, C. H.-L., Kunarso, G., Wong, K. Y., Srinivasan, G., Cooper, M. L., Volta, M., Chan, S. S.-L., Lipovich, L., Pollard, S. M., Karuturi, R. K. M., Wei, C.-l., Buckley, N. J., and Stanton, L. W. REST Regulates Distinct Transcriptional Networks in Embryonic and Neural Stem Cells. Plos Biology 6(10), 2205–2219 (2008). [5] Robertson, A. G., Bilenky, M., Tam, A., Zhao, Y., Zeng, T., Thiessen, N., Cezard, T., Fejes, A. P., Wederell, E. D., Cullum, R., Euskirchen, G., Krzywinski, M., Birol, I., Snyder, M., Hoodless, P. A., Hirst, M., Marra, M. A., and Jones, S. J. M. Genome-wide relationship between histone h3 lysine 4 mono- and tri-methylation and transcription factor binding. Genome Res 18(12), 1906–1917 (2008). [6] Rozowsky, J., Euskirchen, G., Auerbach, R. K., Zhang, Z. D., Gibson, T., Bjornson, R., Carriero, N., Snyder, M., and Gerstein, M. B. PeakSeq enables systematic scoring of ChIP-seq experiments relative to controls. Nature Biotechnology 27(1), 66–75 (2009). [7] Seo, Y.-K., Chong, H. K., Infante, A. M., Im, S.-S., Xie, X., and Osborne, T. F. Genome-wide analysis of srebp-1 binding in mouse liver chromatin reveals a preference for promoter proximal binding to a new motif. Proc Natl Acad Sci U S A 106(33), 13765–13769 (2009). [8] Valouev, A., Johnson, D. S., Sundquist, A., Medina, C., Anton, E., Batzoglou, S., Myers, R. M., and Sidow, A. Genome-wide analysis of transcription factor binding sites based on ChIP-Seq data. Nat Methods 5(9), 829–834 (2008). [9] Wilson, N. K., Miranda-Saavedra, D., Kinston, S., Bonadies, N., Foster, S. D., Calero-Nieto, F., Dawson, M. A., Donaldson, I. J., Dumon, S., Frampton, J., Janky, R., Sun, X.-H., Teichmann, S. A., Bannister, A. J., and Goettgens, B. The transcriptional program controlled by the stem cell leukemia gene Scl/Tal1 during early embryonic hematopoietic development. Blood 113(22), 5456–5465 (2009). [10] Williams, D., Cai, M., and Clore, G. Molecular basis for synergistic transcriptional activation by Oct1 and Sox2 revealed from the solution structure of the 42-kDa Oct1 center dot Sox2 center dot Hoxb1-DNA ternary transcription factor complex. Journal of Biological Chemistry 279(2), 1449–1457 (2004). [11] Mo, Y., Ho, W., Johnston, K., and Marmorstein, R. Crystal structure of a ternary SAP1/SRF/c-Fos SIRE DNA complex. Journal of Molecular Biology 314(3), 495–506 (2001). [12] Muhlethaler-Mottet, A., Di Berardino, W., Otten, L., and Mach, B. Activation of the MHC class ii transactivator CIITA by interferon-gamma requires cooperative interaction between Stat1 and USF-1. Immunity 8, 157–166 (1998). [13] Wei, P., Taniguchi, S., Sakai, Y., Imamura, M., Inoguchi, T., Nawata, H., Oda, S., Nakabeppu, Y., Nishimura, J., and Ikuyama, S. Expression of adipose differentiation-related protein

88

(ADRP) is conjointly regulated by PU.1 and AP-1 in macrophages. Journal of Biochemistry 138(4), 399–412 (2005). [14] Wadman, I. A., Osada, H., Grtz, G. G., Agulnick, A. D., Westphal, H., Forster, A., and Rabbitts, T. H. The LIM-only protein Lmo2 is a bridging molecule assembling an erythroid, DNA-binding complex which includes the TAL1, E47, GATA-1 and Ldb1/NLI proteins. EMBO J 16(11), 3145–3157 (1997). [15] Cai, D. H., Wang, D., Keefer, J., Yeamans, C., Hensley, K., and Friedman, A. D. C/EBP alpha: AP-1 leucine zipper heterodimers bind novel DNA elements, activate the PU.1 promoter and direct monocyte lineage commitment more potently than C/EBP alpha homodimers or AP-1. Oncogene 27(19), 2772–2779 (2008). [16] Horard, B., Castet, A., Bardet, P., Laudet, V., Cavailles, V., and Vanacker, J. Dimerization is required for transactivation by estrogen-receptor-related (ERR) orphan receptors: evidence from amphioxus ERR. Journal of Molecular Endocrinology 33(2), 493–509 (2004). [17] Rosmarin, A., Resendes, K., Yang, Z., McMillan, J., and Fleming, S. GA-binding protein transcription factor: a review of GABP as an integrator of intracellular signaling and proteinprotein interactions. Blood Cells Molecules and Diseases 32(1), 143–154 (2004). [18] Nateri, A., Spencer-Dene, B., and Behrens, A. Interaction of phosphorylated c-Jun with TCF4 regulates intestinal cancer development. Nature 437(7056), 281–285 (2005). [19] Schlisio, S., Halperin, T., Vidal, M., and Nevins, J. Interaction of YY1 with E2Fs, mediated by RYBP, provides a mechanism for specificity of E2F function. Embo Journal 21(21), 5775–5786 (2002). [20] Sandelin, A., Alkema, W., Engstr¨ om, P., Wasserman, W. W., and Lenhard, B. JASPAR: an open-access database for eukaryotic transcription factor binding profiles. Nucleic Acids Res 32(Database issue), D91–D94 (2004). [21] Newburger, D. E. and Bulyk, M. L. UniPROBE: an online database of protein binding microarray data on protein-DNA interactions. Nucleic Acids Res 37(Sp. Iss. SI), D77–D82 (2009). [22] Bailey, T. L. and Elkan, C. The value of prior knowledge in discovering motifs with MEME. Proceedings of the Third International Conference on Intelligent Systems for Molecular Biology, Cambridge, United Kingdom, July 16-19, 1995 3, 21–29 (1995).

89

Suggest Documents