PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cz16g02020.t1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Sphaeropleales; Chromochloridaceae; Chromochloris
Family ERF
Protein Properties Length: 947aa    MW: 100930 Da    PI: 4.3826
Description ERF family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cz16g02020.t1genomePhytozomeView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1AP2361.7e-1199148155
            AP2   1 sgykGVrwdkkrgrWvAeIrdpsengkr.krfslgkfgtaeeAakaaiaarkkleg 55 
                    s y+GV+w k  ++W A++r+        k+ ++g++ t+eeAa+a++ a+ k +g
  Cz16g02020.t1  99 SSYRGVSWEKAAKKWQAHVRE------AgKKRRIGHYVTEEEAARAYDGAAYKAHG 148
                    67****************888......33999********************9998 PP

Sequence ? help Back to Top
Protein Sequence    Length: 947 aa     Download sequence    
MGLIMADYSF KYVNYRSTSK KWIGQLKIPG QKSWTTVYCA TAPEAARNVD RIIYKVKGPH  60
APLNFPLTDA ERAALDAITL DELMASYRAA GSVFSSGCSS YRGVSWEKAA KKWQAHVREA  120
GKKRRIGHYV TEEEAARAYD GAAYKAHGSK AKLNFPDMLV SDEVIAPVHA TPPPTAVHTS  180
LPADDTLLRD ESSVRVQSMQ AVVIQPPNDA QLSSDLPPAE GSVDDEQYMA DWEDFPGAVG  240
NRTRGASRKI REQEFSSDAE AMQNSVCAHT DTDTIDNKQS QEMQLAAHDE SLSDTQQCRH  300
QQRIDAAHMQ GLGEVAVDDH NVSDGNCEHD SQLSLDAADS DWVDDYDDDD DDDDSMLICD  360
DGLFDDYARC KANHNGKSGD QQVEEPSMPD ISAMPMAAAA GMAVAPAVPA AEASAVHAVC  420
QDETEGVSQG SIVHGTAAPG KVPVPSLLDS SQTKSIGSTP GADCAGVDAA AHPTETNTAV  480
PSAADPTIIM VADGVADPLT ALPTATVPPL QLASAAKVQL CLDAGVSAML EEALLELHLT  540
PGLDDVPLEL FAALGIDLDE NGMGFASSPH VIPPSTTAPV FRGNPHINNQ YPGEPDKLTD  600
AHQLPAQHDS QNTGPCAKDT APAEKSLHIL EGDYHTWAPT EEDLQNEWFH MPPDSDEEFD  660
PHQVPLPVVP LTGNPKVDMI TGLKMVIEES QAHFPIRAKF VRMGALIKEA CPELSYEPLR  720
KRPRPFESED PPPVAAVAGN TTTPTEPWRF PVAMKPLKMK AEQLCKQVDT SLPTAGYRWA  780
AGGSGSSSSS SRMPVDAVDA ILLMGYDPDD EDEYYLGASD DDQRAVDDSN DSHAAKPIDM  840
QTSSHIILRT ATTRPSDSAP AVALQHGNTV HSPTASAAAG NAVAAPAAGG GGTSGQSHND  900
MMNMDADPSG RLVGQSPILM KHVPVSCISA DELRMLAVAE HDAIQL*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1122152KKRRIGHYVTEEEAARAYDGAAYKAHGSKAK
2721725KRPRP
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G51190.16e-09AP2 family protein