PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cz13g09180.t1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Sphaeropleales; Chromochloridaceae; Chromochloris
Family AP2
Protein Properties Length: 1343aa    MW: 138015 Da    PI: 6.4574
Description AP2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cz13g09180.t1genomePhytozomeView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1AP228.63.6e-09390439155
            AP2   1 sgykGVrwdkkrgrWvAeIrdpsengkr.krfslgkfgtaeeAakaaiaarkkleg 55 
                    s+y+GV+w+k++ +W + ++       r ++ + g++   eeAa+a+++a++ ++g
  Cz13g09180.t1 390 SRYRGVSWNKRTLKWQVTLYA------RgRYRYFGSYTNQEEAARAYDKAALVWKG 439
                    79**************99666......33999999***************987665 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1343 aa     Download sequence    
MMAYEKSKDD VNEYALRLAG SWLPHVEVEV GGITSILLGG HTSQSSNVYV QCQCSLCTNK  60
EPHVGLNTLE EQLEHVTGED LPDSVEEANK VCRMYLFVRP PEDPNRRIPL GLYLKQQAEC  120
MGGSGLIGSR LWIYWYDDPK VINHVSYGTW YKAEITGYDG HTGEHVVRYF VDNTLERIFL  180
PCTHVHFGRS APPPGEAPVM TPVSKARSPD QGVVKSGRGD DVLGGGADVG AKARSPDSQK  240
VGRGGKRSAA MGSAEKSEGK RARGGAADVG GIPSAFATAG LPDIAPLGDA TANPVKGEPV  300
ALADQLSGHA DDLAAPMALA QGLEAVQEGL VDGSDLLPEG AADVSATDDA TDGDDLRNTT  360
DQAVVPEVAV AAAGARAVAP PPGAPDHKTS RYRGVSWNKR TLKWQVTLYA RGRYRYFGSY  420
TNQEEAARAY DKAALVWKGP QAALNFPAEG YRSAQVQQQI MEAAAAAAAE GQPVAVPGVP  480
AAAAAATAPA TQESTGVVSL QPVASTTELK DEVVNVEGVS LPVLEGGDQT GGDNGVEGDD  540
DYGVSYGDLD AVNDQLGEQL GAVGSGTLDL SRKKRRKKAA SKLFTRKKIH KADPLLDAIP  600
PVNAPGLVYV KRRGRPPKGS LPAFVPEAED ASYPEGSGVG VEGEDASEGL DTEGNVRADL  660
GLNDTEATGN GRRSTREAAQ KGMRRLAAAM ASVVNARRDG LWARLMERAG DGISDDENGA  720
DGTLAHYIAP SLPSPRGRGR GRGRGRGRGR GRGYVAPDVD DADGGVGGHD AVNEAADEGG  780
ERGDDMDIMP EDAAAALLSM QDCIPGASET SWKKSSKTER EDEQVTVPTG RRRGRPPGRG  840
RGRIAARGTR GGRGRGRGRG RRSGPPPSAA AYNARAAVAA QNPWAMMNPW MNPKAAMAAG  900
GYPGADDSDA EDPQAAYAAW MNSPYYAQAY AAQMAAFFSN PAMAGMASAM AASSAGNQWP  960
GMEATMAWQN QYQMMLQHMH RQAMTSQWPQ GGVDEDDDES EESDNESHGQ EVLAAAKAQH  1020
EKKAPSGVDQ DALRRSMLAA QMGKAGLPPP VSGAPGGMGV NALMAHMGGA RKSSSSSKAA  1080
PAGAMAAGGV PGMTSSKGSL KKPVITTANA TMPASSSGMG LPGRATAPVP AAQVAAAQSA  1140
AQVAAAQAQA LAAVKNSGMG AVPKLAPPAK LYPMPPNPNL IGQAHGTMKF PPGPSGIANR  1200
TAAPVPGVPA AVASAAAAAA RLGAPPLTTN VGGLPPRPPS APATAKAPAG PSHHSGGATN  1260
ATPVYSREGD GISGPCTPQP GPSPVKAAGP ADAQAVPPAA AGPQQQPHVV PQLSQPAVAA  1320
AVSQPPMHPA APAVAVSGPQ AQ*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1572578RKKRRKK
2572588RKKRRKKAASKLFTRKK
3573577KKRRK
4573578KKRRKK
5736745RGRGRGRGRG
6736746RGRGRGRGRGR
7738747RGRGRGRGRG
8738748RGRGRGRGRGR
9740749RGRGRGRGRG
10740750RGRGRGRGRGR
11742751RGRGRGRGRG
12742752RGRGRGRGRGR
13744753RGRGRGRGRG
14850859RGRGRGRGRG
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G72570.13e-09AP2 family protein