PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG76723.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family C3H
Protein Properties Length: 1643aa    MW: 178247 Da    PI: 7.2604
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG76723.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H214.88.4e-05866886323
                 ETTTTEEESSHHHHHHHHHHT CS
     zf-C2H2   3 CpdCgksFsrksnLkrHirtH 23 
                 C+ C+k+F++ +++  H++tH
  GBG76723.1 866 CETCDKTFKNAEQMSAHLKTH 886
                 ********************9 PP

2zf-CCCH192.5e-0611061127426
                  ---SGGGGTS--TTTTT-SS-SS CS
     zf-CCCH    4 elCrffartGtCkyGdrCkFaHg 26  
                   +Cr  +r G C++Gd C+F Hg
  GBG76723.1 1106 QPCR-LFRIGRCHRGDDCRFLHG 1127
                  6899.6666*************8 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1643 aa     Download sequence    
MFRPPPSATP PPPVPGGPGA AFGKTLGAVG SGGPGQGNGR PWPGRGNMNG SGPGGGRGNR  60
EGRDWAPGRG GRGGGRGQGG RMGEGYGRGF AGDDHIRQGN APGAYRSGDG NGSSFGTGGS  120
SRQGLPAEAV EWGHLPERER ATGGRGGGGG RGGGGGGGGG GGGGRGWGGS EGERSDCRVN  180
QTVANQGHDF GRQSYDYGNR AEGREGSGWA SRGDAGGCVW DPDQGSGAPE GTGRNWNPEG  240
RGRGATREAP TGWAAREGDQ WGADREKGEM GGHGFSHGDH DDAKLERSWN RGQGAGGEGM  300
ANGQVGVGGR GGLWEGWRAN VAGDRHLSPC ENTRQSAERS QERENGWVSG RQDQGMARKG  360
EGNGKGWGGD QTCRERLCGD DQGRRDFGHP DVQDSASRAW GDSSSRTLYE FQAGKTHVQP  420
NQGNVEPMAS SWKPTHQPPL PHQPPLPPRT SLPPPAAPFQ AHHPPERSPR PHQQPLQPEP  480
LPQQCRRPPP SQSPQQSQPP PGAEQGWPPQ HSPRQKMARP LPPKTPPPPH SLQQPHDVNG  540
MSGPPLHHQQ RTPTPPRNLR PPCPPMPPPQ PPWPKPPSQA AQGIARAPPP LQSHVGERAV  600
VVSSGGPVLP FEPRQQGGRV SMPSAGPLPM LPPQPHCRQQ QGQTQIRPPP PMPPHPHSHQ  660
HHSMAAAAAP PIPPIPPIPR GPPHQQHPQC DRPERIVSMQ DEQLRSHQRP WPPNPNDLSN  720
QVGDRVSPML CHTPGTLLMR NGCAPEERGP FPERRGEGLG QEQGRVHQRG EQSSRAWAPG  780
SGRDAQHVGR NSGPGEEPSS RVAWSNGHYA QGGLRVAGSQ PMQQRQGFPQ RYLPGNNLGQ  840
RAPHPPTLNR EYTVPAQQQT TLLRHCETCD KTFKNAEQMS AHLKTHVACD QEGCTFAASG  900
KLLKEHKATA HHVATVSASA LRALAKRPKE SDEDIQRWIA ERKRNFPTDA NITRKKRDAQ  960
RRAETGELVD EEERTRVKRL REILSRQEQM GLPVAEIPPD LLRIRRPVGV RRKPAEEESR  1020
PTSQTGVEKQ RSDTCAARPA AQQLKDVNSN KESGVKDAGN LSAGKEVEDV NGSGKKRSWP  1080
DDEDGRDWDS HVADKRCNKA NARKLQPCRL FRIGRCHRGD DCRFLHGHDQ SEGRRGTNEV  1140
KVNSSVTGNP MDPCNGSLLA KLLSRDIRRD KSFLLQSFRF MVNNNFFADY PQKEPLKFFN  1200
WNASNNELED EAAGTLKVHV AEADKVIADL IDKLDDSDAR GGMSDDKGGD ENEDLGASSG  1260
HRANSRATVS GVHTRGSEND GGHDEDKVEG KDGSCHAVAK RLDNDDDDDD DVEKEEEEEE  1320
EEEGKQGKKE DRLVADKKIS HVEKKKGKRD DGDDGTDDDK EEEEDKDGEG KKEDMEGVED  1380
KEEEEMEGVE EKEKEVHVGE AGNVGHLREH VDACQTRSNG VFTKDADFAE WSSSALDGVG  1440
ATRSGRDGYQ QDLERKVDGT ERGRVQTVRF SSAADAISNV QNMSWNEDSI RRCISRSGAT  1500
EKSDNSIKER EEQRGEEKMA NGVRKVQSAD PAATEEAAKA GAHESSGNGK KRKKKVSETN  1560
GIEKDTRVKH SFKKHRKGKQ SSLPSSGGGS THHMEERMNC VEQEQQQEEC VKIAKGVCKE  1620
AAMEEEACEF IGWGAMCAKM CLS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
16977RGGRGGGRG
2143150GGRGGGGG
3145165RGGGGGRGGGGGGGGGGGGGR
4147154GGRGGGGG
5148155GGRGGGGG
6149156GGRGGGGG
7151171RGGGGGRGGGGGGGGGGGGGR
8942957RKRNFPTDANITRKKR
915501554KKRKK
1015501577KKRKKKVSETNGIEKDTRVKHSFKKHRK
1115501579KKRKKKVSETNGIEKDTRVKHSFKKHRKGK
1215511578KKRKKKVSETNGIEKDTRVKHSFKKHRK
1315511580KKRKKKVSETNGIEKDTRVKHSFKKHRKGK
1415521581KKRKKKVSETNGIEKDTRVKHSFKKHRKGK