PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Bobra.0163s0044.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Trebouxiophyceae; Elliptochloris clade; Botryococcus
Family C2H2
Protein Properties Length: 2273aa    MW: 236975 Da    PI: 10.8018
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Bobra.0163s0044.1.pgenomePhytozomeView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H212.60.0004212821305223
                           EETTTTEEESSHHHHHHHHHH..T CS
              zf-C2H2    2 kCpdCgksFsrksnLkrHirt..H 23  
                            Cp C+++ s+k nL+rH ++  H
  Bobra.0163s0044.1.p 1282 ICPVCNMVVSQKRNLTRHEKSaiH 1305
                           5*****************997666 PP

Sequence ? help Back to Top
Protein Sequence    Length: 2273 aa     Download sequence    
MAPKTQPFCS IEKNPAFNNF KRRRVEESGI NKLSNATNAT QNGRVSNAVL SPGVPEQTFR  60
TAQNNKPTGE RPSSASSAQS VEGMPRWQQI VQSQLAGKTA SSGVDESVTQ SSSIPLPSPP  120
TKKSLREKVQ AITPCSKPIP SAHGGSEKVD KSEFSSTRNT RLSVNPFSVQ GVLRATFRAP  180
SIINATWASD NETMGGASKE KAAVFSFAVP SAGGRSPQRM VQPGNALAMS PKQREEEARP  240
SAEGLSVPAG RESCDTGVKA DSQKRGIEYT KANTNCNEHA SRVDHEAAGD GRGRRKTRRS  300
TSAVRSYSRE RSRLRMHSPR EQRPGLDQSR SSNLSRRDHR VWDSQHPGGY VEVRDGRQYE  360
RSQPGHQWRG VDRQDRSKER MLNSRHGALD LGHSPGQGRW SHGQYYEAEH GREPWSQSRD  420
NPRAGDSRQW GSRWQKRTHS PGNISIGGRS PTRKSRDLAN RKSTDGSEPP LRPRQPLDGR  480
HLETTDNQHS NHGNHRQPSW VTGVSHARQP PDRIPGSQGG CSPPRAPSPS KETRASAVDE  540
HSAKQQARLR AWIQAKKNLP KVHAMSGFPA PGGVNHTSTS LTWSLDKPAP SLTGGKAPSE  600
ELRPSVLVLS AMPGLASPGS LQEQGSLRTD QNRRMTGPTS QHPVGPILQT PDATGQALPC  660
PSSGRGVSGQ SKVGQAALKP SPVSATPPHG YPDWLANYGS SSICTSSTEA ASGEPADVAR  720
TPASRQVSPR AQEQVPVASP PQGDGPDGSR LSNRPSGADL QTSVRKEAAN SGVSPSARPV  780
HGGTRQQESK QATFSPYGEC PSLRELEARD LGPQRSASPQ TVGPAAVSSS QNRSQNPAQP  840
EQGVPRPGRQ TLSSETHPPP KHPMAVRSAM VNPVIAEPWR VSLETVPVPN ASSQCFNPVS  900
AHPLLEPCSH KYSGPPLGFQ TAPQGPWPSP PDRWNVAGGV SRRPVTSPPP LRSPQGHPWH  960
LLPATTPSPP RPPHGYVLKT AGPPVGTAGG LEQPSGHVGA SSAGQAAPEK RQAELTADHG  1020
EKPLEKRHRP GNSRWDREVQ SGPVGGAPGS RGTHGGEARG ARGGEARGAH QGAHAGSREG  1080
AHEAPGGEAR GTHQGAHAGS GGGAHDAPGG EAWGAHQGAH AGPGGEDRSP EGGEAPGSKG  1140
GETSGPDGGE APGPEGGEAP GPEGGQAPGP EGGEAPGPEG GKAPGREGGE APGPDGGGAP  1200
GSEGGEAPGP KGGQAPGPEG GKAPGAHGQA HGGFPVEPKK ASNITGSLVK VATPKTKQLG  1260
SSLGRRSSQG RGTGYKSNQP TICPVCNMVV SQKRNLTRHE KSAIHIELTE ELKMESGQRL  1320
ESSVPRTSSN PDLSSSPGIA CTQQGAKAAG SRRPSATSAG KAAGASRPPA GKRKAQSDPG  1380
PVKKGSEGVV CSKKAASEQA AGVERVAGGD VAGNGPERVP APAWPGPARG SAAGRTWPRP  1440
GSARSDGQPC RIGSLKDLVV TAPTVGGPPV PLAPPIEGAP ALQKGSQHGQ GRNRSKARRS  1500
GSHLGGSSGK SSNPVEGGRG SPVCSAACGA QAAVTPPASV DGHAGSPVCT AACGAQAAGP  1560
PPASVEDHGG SPVCTAACGA QAAVPPPASV EGHAKSPVHT AVEGPSGAQM SIPPPHMPQG  1620
GTPAGPQRGD AERGAAAPCE TGVPSPLPVG GKSSQERQPR ATKRSSQSQE DRPSSRGDSV  1680
GGREAGPEAG IKTVTAAGMG PPVAKQGVSR DSSSGPQRPS GAPVPSPVAP GVLLFEGQVE  1740
GNVSSAPKSK KPRRNEAAML LEAAVPSGVV RNAPIDPSLE PRGKPFSSPR KPPPASPGTK  1800
AFVRAPCGRP VHDRKVPVND KHRTSTHGPR VENPVNKRPV QRPVVLSQPL QLDVLAKPQH  1860
SPDQAACPER SPGPRAFPQT APREAGVTAL VQQPNRQAGR DPAEGGTYKI PKPRMKSLLE  1920
LSTEPAGAQS KQQQRSRLPS PTRPQKLQPN SARLVSLKAP QEGIQRGRPR MRQAGGHKGV  1980
PTSKVGTART KTARAVGKAL HLSPGSGLRK RKFVPSSLPR NQQSRARAKG SKRRSPRVAT  2040
AAASRVAGSA INACARAKPM NEVVGTSKQG GWVPTATASR AAPLEGGVSK NLLARAVRAA  2100
AASAKHKVWE RQDINAETWA AWALTRKLKG DNFRYYGSSI HGAGLFAARD VKKGEFVIEY  2160
VGALVPSNRQ DMFEKKYLRE GIKSTYMFSL DDDWIIDATK QGNAARYINH SCDPNCVAKL  2220
VHVKDGQNHV CIFARRDINY GEELAYNYQS KATDEATREE CRCGAKNCVK WI*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
12125KRRRV