PlantRegMap/PlantTFDB v5.0
Plant Transcription
Factor Database
|
Home TFext BLAST Prediction Download Help About Links PlantRegMap |
Transcription Factor Information
Basic Information? help Back to Top | |||||||||
---|---|---|---|---|---|---|---|---|---|
TF ID | kfl00941_0010p | ||||||||
Organism | |||||||||
Taxonomic ID | |||||||||
Taxonomic Lineage |
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Klebsormidiophyceae; Klebsormidiales; Klebsormidiaceae; Klebsormidium
|
||||||||
Family | DBB | ||||||||
Protein Properties | Length: 3727aa MW: 390310 Da PI: 5.9296 | ||||||||
Description | DBB family protein | ||||||||
Gene Model |
|
Signature Domain? help Back to Top | |||||||
---|---|---|---|---|---|---|---|
No. | Domain | Score | E-value | Start | End | HMM Start | HMM End |
1 | zf-B_box | 18.8 | 3.5e-06 | 1174 | 1216 | 4 | 39 |
zf-B_box 4 rkCpeHee.kelqlfCedC......qqllCedClleeHkgHtv 39 ++Cp H++ +++fC dC ++++C+ C+ +H+gH v kfl00941_0010p 1174 DRCPAHPKlVAARIFCVDCsgaegqGMPCCAACSKGDHRGHLV 1216 69******666********66555559**************76 PP | |||||||
2 | zf-B_box | 17 | 1.2e-05 | 1407 | 1450 | 4 | 42 |
zf-B_box 4 rkCpeHeekelqlfCedC......qqllCedClleeHkgHtvvpl 42 + C eH +lfC dC + ++C++C+ +eH+gH+vv+l kfl00941_0010p 1407 KPCREHAGG-KRLFCIDCahkpgyERPVCAQCVKKEHEGHRVVNL 1450 67****996.69******555555589***************986 PP |
Protein Features ? help Back to Top | ||||||
---|---|---|---|---|---|---|
Database | Entry ID | E-value | Start | End | InterPro ID | Description |
SMART | SM00336 | 0.26 | 1171 | 1219 | IPR000315 | B-box-type zinc finger |
CDD | cd00021 | 0.0081 | 1174 | 1219 | No hit | No description |
SMART | SM00336 | 0.59 | 1404 | 1450 | IPR000315 | B-box-type zinc finger |
SMART | SM00336 | 3.8 | 1711 | 1757 | IPR000315 | B-box-type zinc finger |
SMART | SM00336 | 0.71 | 2299 | 2345 | IPR000315 | B-box-type zinc finger |
Gene3D | G3DSA:1.20.930.10 | 1.6E-4 | 3651 | 3724 | IPR017923 | Transcription factor IIS, N-terminal |
SuperFamily | SSF47676 | 8.11E-7 | 3651 | 3724 | IPR017923 | Transcription factor IIS, N-terminal |
Gene Ontology ? help Back to Top | ||||||
---|---|---|---|---|---|---|
GO Term | GO Category | GO Description | ||||
GO:0006351 | Biological Process | transcription, DNA-templated | ||||
GO:0005634 | Cellular Component | nucleus | ||||
GO:0003677 | Molecular Function | DNA binding | ||||
GO:0008270 | Molecular Function | zinc ion binding |
Sequence ? help Back to Top |
---|
Protein Sequence Length: 3727 aa Download sequence Send to blast |
ASLSRRGDIP RQGRAESQLK PWYAIVRDFL QEGKPVANMN RGEKHLPQRR ESRSPAADAT 60 RQGRRERNRD ARRGERDHRR ERSRSCDWEA RPEYASPERE RSLRDEPDRE RRQREKERAR 120 PQKRRSLSRE KSERREERGR NVHTEKEQGG HRERRRGDDK GVSKGGEKRR GDAKVEERGQ 180 GDPISPLVEF DREEQASEGR GARSKLSPGF LRKDGGETLQ ETAGRSVGRS SDKPERRKSG 240 TTGERSSDRA GGRRQEGGGE KDIRSERNVE RGKGRSSDMS ERQKSVTTSE RGSDRAGGRG 300 QESRGKERES ERGGVKEGRS QDSGRADRPR WTVALEENAH EGLRKLPEVK REEKRTGEVS 360 CRDCELGDDR RGRSGRSPIW AEEQLPASSE ALEREARFKE KRRKDAAYAA SRLKLSIVAD 420 LWMREEEDGI LEGPPGRRGD SKSAGEPHTP SRASGGAQAG GPHVDFVREA FGGAAMEPLR 480 VREDSPASVE AVDSGSPWKS RSRPKVLAPK SSGMVTKGNE RRPGESPGIA TKGGERRPSD 540 GGSSVVEDGK RTASAFGFGQ QEGGSARQIG CAEKPGAAAA GVEKANGKTI GSRIAAGEVA 600 TRGPSRGGAG PAGKEEALNG ANGMQDLDRM AAQGVARGDG ERVDFGRSLG VSAGAVKRME 660 KAAGVQSPAA SVQRLAMSGG TGVVELLLGP SQSEEDFHTV FEEASGPQIS TALTALQIYS 720 ESESESGDEE ADRPLFQRMG SEAAAACQGA GGTLQSDGGG SLLSEIGAEN LVSPPPAAIW 780 AAAAGGAVEE GVEQVAEVRA EPGSAGDQEE PLGDQGPPAS SNSQPNVPGL ATKGSPEPRM 840 PPLEDPNTPS PVFAPTDTRL FPQADNPVSS VTPAASSDKT PGAVDTGVLM SASAPQGPEG 900 TGIETDDITG VANEGPGGDS AEGRHAVGLG ALGAAATAPP AGGLSVPPPE VLAESSSGPA 960 EEPGARKMTS AGDPSFEDGV AVEEELPVWQ GQRSGGEQPS EGEQPSAPIE RPTALDDEPA 1020 DSGAVPGERT PEEGSCPVSV TNPAAVEEQS TAALVQLGQE PSIGQCERGG VQRRDALTAE 1080 EEREELPDAR QKGPPEAPAQ GAPEVVAAGL ADGAPEALAG TPKQNLVPLL VARLLPSPTE 1140 LLSESLGPVG GNLTAQERRG LDLTALLKKG WETDRCPAHP KLVAARIFCV DCSGAEGQGM 1200 PCCAACSKGD HRGHLVFDIK YKDREWLLVM DNRRNLNVSD LPCKVVGSAT GVRIWPRESA 1260 RPSSAALTCK ACGGFLSARS EEQLVFCSLQ CKMATNPKAL LTTSAADCHL PQAEQTQSST 1320 PAEEEATESR TAAGRNGAGH IVADGGVDAA ELADALNIAD DVSERVASEA PARAADTRTD 1380 KGASAARSLS PSDADWLDTV LEKGWEKPCR EHAGGKRLFC IDCAHKPGYE RPVCAQCVKK 1440 EHEGHRVVNL SPGHWSADYW LLMDGVHDLD VSNVARRKTN NAAENSSGAV IWPRESEPKN 1500 QTCFKCKGCR RLAMARGCVQ KKEPEFCSVL CKVRAVPNMI RKTSVDDCRP QQTGIKQPTA 1560 PLKRKVLSAA AGRPDPPTAV RGRAGHVLAG TPAPAAALPP CADDKTGTTG ESRTTGLNGE 1620 FGRLSARADS EGAVGALVKE SADAPGEGIA DGVEETADGA AVERTAIATA AETDGTPCTN 1680 ASAEGAVSAA VPAPRERYWG PAWLEAVLEK GWEEPCDKHP QPKRFFCVDC AGLRGCERPV 1740 SAGCFDKKHK EHNTIELRIR DGVWRLVDTG LGEMEVSDLP RRKLPEGPYG VPIWAVQSSK 1800 RTTSVVLCQG CFGPASFRND PARTFCSLEC KVCRVLFSFF SFVDNLRIAS CRCSLGTLPR 1860 AARTDPKSVR PTSTDPSPGP PAESEAPTEL PMEVAGEIEG LRAVGFKSIN ALDLRGEEAD 1920 GLTGRTWRKA SLATDGARTV AEQGVDCSGG EQLEAAALLE DPPTADRGDE PRTGDVILAQ 1980 ETTAVAAEKQ TTGGHAERRE TENGDGDMET PEGLAEAAPR GTAERPMKAA VDSSAAAREG 2040 GGGAAEETVA LLATAGDAPG QSAGSASGAE ENDGLATREA VPPGSLGVSG AGLSNAAVTT 2100 VPHPPLEGTE KLPASTGAPG EAMAMSDWPV EGTADGGEEA ARSEVGEKVV DGVGEGLGPV 2160 PAAGGGTGSG GAEGTLAEGG SDAAVQNVDD APFTVEDATE TVLTASEERG RILEGGEGQV 2220 YKSEKSADGL EERADGVAGG MARAPGEGAA EVTAADAGAS AEKVAEGDPS ADAAATEIPA 2280 AGPRRHWGPD WLETVFEKGW AKPCSTHKYG KLVFCLDCAG TTQQTRPTCA GCLRGDHQGH 2340 ELVEMRVQDQ VWMLVDGTHD LDVSRIYAKK GSGWSGMPIW PRHFQTRQTG NLCLCYGCGV 2400 QWPFRGGEVP AFCSVLCKAR TNPHSVSTYE YRAQQIEATD RRVAAGRIAA CAPLGISDPL 2460 VDLDTGPEET GAQRAKARKP ATEDAELSPG RSGVEEPGTE RPGVEEPGTA RCAEGASHKG 2520 PLSRDRDGKK REEGIDDGAR AKGVESAATS EGRVDAAAAK RALVGFRVGR ELPRGTIVLI 2580 TVGPKRMKNV PGFHKKQGVV IEAQADGTHK VFRGKAVCKW NRSFQRSELQ VMGIPKGHHL 2640 RAKDFEKGSW DGKMTAEEIT NGAGLLDADG RSFYKLRKTL VSAGEKEKAG VEGSDASRWG 2700 DDISEGGVVL GKRRQSGSAG ESHDDEGGRK MSLKKVMEAS DASKKKPRGG AANSAQEAAL 2760 KKPAPKASER RVSRELLCIV GDVDLVAREE RRATRARAKD PQPAVGTGRR GTGGKRKLDR 2820 AVEAPKEKKA RSAGDNEGDE QAKGGAETPN RESGALKLFT GGMTELAALL ANLGNEGIGL 2880 GLDAHAVLAG AADVVSASAA ADVSGSAALA QTVAPPAEAA SQGGAEPGTT VVDLDQDEPS 2940 SEDATPKKRQ RRSAPAAYPL LSDGVQLLDR SSKRAAATKA CQKIKQNLIR ERIPITDDGE 3000 QVPQEKAEPV PQAAAKALAV PNRETEPHGL KAPRPPTAPP AGLCWLGSGR VAPTGGSLPG 3060 QMQAPASAHP ATSSIGHWAP FKLGDPVADQ FASQILATSF RASRNASRDP SDRLIIFRTE 3120 TSLVVGGRVT AAGAILLDQW GLQLHRMGFP GVVVDRSAKT VVSLLPVPVP FPGAPIASVF 3180 SPAVQKGATA ELQARGSAAP ATEVPPLIPS RQTSDQEPVL GPGAMRNGAP SVPTSVAQPV 3240 TAQPPPPERI PTPSPRAGSI PPEEGVGPKG KRQSAATVAC LALGTPREWK RQRRAEEECS 3300 PAAEGSEVRS EDLSEAAELE RHEAEGTGGP VPSDPVLVSP PPADSTELPD SKESQLGMGP 3360 LGIEAVPIPL EEPHVRPKRN ARVETSKLAA PPAAARYAVI RTRPAKGGPF ATEKIIAAPD 3420 AGVHESLGEK QSGPPAQEQT RQGEVGDPPE TGPPTMPVTK RKGRPPKKKL DEAVAAPAAD 3480 VTGGADVSKG HSSGADVGKG AATPAAHTPQ KKRSQGGKRP QEKAAGAAGA AAADVGVTHA 3540 DVSMTEAAAV AAPLVVPKAA KGGKGRGLQQ QQAGRSMSAV AGGAAHNAIV PRGAVSAVGE 3600 GGAASNGAAN GKKLVVAGGR RVVNLPAFLL DSSDEESDDE EEEEEDHGPP EELMAMCKLF 3660 DEVRTDYKKV REAIFYLHYL KGVTMTKDLL ASSQVDVRVR ALMAHAHAGV RNQAQKLGQQ 3720 WEPLRRA |
Nucleic Localization Signal ? help Back to Top | |||
---|---|---|---|
No. | Start | End | Sequence |
1 | 250 | 258 | GGRRQEGGG |
2 | 2945 | 2951 | PKKRQRR |
Annotation -- Protein ? help Back to Top | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-value | Description | ||||
TrEMBL | A0A1Y1IMR4 | 0.0 | A0A1Y1IMR4_KLENI; Uncharacterized protein (Fragment) |