PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Kalax.0197s0052.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; Saxifragales; Crassulaceae; Kalanchoe
Family Trihelix
Protein Properties Length: 401 aa    MW: 42777 Da    pI: 9.1755
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Kalax.0197s0052.1.p genome JGI View CDS
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1 trihelix 51.9 2e-16 31 126 2 74
             trihelix   2 WtkqevlaLiearremeerlrrgk.......................lkkplWeevskkmrergferspkqCkekwenlnkrykkikege 68 
                          Wt qe+++Liea++ ++er  ++                         ++ +W++v +++++rg+ rs++qC++kw+nl ++ykk+++ e
  Kalax.0197s0052.1.p  31 WTLQETMVLIEAKKMDDERRLKRPgcssseqqqvtgsqemlmmirnkPAELRWKWVEDYCWRRGCFRSQNQCNDKWDNLMRDYKKVRDYE 120
                          *************95555544322235556667777777777777779999*************************************** PP

             trihelix  69 kkrtse 74 
                          +k +++
  Kalax.0197s0052.1.p 121 RKLAEQ 126
                          *98655 PP

Protein Features
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profile PS50090 6.435 23 110 IPR017877 Myb-like domain
Pfam PF13837 1.5E-11 30 122 No hit No description
Sequence
Protein Sequence    Length: 401 aa
MADQSGGAGG GAVVASGGGG GLMREYRKGN WTLQETMVLI EAKKMDDERR LKRPGCSSSE  60
QQQVTGSQEM LMMIRNKPAE LRWKWVEDYC WRRGCFRSQN QCNDKWDNLM RDYKKVRDYE  120
RKLAEQNAAA GTSSSSSLGG SVSYWNMDRS ERKERSLPSN MSSTIYEALV DVVEKKVGGS  180
SSAAALLSPA TAGVNTSGTP TPTDVLQAAT PTIMPPTAAL SLLHQQHLNA SAAALGLPLP  240
EIAGVAPLPL PPPPPSAQPN IPTCQPATTT IRESSDSDTS SDSPAKRRRK DKGPETSGGP  300
ASSRNEAAGV ANAISKGASK ISEAIQASES RREERHREML GLKQRRLQIE EARMEVNRQG  360
MSGLAEAINN LANAIMALAA SNNNHGHGQA TSPPPPPPPP *
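The sequence block above is grouped in tens with a running position counter at the end of each line, so it has to be flattened before the coordinates in the domain tables can be checked against it. The following is a minimal sketch of that step, assuming the coordinates are 1-based and inclusive (which agrees with the trihelix alignment starting at residue 31). The helper names `parse_sequence_block` and `region` are hypothetical and not part of any PlantTFDB tool; only the first two lines of the sequence are used to keep the example short.

```python
import re


def parse_sequence_block(text: str) -> str:
    """Flatten a grouped, position-annotated sequence block into one string.

    Everything that is not a letter is dropped: the spaces between groups,
    the trailing position counters, and the '*' stop marker.
    """
    return "".join(re.sub(r"[^A-Za-z]", "", line) for line in text.splitlines()).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if start < 1 or end < start or end > len(seq):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]


# First two lines of the protein sequence from this record.
block = """\
MADQSGGAGG GAVVASGGGG GLMREYRKGN WTLQETMVLI EAKKMDDERR LKRPGCSSSE  60
QQQVTGSQEM LMMIRNKPAE LRWKWVEDYC WRRGCFRSQN QCNDKWDNLM RDYKKVRDYE  120
"""

seq = parse_sequence_block(block)
print(len(seq))             # 120
print(region(seq, 31, 40))  # WTLQETMVLI, the start of the trihelix alignment
```

Passing the full seven-line block to `parse_sequence_block` gives the complete protein, which can then be sliced with the Start and End values from any of the tables in this record.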
Nuclear Localization Signal
NLS
No. Start End Sequence
1 286 291 RRRKDK
Regulation -- PlantRegMap
Source Upstream Regulator Target Gene
PlantRegMap Retrieve -
Best hit in Arabidopsis thaliana
Hit ID E-value Description
AT2G35640.1 9e-62 Trihelix family protein