PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID OIV90580
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade; genistoids sensu lato; core genistoids; Genisteae; Lupinus
Family Trihelix
Protein Properties Length: 2124aa    MW: 239428 Da    PI: 7.5486
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
OIV90580genomeEnsemblPlantsView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix422.5e-13175318013887
  trihelix   38 m.rergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87  
                m +e g++rs k+Ckek+enl k+ykk+keg+++r  ++ +++++f+qlea
  OIV90580 1753 MaEEFGYQRSGKKCKEKFENLYKYYKKTKEGKGSR--QDGKHYRFFRQLEA 1801
                43678****************************98..55667*******85 PP

2trihelix35.72.3e-1120322097164
  trihelix    1 rWtkqevlaLiearremeerlrrg.k.lkkplWeevskkmrergferspkqCkekwenlnkrykki 64  
                +Wt+ e+  Li+++++ eerlr++ +    +lW+e+s+k+   gf rs+ +Ck+ w++++   + +
  OIV90580 2032 KWTEIEISNLIQLKNSFEERLRENgClVHNGLWDEISAKLGCLGFDRSASECKQIWDEISISLRVT 2097
                7*********************95244679***************************998766554 PP

Sequence ? help Back to Top
Protein Sequence    Length: 2124 aa     Download sequence    
MLNFGEQLLG QDGNQFQMNN TRVSINPENP ILVRSNPIPD WQLNQIRRAN FQQFHVPGSG  60
YMNGMHMRGH YARAQHNMST GLNSRLELLL KNNANPISAA NSSFDVRMNR AAGNPLLPIF  120
RTQASSNVRE SYAGVQRSNI HSAPGIMYKE SNRLLIPNRN SEFNGSNSET LLKSSAQWSV  180
PNQVEGIPHG DSYPKYSLNY MPITEADASV TFNSFQSIPI TRDQLKFAGN QFCAIPDYTR  240
AGSTSQDKGK QEELISSTEK VQECYNGLLQ QIVDSSSAAI STSCGDQRGS DSNFCGEGSG  300
LGFDLNKTPD QKVATRRRKH RPKVITEGKP KRSSNPATQK KQVKENSPKK RKKVLKTEAT  360
PQADVIEETN GLTVPTRKTC RKALNFISDK SRNESQSRIV CHHDEAFRTT SDYRTAEMLS  420
GENVKTNSVF LSNQQNELTV QGRKRIITLP GTTEEKQIPK FLATEKGPAQ GNSVLCQERS  480
NGCMQQYIHA KEIGNILFQS ETCFENSQNT KELICQNTHQ SESNIPSSSI KGKVSKRKRK  540
SIDSQHNSAT NPLGTSLCQE ILQAGENFEG EALDKGFLET TNKKKTKKRL HGKVNGTSSC  600
QIMSKDESQK VIRKGKKVSQ SPPHVKMANC CTESDRFLEQ KNRGTSTGDC FAISGELHQI  660
YSTLIDEIIC RLNNLNLSES NTSAIEGQSA LVPYKGDGTI VPYQEPDIPK KHKPRPKVDL  720
DPETERTWKL LMGKEGSNSL DGTDKEKEKW WEEERNVFRG RADSFIARMH LVLGDRRFSK  780
WKGSVVDSVI GVFLTQNVSD HLSSSAFMSL SARFSLKSEG SRSYIVGTKT WVEEPDDTTM  840
SCGRGTFSEL AYHLGFGLPH NTSGVWRDSE TSRIQRIPIE TNNKSSEEEL LSSQDSLESS  900
ITQGTRGYGS CSRPSSESGP NRGCEPSKAQ FLTSTNSFQV GKTTMFQEFY NSVNGASLCE  960
ERTKDGQVQH AENLKQSLGV EGVNNRNFRS AFNYPSNFGY PQEQEPVVPS AYYEFHYPDT  1020
QGLETFQMNG NESFWPETVT THSKFPDNNY EKFGIPEIGD NADEPTEKQY GSGALSSPAL  1080
PTMNHSGPLS KHLDLLQGTS HILGSENIIS AANTQVCSDN SRAESNEQQI CSPSPTYKKK  1140
KTKVSKAKKV KPETEKKHAC DWDILRKKVL ANGTKIERGK ETMDSLDYEA VRCASVKEIS  1200
DTIKERGMNN MLADRIKEFL NRLVIDHGSI DLEWLRHVPP DQVKDYLLSI RGLGLKSVEC  1260
VRLLTLHHLA FPVDTNVGRI VVRLGWVPLQ PLPETLQIHL LELYPMLESI QKYLWPRLCK  1320
LDQRTLYELH YQMITFGKVF CTKSKPNCNA CPMRAECRHF ASAFASARLA LPGPEEKGIV  1380
GMSVPIAADK NPYVNMKPVI LPISENNFIG EATHESGNCE PIIEEPTTPE QDSTNALESD  1440
IEDFFVEDPD EIPPIEFNIS ESALNVQSFM QYMEHGDGEM SKALVALTSQ SASIPVQKIK  1500
TVSRLRTEHR VYELPDSHPL LEKMDKREPD DPSPYLLAIW APGETANSIE PPERRCGSQD  1560
STDMCNDKTC FSCNSIREAN SQTVRGTLLI PCRTATRGSF PLNGTYFQVN ELFADHASSV  1620
QPIDIPRAWI WNLPRRTVYF GTSVSSIFKG LSTPEIQHCF WRGFVCVRGF DQQKRAPRPL  1680
QARLHFAARF TDRAQIADIN QKKSFMHTRD RIRDCGVKAT SAFIMSPSWH SSIGYNLIYS  1740
LQTLVIELKS RIMAEEFGYQ RSGKKCKEKF ENLYKYYKKT KEGKGSRQDG KHYRFFRQLE  1800
AICGEPTNIH HNASTSDNKT HHHDASNTRA GFQSPTFATN QDSVNVDHFP NHMSSESLSF  1860
SNLSDQLETS SSENNDEDLS AIAYMMNQSR DNSNKHKGLE LEHRQSEGRV RKSWRGKIEE  1920
IVGSHTRKII ETQDAWMEKM LSVVERREQE MASKEEERKR KESMRFDQEV HELWAKERAW  1980
VEARDAALIK VVRKHFGFKE LEAFPLHHEA MVVEEEQQNK NNEYPCETAS GKWTEIEISN  2040
LIQLKNSFEE RLRENGCLVH NGLWDEISAK LGCLGFDRSA SECKQIWDEI SISLRVTTVE  2100
CSSSSANTTR PWYLGLKLRD DDEL
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1316356RRRKHRPKVITEGKPKRSSNPATQKKQVKENSPKKRKKVLK
2348354PKKRKKV
3349353KKRKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G03680.15e-44Trihelix family protein