PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID evm.model.supercontig_192.2
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Caricaceae; Carica
Family Trihelix
Protein Properties Length: 328aa    MW: 37274.1 Da    PI: 8.6891
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
evm.model.supercontig_192.2genomeASGPBView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix55.91.1e-1718105283
                     trihelix   2 WtkqevlaLiearremeerlrrg.........klkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtse 74 
                                  Wt +e+++Liea+r  eer  ++         k+++ +W++v ++++++g+ rs++qC++kw+nl ++ykk++e e+k + +
  evm.model.supercontig_192.2  18 WTVSETMVLIEAKRMEEERRMKRvgdvgegrnKRAELRWKWVEDYCWRKGCLRSQNQCNDKWDNLMRDYKKVREYERKISRD 99 
                                  *************9777766642223333333799*******************************************9644 PP

                     trihelix  75 ssstcpyfd 83 
                                   ++    + 
  evm.model.supercontig_192.2 100 GEE---SYW 105
                                  442...233 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS500907.0041083IPR017877Myb-like domain
PfamPF138376.2E-1417108No hitNo description
Sequence ? help Back to Top
Protein Sequence    Length: 328 aa     Download sequence    Send to blast
MAEQGGNSSM TEYRKGNWTV SETMVLIEAK RMEEERRMKR VGDVGEGRNK RAELRWKWVE  60
DYCWRKGCLR SQNQCNDKWD NLMRDYKKVR EYERKISRDG EESYWKMEKG ERKAKNLPSN  120
MLAQIYEGLV EVVERRGHDN INTNPSLGYM GMEKAAHVMT SSNVQTPMLI TSPLLEHQVS  180
TPLMPAVLPL PPPLVLPQPP MPQPQPSPPS HSQPVDCDTS EDSETSGAKR RRRSRRDYNG  240
GGSSSGEEVG NAIRKSASII AEALEACEER EERRHREVVS LHERRLKIEE SNTEINRQGI  300
NGLVDAINKL ANSIMALASH KNHSPPK*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1229236RRRRSRRD
Cis-element ? help Back to Top
SourceLink
PlantRegMapevm.model.supercontig_192.2
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_021887581.10.0uncharacterized protein LOC110806900
TrEMBLA0A2P5S9511e-125A0A2P5S951_GOSBA; Uncharacterized protein
STRINGevm.model.supercontig_192.20.0(Carica papaya)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM34932662
Representative plantOGRP18811637
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.14e-50Trihelix family protein