PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID evm.model.supercontig_403.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Caricaceae; Carica
Family Trihelix
Protein Properties Length: 346 aa    MW: 38176.3 Da    pI: 9.9511
Description Trihelix family protein
Gene Model
Gene Model ID                Type    Source  Coding Sequence
evm.model.supercontig_403.1  genome  ASGPB   View CDS
Signature Domain
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  51.2   3.4e-16  50     133  1          86
                     trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkm....rergferspkqCkekwenlnkrykkikegekkrtsessst 78 
                                  +W++  v +L+ea+++++   +r+klk+++We+v++++     +++  ++++qCk+k+e+++kry+ + ++  +      s+
  evm.model.supercontig_403.1  50 EWSEGAVSCLLEAYENKWVLRNRAKLKGHDWEDVARYVsaraNSTKSPKTQTQCKNKIESMKKRYRSESATPDG------SS 125
                                  5*************************************87777777888999***************9999987......46 PP

                     trihelix  79 cpyfdqle 86 
                                  +p++++l+
  evm.model.supercontig_403.1 126 WPLYSRLD 133
                                  99999986 PP

Protein Features
3D Structure
Database  Entry ID  E-value  Start  End  InterPro ID  Description
Pfam      PF13837   3.3E-20  48     133  No hit       No description
Sequence
Protein Sequence    Length: 346 aa
MDKEISTTQD HHPTPSPSLL LPNSIKEDLS PRKPFSSSAT SGADRLKRDE WSEGAVSCLL  60
EAYENKWVLR NRAKLKGHDW EDVARYVSAR ANSTKSPKTQ TQCKNKIESM KKRYRSESAT  120
PDGSSWPLYS RLDLLLRGTA PPQPPPPPIS SHPAQAAPAP ALTSPPLVVL EPSAQVVVVS  180
QSPALPPTPA PPPAPVVGNA QNSHGSNGFD RLAKEDGDGT KFSDHEPDKI AMETDSSTPA  240
LYSDKEKLRS KKLKMKLEKK KRRRKEEWEI AESIRWLAEV VVRSEQARME TMKEIEKMRA  300
EAEAKRGEMD LKRTEIIAST QLEIAKIFAA TSKGSVDSSL RIGRS*
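Note: the sequence block above carries spacing, running position counters and a terminal `*`. The following is a minimal Python sketch, not part of the database record, of how the block might be cleaned and how the signature-domain coordinates (Start 50, End 133, read here as 1-based and inclusive) map onto it. The helper names `clean_sequence` and `region` are hypothetical. Counting letters only gives 345 residues, so the stated length of 346 appears to include the `*` symbol.

```python
import re

# Sequence block copied from this record, counters and stop symbol included.
RAW = """
MDKEISTTQD HHPTPSPSLL LPNSIKEDLS PRKPFSSSAT SGADRLKRDE WSEGAVSCLL  60
EAYENKWVLR NRAKLKGHDW EDVARYVSAR ANSTKSPKTQ TQCKNKIESM KKRYRSESAT  120
PDGSSWPLYS RLDLLLRGTA PPQPPPPPIS SHPAQAAPAP ALTSPPLVVL EPSAQVVVVS  180
QSPALPPTPA PPPAPVVGNA QNSHGSNGFD RLAKEDGDGT KFSDHEPDKI AMETDSSTPA  240
LYSDKEKLRS KKLKMKLEKK KRRRKEEWEI AESIRWLAEV VVRSEQARME TMKEIEKMRA  300
EAEAKRGEMD LKRTEIIAST QLEIAKIFAA TSKGSVDSSL RIGRS*
"""


def clean_sequence(raw: str) -> str:
    """Keep letters only: drops whitespace, position counters and '*'."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


seq = clean_sequence(RAW)
domain = region(seq, 50, 133)  # trihelix signature domain from the table

print(len(seq))     # number of residues, stop symbol excluded
print(len(domain))  # should equal End - Start + 1
print(domain)
```

The extracted region begins with `EWSEGAVSCLL` and ends with `WPLYSRLD`, which agrees with the query rows of the trihelix alignment in the Signature Domain section.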
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    250    261  KKLKMKLEKKKR
2    258    263  KKKRRR
3    258    264  KKKRRRK
4    259    264  KKRRRK
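Note: checked against the protein sequence, the NLS Start/End values appear to be 0-based and inclusive, unlike the signature-domain coordinates, which match the sequence as 1-based positions. This is an observation from this record only, not a documented convention of the database. A small Python sketch of the check follows; `ROW`, `ROW_FIRST` and the two slicing helpers are hypothetical names, and `ROW` is the fifth 60-residue line of the sequence block (residues 241 to 300).

```python
# Residues 241-300 of the protein, copied from the sequence block.
ROW = "LYSDKEKLRSKKLKMKLEKKKRRRKEEWEIAESIRWLAEVVVRSEQARMETMKEIEKMRA"
ROW_FIRST = 241  # 1-based position of ROW[0] in the full protein

# (start, end, motif) exactly as listed in the NLS table.
NLS = [
    (250, 261, "KKLKMKLEKKKR"),
    (258, 263, "KKKRRR"),
    (258, 264, "KKKRRRK"),
    (259, 264, "KKRRRK"),
]


def slice_zero_based(start: int, end: int) -> str:
    """Slice treating start/end as 0-based inclusive protein coordinates."""
    offset = ROW_FIRST - 1  # 0-based index of ROW[0] in the full protein
    return ROW[start - offset:end - offset + 1]


def slice_one_based(start: int, end: int) -> str:
    """Slice treating start/end as 1-based inclusive protein coordinates."""
    return ROW[start - ROW_FIRST:end - ROW_FIRST + 1]


for start, end, motif in NLS:
    zero_ok = slice_zero_based(start, end) == motif
    one_ok = slice_one_based(start, end) == motif
    print(start, end, motif, "0-based:", zero_ok, "1-based:", one_ok)
```

When comparing NLS positions with the domain table, add 1 to the NLS Start and End under this reading (for example, motif 1 would span residues 251 to 262).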
Cis-element
Source       Link
PlantRegMap  evm.model.supercontig_403.1
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID                       E-value  Description
Refseq  XP_021910206.1               0.0      trihelix transcription factor ASIL1
TrEMBL  A0A1R3H2H5                   1e-146   A0A1R3H2H5_COCAP; Uncharacterized protein
TrEMBL  A0A2C9W2Y8                   1e-147   A0A2C9W2Y8_MANES; Uncharacterized protein
STRING  evm.model.supercontig_403.1  0.0      (Carica papaya)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM98592533
Representative plant  OGRP44091421
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G54390.1  1e-100   sequence-specific DNA binding transcription factors