PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gorai.007G154900.1
Common NameB456_007G154900, LOC105800972
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family Trihelix
Protein Properties Length: 279aa    MW: 32254.5 Da    PI: 9.6635
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gorai.007G154900.1genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix61.61.9e-1921107287
            trihelix   2 WtkqevlaLiearremeerlrrgklkkplWeevskkm......rergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                         Wt+qe+l Li+a++e++ +l+r+klk+ +W+ev+  +       +    +++ qC++k+e+l++ry+ +++g  ++      ++py+d +e
  Gorai.007G154900.1  21 WTHQETLNLIQAYQEKWYSLQRSKLKAWQWQEVAVTVavrcghLDDSPAKTALQCRHKMEKLRRRYRSERQGLASG-----AHWPYYDAME 106
                         ************************************999888878888999***********************97.....57*******8 PP

            trihelix  87 a 87 
                         a
  Gorai.007G154900.1 107 A 107
                         5 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF138374.0E-1820108No hitNo description
SMARTSM005951.9E-728113IPR006578MADF domain
Sequence ? help Back to Top
Protein Sequence    Length: 279 aa     Download sequence    Send to blast
MSNISPAADP TPGKKRQPLP WTHQETLNLI QAYQEKWYSL QRSKLKAWQW QEVAVTVAVR  60
CGHLDDSPAK TALQCRHKME KLRRRYRSER QGLASGAHWP YYDAMEALEH EPLTISARPL  120
ASLVPTRGLK FYSENGHEAG NNYYRDYYYD DDEENNQFSK SRSINNILRR PSAVNRFSGF  180
LSGGRKRIRG EEEGNNDVAV VAMEEENKGM ALAVEIRRFG EKLMWVERKR MQMMRETERL  240
RMEMENTRIE MILDANKKFV DVISASFGSS KVDQKLGS*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1183189GRKRIRG
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_012487775.10.0PREDICTED: trihelix transcription factor ASIL1
TrEMBLA0A0D2P9N90.0A0A0D2P9N9_GOSRA; Uncharacterized protein
STRINGGorai.007G154900.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM102812232
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G24860.12e-43Trihelix family protein
Publications ? help Back to Top
  1. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]