PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gorai.012G018400.1
Common NameB456_012G018400, LOC105780128
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family Trihelix
Protein Properties Length: 351aa    MW: 38772.2 Da    PI: 10.2353
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gorai.012G018400.1genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix51.52.6e-1647130186
            trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkm....rergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                         +W++  v  L+ea+++++   +r+klk+++We+v++++      ++  ++++qCk+k+e+++kry+ + +++ +      s++p++ +l+
  Gorai.012G018400.1  47 EWSEGAVSSLLEAYENKWVLRNRAKLKGHDWEDVARYVsaraNCTKSPKTQTQCKNKIESMKKRYRSESATADG------SSWPLYPRLD 130
                         5*************************************844444455556679****************99997......4699999986 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF138376.8E-2045130No hitNo description
Sequence ? help Back to Top
Protein Sequence    Length: 351 aa     Download sequence    Send to blast
MEKETNQENP SLLSNNNISI TKEDSSPKKH PGNTAAAGGG DRLKRDEWSE GAVSSLLEAY  60
ENKWVLRNRA KLKGHDWEDV ARYVSARANC TKSPKTQTQC KNKIESMKKR YRSESATADG  120
SSWPLYPRLD LLLRGSTAPP PPPLLPPQLQ PSAVPQAATP ISTNPPLMTL PEPSMMVVLQ  180
QQHPPPPPPH LAPQLPGTTQ NSHGSNGIDR IPKEDGAGTK SSGHLSDKIA METDSSTPAL  240
YSDRERPRSK KAKMKIETMA TMMKKKKKRR KEECEIGGSI QWLAQVVLKS EQARMETMKE  300
IEKMRVEAEA KRGEMDLKRT EIIANTQLEI ARLFAGSNKG VDSSLRIGRN *
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1264269KKKKRR
2264270KKKKRRK
3265269KKKRR
4265270KKKRRK
5266270KKRRK
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankHQ5274983e-65HQ527498.1 Gossypium herbaceum clone NBRI_C_EYT27PB01AP1QB simple sequence repeat marker, mRNA sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_012459717.10.0PREDICTED: trihelix transcription factor ASIL2
TrEMBLA0A0D2VUE20.0A0A0D2VUE2_GOSRA; Uncharacterized protein
STRINGGorai.012G018400.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM98592533
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G54390.11e-103sequence-specific DNA binding transcription factors
Publications ? help Back to Top
  1. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]