PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gorai.002G231400.1
Common NameB456_002G231400, LOC105786254
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family Trihelix
Protein Properties Length: 443aa    MW: 50964 Da    PI: 6.7681
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gorai.002G231400.1genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix78.41.1e-24117213185
            trihelix   1 rWtkqevlaLiearremeerlrrgk...........lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkr.tsessstc 79 
                         +Wt+++v++Li+a+++++e++  ++           +kk++W++vsk+++erg+++sp+qC++k+++lnkrykk+ ++ +++ +++++++ 
  Gorai.002G231400.1 117 KWTDKMVRLLITAVSYIGEDMAGDCgggirrkfavlQKKGKWKSVSKVIAERGYHVSPQQCEDKFNDLNKRYKKLYDMLGRGiSCQVVENP 207
                         7*********************99888899999999*********************************************9999999999 PP

            trihelix  80 pyfdql 85 
                         +++d +
  Gorai.002G231400.1 208 ALLDVI 213
                         888766 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF138372.8E-21115240No hitNo description
Sequence ? help Back to Top
Protein Sequence    Length: 443 aa     Download sequence    Send to blast
MEGNLSRGII PGGSSFGGLD LQGSMMVHHR AQNPHNMHHH HHHPNPRRGT SAHPRFPLTV  60
GTMQNSDQPV TVIDYNKMEI GKCSVSDEDE PSFAEEGVDG HNDGNKGKKG SPWQRVKWTD  120
KMVRLLITAV SYIGEDMAGD CGGGIRRKFA VLQKKGKWKS VSKVIAERGY HVSPQQCEDK  180
FNDLNKRYKK LYDMLGRGIS CQVVENPALL DVIDYLTEKE KDDVRKILSS KHLFYEEMCS  240
YHNGNRLHLP HDLKLQRSLQ LALRRRDEDE NDDVRRHQHD DLDDDDHDME TDDHDELEEN  300
HASHGDNRAI FGAPGGSTKR SRQSQVHEDA CFQKFLNSQD CNKSSFSCPP VAQADTNQVL  360
PDYSRAAWLQ KQWTESRSLQ LEEQKLQIQV EMLELEKQRF KWQRFSKKSD CELEKIRMEN  420
ERMKLENERM ALELKRKELA AD*
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_012468067.10.0PREDICTED: uncharacterized protein LOC105786254
TrEMBLA0A0D2QHP80.0A0A0D2QHP8_GOSRA; Uncharacterized protein
STRINGGorai.002G231400.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM50422752
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G21200.11e-151sequence-specific DNA binding transcription factors
Publications ? help Back to Top
  1. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]