PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gh_D04G1686
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family Trihelix
Protein Properties Length: 416aa    MW: 47470 Da    PI: 6.9639
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gh_D04G1686genomeNAU-NBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix67.72.4e-2187181183
     trihelix   1 rWtkqevlaLiearremeerlrrgk...........lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkr.tsessstcpyfd 83 
                  +Wt ++v++Li++ ++++e+ ++++           +k ++W+ vsk+m erg+ +sp+qC++k++nlnk y+++++  +++ +++++++ +++d
  Gh_D04G1686  87 KWTGKMVKLLITILSYIGEDPSTDCagnqikvssllRKLGKWKCVSKVMLERGYIVSPQQCEDKFNNLNKTYRRLNDLLGRGtSCKVVENPKLLD 181
                  7**********************988899999999999*******************************************95588887776665 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF138371.2E-1885209No hitNo description
Sequence ? help Back to Top
Protein Sequence    Length: 416 aa     Download sequence    Send to blast
MEGKPSAGGN MLQEWEYGCL GLQGSIQPQN EQQPCMSKLP SAFGSVENER REITVIEDDV  60
TNYAKQCMLE HNEAGKNEDG PPWQRMKWTG KMVKLLITIL SYIGEDPSTD CAGNQIKVSS  120
LLRKLGKWKC VSKVMLERGY IVSPQQCEDK FNNLNKTYRR LNDLLGRGTS CKVVENPKLL  180
DIINVSEKGK EDVRKLLMSK HLFFEEMCSY HNGNRMYLPH DPDLLQSLLF ILKNEDDYEL  240
LDSNQPTPDK KAGVTAKDNE DFAEFSAKWL ELISENGIAP SCSNQTLNAQ GDATEYNGAN  300
SGFSAEISTL WFKPMNDNEV VGPTSSLKPS CFNQIPDTED NEADGSQWMT RQAYQLEKRK  360
LRLKSKVFDL EKQRLKWRRR SWKQDMELEK MRLVNKCLKH GNEYIALQLK GKKIGS
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_016679503.10.0PREDICTED: uncharacterized protein LOC107898530
TrEMBLA0A1U8ISV10.0A0A1U8ISV1_GOSHI; uncharacterized protein LOC107898530
STRINGGorai.012G159500.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM2214945
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G21200.11e-82sequence-specific DNA binding transcription factors