PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gh_D12G2117
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family Trihelix
Protein Properties Length: 424aa    MW: 47583.4 Da    PI: 8.3474
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gh_D12G2117genomeNAU-NBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix73.53.7e-2325113186
     trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkm...rergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                  +Wtk+e+laLi+a++e++ +lrrg+lk+ +W++vs+++   ++ g  +s++qC++k+e+l+kry+++k+++ k++ + ss++ ++  l+
  Gh_D12G2117  25 CWTKDETLALIDAYKEKWFALRRGNLKASDWDAVSDVVssaSDPGTVKSSVQCRHKIEKLRKRYRAEKQRSLKNSGKFSSSWDLYPLLD 113
                  8*************************************998899***************************************998876 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF138376.0E-2124115No hitNo description
Sequence ? help Back to Top
Protein Sequence    Length: 424 aa     Download sequence    Send to blast
MSTTTSPPLP QPSSATAARR VPPPCWTKDE TLALIDAYKE KWFALRRGNL KASDWDAVSD  60
VVSSASDPGT VKSSVQCRHK IEKLRKRYRA EKQRSLKNSG KFSSSWDLYP LLDSMNFAST  120
SVAGSDDQDH SIDHKVTVFG DFCLKSNKRE NIDGNSGSNL GFDHEFRGGH NSSFNFDHKW  180
LENGGFVAKG IKKFKSDGRI GDGYGSMVDF DHSFGQDVDS LGEFPLKTLG DRSFLNVGFK  240
SKNYGCLNLN YDYDNDSKEY SIDEEMGFRA RDLGAWDSVP QGIHQKKRGR VDMNFEPGGD  300
CRGSNGDASC SRPGPERKNA GAGVKRGVDP VDEMVSSIKL LAEGFVRMEK MKMEMVKEIE  360
KMRMEMEMKH NEMILESQQK IVDAFSSALS EKKKKKKKPS LMFSNMNGNG VEEWQEDAFI  420
KKER
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1390398EKKKKKKKP
2391397KKKKKKK
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Ghi.211611e-176boll
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_016736625.10.0PREDICTED: uncharacterized protein LOC107946701
TrEMBLA0A1U8NC800.0A0A1U8NC80_GOSHI; uncharacterized protein LOC107946701
STRINGGorai.008G230600.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM104831828
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G44730.14e-23Trihelix family protein