PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gh_A01G1686
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family Trihelix
Protein Properties Length: 444aa    MW: 51051.1 Da    PI: 6.6491
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gh_A01G1686genomeNAU-NBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix811.7e-25117213185
     trihelix   1 rWtkqevlaLiearremeerlrrgk...........lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkr.tsessstcpyfdql 85 
                  +Wt+++v++Li+a+++++e++  ++           +kk++W++vsk+++erg+++sp+qC++k+++lnkrykk++++ +++ +++++++ +++d +
  Gh_A01G1686 117 KWTDKMVRLLITAVSYIGEDMAGDCgggmrrkfavlQKKGKWKSVSKVIAERGYHVSPQQCEDKFNDLNKRYKKLNDMLGRGiSCQVVENPALLDVI 213
                  7*********************98888889999999*********************************************9999999999888766 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF138374.1E-22115242No hitNo description
Sequence ? help Back to Top
Protein Sequence    Length: 444 aa     Download sequence    Send to blast
MEGNLSRGMI PGGSSFGGLD LQGSMMVHHH AQNPHNIHHH HHHPNPRRGT SAHPGFPLTA  60
GTMQNSDQPI SMIDYNKMEI GKCSVSDEDE PSFAEEGVDG HNDGNKGKKG SPWQRVKWTD  120
KMVRLLITAV SYIGEDMAGD CGGGMRRKFA VLQKKGKWKS VSKVIAERGY HVSPQQCEDK  180
FNDLNKRYKK LNDMLGRGIS CQVVENPALL DVIDYLTEKE KDDDDVRKIL SSKHLFYEEM  240
CSYHNGNRLH LPHDLKLQRS LQLALRRRDE NENDDVRRHQ HDDLDDDDHD METDDHDELE  300
ENHASHGDNR VIFGAPGGST KRPRQSQVHE DACFQKFLNS QDCNKSSFSC PPVAQADTNQ  360
VLPDYSRAAW LQKQWIESRS LQLEEQKLQI QVEMLELEKQ RFKWQRFSKK SDRELEKIRM  420
ENERMKLENE QMALELKRKG LAAD
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Ghi.110560.0boll
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_016707828.10.0PREDICTED: uncharacterized protein LOC107922344
TrEMBLA0A1U8L3B80.0A0A1U8L3B8_GOSHI; uncharacterized protein LOC107922344
STRINGGorai.002G231400.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM50422752
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G21200.11e-152sequence-specific DNA binding transcription factors