PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_024971739.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Carduoideae; Cardueae; Carduinae; Cynara; Cynara cardunculus; Cynara cardunculus subsp. cardunculus
Family Trihelix
Protein Properties Length: 575 aa    MW: 64965.8 Da    pI: 7.0506
Description Trihelix family protein
Gene Model
Gene Model ID    Type    Source  Coding Sequence
XP_024971739.1   genome  NCBI    View CDS
Signature Domain
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  94.5   1.1e-29  49     132  2          87
        trihelix   2 WtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                     W++qe+laL+++r++me ++r++ +k+plW+evs+k+ e g++rs k+Ckek+en+ k+yk+ikeg+ ++   + +t+++fdqlea
  XP_024971739.1  49 WPRQETLALLKIRSDMEFAFRDATAKGPLWDEVSRKLGELGYHRSGKKCKEKFENVYKYYKRIKEGRTSK--ADGKTYRFFDQLEA 132
                     ********************************************************************96..66678*******85 PP

2    trihelix  103.2  2e-32    392    477  1          87
        trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                     rW+k ev aLi++r+ ++ +++++  k+plWee+s+ m + g++r++k+Ckekwen+nk+ykk+ke++k+r +e+s+tcpyf+ql+a
  XP_024971739.1 392 RWPKAEVYALINLRTTLDMKYQDSGPKGPLWEEISAGMLRLGYNRNAKRCKEKWENINKYYKKVKESSKRR-AEDSKTCPYFHQLDA 477
                     8********************************************************************98.89999********85 PP
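
The Start/End coordinates of the two hits can be cross-checked against the alignments: the number of non-gap residues in each aligned query row should equal End - Start + 1. The following is a minimal Python sketch of that check, assuming 1-based inclusive coordinates and `-` as the gap character. The names `HITS`, `ungapped_length` and `coordinates_consistent` are illustrative and not part of PlantTFDB.

```python
# Aligned query rows copied from the two trihelix hits above,
# paired with the Start/End values from the Signature Domain table.
HITS = [
    (49, 132,
     "WPRQETLALLKIRSDMEFAFRDATAKGPLWDEVSRKLGELGYHRSGKKCKEKFENVYKYYKRIKEGRTSK--ADGKTYRFFDQLEA"),
    (392, 477,
     "RWPKAEVYALINLRTTLDMKYQDSGPKGPLWEEISAGMLRLGYNRNAKRCKEKWENINKYYKKVKESSKRR-AEDSKTCPYFHQLDA"),
]


def ungapped_length(aligned_row: str) -> int:
    """Count residues in an aligned row, ignoring '-' gap characters."""
    return sum(1 for ch in aligned_row if ch != "-")


def coordinates_consistent(start: int, end: int, aligned_row: str) -> bool:
    """True if the row covers exactly the 1-based inclusive range start..end."""
    return ungapped_length(aligned_row) == end - start + 1


for start, end, row in HITS:
    print(start, end, ungapped_length(row), coordinates_consistent(start, end, row))
```

Both hits pass this check, which suggests the table coordinates refer to positions in the full protein sequence rather than to alignment columns.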

Sequence
Protein Sequence    Length: 575 aa
MLPGPVANTA DAAASSQPPP QPQASVAVEF SEEERGRFEG ERSSGGNGWP RQETLALLKI  60
RSDMEFAFRD ATAKGPLWDE VSRKLGELGY HRSGKKCKEK FENVYKYYKR IKEGRTSKAD  120
GKTYRFFDQL EALEANLGGP THPTPPPPAM ATSNITTTQL PFSTHPPVTV PSVAIPFGSH  180
QNNVSPISVA APAVVMPHVG GFPFSHPNIS ASTNSTSSAN SSDNEPPVVR KKRKRKWKDF  240
VGRLMKEVIH KQEELQMKFL DQIEKRERER MAREEAWRME EMAKMNREHD MLVQERSIAA  300
AKDAAVITFL QKITEQNPNA AVPQILQQQP QNQPPPPQPI QQQQKQQNLQ PPPAPAPPPQ  360
QHQFQVPALP VVKNVDNSGE VKPNMLSPSP SRWPKAEVYA LINLRTTLDM KYQDSGPKGP  420
LWEEISAGML RLGYNRNAKR CKEKWENINK YYKKVKESSK RRAEDSKTCP YFHQLDAIYR  480
EKANNSSHNN PTRFPESTQM EAIMAEPEQQ WPLPAVVQQQ QQQSTIHQSH ENVDHQNNED  540
DYDDEDEEEE DEGGEYEIVS NKNSLSMGAV RVTEA
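
The sequence is displayed in groups of 10 residues with a running residue count at the end of each full line, so it needs cleaning before use in other tools. The sketch below, in Python, strips everything that is not a letter and reports the resulting length, which should agree with the stated 575 aa. `RAW_BLOCK` and `clean_sequence` are hypothetical names chosen for this example.

```python
import re

# Sequence block exactly as displayed above (spaces and line-end counts included).
RAW_BLOCK = """
MLPGPVANTA DAAASSQPPP QPQASVAVEF SEEERGRFEG ERSSGGNGWP RQETLALLKI  60
RSDMEFAFRD ATAKGPLWDE VSRKLGELGY HRSGKKCKEK FENVYKYYKR IKEGRTSKAD  120
GKTYRFFDQL EALEANLGGP THPTPPPPAM ATSNITTTQL PFSTHPPVTV PSVAIPFGSH  180
QNNVSPISVA APAVVMPHVG GFPFSHPNIS ASTNSTSSAN SSDNEPPVVR KKRKRKWKDF  240
VGRLMKEVIH KQEELQMKFL DQIEKRERER MAREEAWRME EMAKMNREHD MLVQERSIAA  300
AKDAAVITFL QKITEQNPNA AVPQILQQQP QNQPPPPQPI QQQQKQQNLQ PPPAPAPPPQ  360
QHQFQVPALP VVKNVDNSGE VKPNMLSPSP SRWPKAEVYA LINLRTTLDM KYQDSGPKGP  420
LWEEISAGML RLGYNRNAKR CKEKWENINK YYKKVKESSK RRAEDSKTCP YFHQLDAIYR  480
EKANNSSHNN PTRFPESTQM EAIMAEPEQQ WPLPAVVQQQ QQQSTIHQSH ENVDHQNNED  540
DYDDEDEEEE DEGGEYEIVS NKNSLSMGAV RVTEA
"""


def clean_sequence(block: str) -> str:
    """Remove whitespace and position counts, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


sequence = clean_sequence(RAW_BLOCK)
print(len(sequence))                 # compare with the stated length of 575 aa
print(f">XP_024971739.1\n{sequence}")  # simple FASTA-style output
```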
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    230    235  RKKRKR
2    230    236  RKKRKRK
3    231    236  KKRKRK
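
The three predicted signals overlap in the same basic stretch of the protein. As a hedged illustration, the Python sketch below checks that each Start/End pair reproduces the listed motif when read as 1-based inclusive positions. Only residues 181-240 of the protein sequence are embedded to keep the example short. `SEGMENT`, `SEGMENT_START` and `slice_1based` are names assumed for this example.

```python
# Residues 181-240 of the protein sequence shown in the Sequence section.
SEGMENT = "QNNVSPISVAAPAVVMPHVGGFPFSHPNISASTNSTSSANSSDNEPPVVRKKRKRKWKDF"
SEGMENT_START = 181  # 1-based position of SEGMENT[0] in the full protein

# (start, end, motif) rows from the NLS table above.
NLS_ROWS = [
    (230, 235, "RKKRKR"),
    (230, 236, "RKKRKRK"),
    (231, 236, "KKRKRK"),
]


def slice_1based(segment: str, segment_start: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from an offset segment."""
    return segment[start - segment_start : end - segment_start + 1]


for start, end, motif in NLS_ROWS:
    found = slice_1based(SEGMENT, SEGMENT_START, start, end)
    print(start, end, found, found == motif)
```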
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G76880.1  1e-116   Trihelix family protein