PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Pav_sc0004270.1_g020.1.mk
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Amygdaleae; Prunus
Family G2-like
Protein Properties Length: 477aa    MW: 52317.3 Da    PI: 8.0484
Description G2-like family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Pav_sc0004270.1_g020.1.mkgenomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1G2-like64.61.8e-20116169155
                    G2-like   1 kprlrWtpeLHerFveaveqLGGsekAtPktilelmkvkgLtlehvkSHLQkYRl 55 
                                k+r +W + L e+F+ a++++ G ++AtPk+ile+m+ ++Lt+++v+SHLQkYR+
  Pav_sc0004270.1_g020.1.mk 116 KKRCEWKKSLGEKFMLAITHI-GLDNATPKRILEFMNEPDLTIKNVASHLQKYRI 169
                                689******************.********************************6 PP

2Myb_DNA-binding223.9e-07118168148
                                TSSS-HHHHHHHHHHHHHTTTT..-HHHHHHHHT.TTS-HHHHHHHHHHHT CS
            Myb_DNA-binding   1 rgrWTteEdellvdavkqlGgg..tWktIartmg.kgRtlkqcksrwqkyl 48 
                                r  W + + e++  a++  G +  t+k+I + m+   +t k++ s++qky+
  Pav_sc0004270.1_g020.1.mk 118 RCEWKKSLGEKFMLAITHIGLDnaTPKRILEFMNePDLTIKNVASHLQKYR 168
                                568999********************************************7 PP

Sequence ? help Back to Top
Protein Sequence    Length: 477 aa     Download sequence    
MASKSVLIVA LLFSVLILCL QGTYADRIEG WEGGHARFYG GGDAPASDAI GGGFGNGNWD  60
SQGYGFSNGS CYGLKCASCP KWCLTGDVEK QEKRAKRGRK RSREGDDEES TAAPRKKRCE  120
WKKSLGEKFM LAITHIGLDN ATPKRILEFM NEPDLTIKNV ASHLQKYRIF LMKQLNVAVA  180
GDEDMRERLR RSSFALGHPE LFFNNNERDQ HHSQLLKQQQ MGTSIGSTFQ PSAGIGCTLP  240
LTAAASNNHS SIQFPNYQQS STSNSSRSIP QLIGSGQSSL LNNNPANFLR QQPMLGNGNG  300
DQLFCQQNRS LAPFGMQQLS NNFEKGGMSC GPMNNLGLTY NNIGTNNLMQ IYPPQSQPRT  360
NNPFSYNIAT SVNQNGSNFT PMSSSFDNLG SHFNDVNQFR VYWLLSCSAG IETGASSAYQ  420
FPPNLPDKNG DLDQQQNVPV LPPQEGNFND QCRLPNINVG GGNVENSNRG FMDNTST
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
199118RKRSREGDDEESTAAPRKKR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G07210.14e-20ARR-B family protein