PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG91596.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family AP2
Protein Properties Length: 1290aa    MW: 135238 Da    PI: 6.5036
Description AP2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG91596.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1AP236.51.2e-11195242155
         AP2   1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55 
                 s+y+G++ ++  grW A+I++      r+ ++lg+f+t+eeAa a+++a  k++g
  GBG91596.1 195 SRYRGITAHH--GRWQARIKEA-----RRDIHLGYFDTEEEAAHAYDRAVIKFRG 242
                 78****9876..*********3.....5**********************99997 PP

2AP250.17e-16583630155
         AP2   1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55 
                 s+y+G+++++  grW A+I+d      rk ++lg+++t+eeAa+a+++a++k++g
  GBG91596.1 583 SQYRGITRHH--GRWQARIKDS-----RKDIHLGYYDTEEEAARAYDRAALKYRG 630
                 78*****876..*********4.....5*************************97 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1290 aa     Download sequence    
MAAGQGFRLF TEKKVSKHCA MADMTSSTSR QKSAGNNGAT FGGWKGPTRY AERVTPEDLI  60
TSLRTTGRAD LRPVRTYVKP VGFVSSSSPQ LPALAASSSS PAVVDDSGFM ALATAASSSS  120
GEEPTSCSVQ TTTWCADRGE MGWCPVEGTA LGVGAAAAAA AAAAAAASAA AAAAAAGNAG  180
GGGFVARGWK SPRGSRYRGI TAHHGRWQAR IKEARRDIHL GYFDTEEEAA HAYDRAVIKF  240
RGGNAATNFC ISDYIDLLDA EQNAAGLGSG LSHEMESQKG PLPIDCERSK HAKPLPPSIL  300
QGCFPLSSPA SRSLHTCPRC GKPKRGHACI MDGRAAVAAA AARPTTPFAI PFPSNEEFSA  360
AMVNGMVRVT TQQQLGSAAG HPATGTQHPP SYLHLGVPST LFTPMDGTKP GAGVGGGGGG  420
GSGGTGGDQW HMHGIPPSAQ TGLVSGTPIL GAPIGSASTS GSSACFSVTE SGTSATSTQS  480
TAFGGMKMGV CGSEVAAHFP SMGAPVGRCN EYITVGACLP ARLVEDGVEL RESCRTEQGP  540
GLDTGASHPT TPFLTVDTSL TEPGFEKEGL VQRSRRRAFR RSSQYRGITR HHGRWQARIK  600
DSRKDIHLGY YDTEEEAARA YDRAALKYRG VTALTNFPMT EYAPGLLLGH GCAADESSTG  660
NKDFLNGWSG GNNPQVRVGS EQSQGNMADG ELEARVAEAG SSMALACIPS NGEDMCGEAK  720
VESEENDAKR MRGVCEGGDR GVCSALILFD DTACFGEESG AWHAVEGTHD GIGSVGEGAC  780
IRQDESMVQL EPIGNQPTEG AQSEGSLEVT VDQQTQDQDR MEVQERQGIP SDPSVSLECA  840
DNKRLSGGKM DNMQDSHGWE RVVFCTEDVG DRHGAESSDG PPMGDGTNPS AMPLRSTERM  900
AVSRVESYGT AVQDRGLNSG LASEQVFMSS IGGECTSSRD NGASTAEGQG GVPRVCEMES  960
AEASPKREGA DLSTQGVEAL IEAARQVPII MNEGRQQKQG RGDKSAYVET HGECLASVDA  1020
CKRGDGGKVE ERQELFTNCS QGLVLGRVAG ERAVEFRSGA TQGRLCAAVD DMGAGRAIDG  1080
REGSVIAGAA GMDANRRMME GGGPEAVNWP AGEDIDANDM IPTGLDRCFG GHIGGGCDVS  1140
GGNCKVQSGS EGIRGCSNGS CGSKTFCGKR CRSRVTCCAE RMLTNGRLSC GISTCLAVDN  1200
VSSRSLGRRV LRSWSDELHR RRLTLDLAQK HPNESMRKKR LFRERAGGGI RDQAVMDCRL  1260
QMGMESRTLL CQGGFSWRER KLQGQARSRA
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
112201240RRRLTLDLAQKHPNESMRKKR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G51190.15e-16AP2 family protein