PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG91597.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family AP2
Protein Properties Length: 1234aa    MW: 129228 Da    PI: 6.6465
Description AP2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG91597.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1AP236.61.1e-11195242155
         AP2   1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55 
                 s+y+G++ ++  grW A+I++      r+ ++lg+f+t+eeAa a+++a  k++g
  GBG91597.1 195 SRYRGITAHH--GRWQARIKEA-----RRDIHLGYFDTEEEAAHAYDRAVIKFRG 242
                 78****9876..*********3.....5**********************99997 PP

2AP250.26.6e-16532579155
         AP2   1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55 
                 s+y+G+++++  grW A+I+d      rk ++lg+++t+eeAa+a+++a++k++g
  GBG91597.1 532 SQYRGITRHH--GRWQARIKDS-----RKDIHLGYYDTEEEAARAYDRAALKYRG 579
                 78*****876..*********4.....5*************************97 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1234 aa     Download sequence    
MAAGQGFRLF TEKKVSKHCA MADMTSSTSR QKSAGNNGAT FGGWKGPTRY AERVTPEDLI  60
TSLRTTGRAD LRPVRTYVKP VGFVSSSSPQ LPALAASSSS PAVVDDSGFM ALATAASSSS  120
GEEPTSCSVQ TTTWCADRGE MGWCPVEGTA LGVGAAAAAA AAAAAAASAA AAAAAAGNAG  180
GGGFVARGWK SPRGSRYRGI TAHHGRWQAR IKEARRDIHL GYFDTEEEAA HAYDRAVIKF  240
RGGNAATNFC ISDYIDLLDA EQNAAGLGSG LSHEMESQKG PLPIDCERSK HAKPLPPSIL  300
QGCFPLSSPA SRSLHTCPRC GKPKRGHACI MDGRAAVAAA AARPTTPFAI PFPSNEEFSA  360
AMVNGMVRVT TQQQLGSAAG HPATGTQHPP SYLHLGVPST LFTPMDGTKP GAGVGGGGGG  420
GSGGTGGDQW HMHGIPPSAQ TGLVSGTPIL GAPIGSASTS GSSACFSVTE SGTSATSTQS  480
TAFGGMKMGV CGSEVAAHFP SMGAPVGRCN EPGFEKEGLV QRSRRRAFRR SSQYRGITRH  540
HGRWQARIKD SRKDIHLGYY DTEEEAARAY DRAALKYRGV TALTNFPMTE YAPGLLLGHG  600
CAADESSTGN KDFLNGWSGG NNPQVRVGSE QSQGNMADGE LEARVAEAGS SMALACIPSN  660
GEDMCGEAKV ESEENDAKRM RGVCEGGDRG VCSALILFDD TACFGEESGA WHAVEGTHDG  720
IGSVGEGACI RQDESMVQLE PIGNQPTEGA QSEGSLEVTV DQQTQDQDRM EVQERQGIPS  780
DPSVSLECAD NKRLSGGKMD NMQDSHGWER VVFCTEDVGD RHGAESSDGP PMGDGTNPSA  840
MPLRSTERMA VSRVESYGTA VQDRGLNSGL ASEQVFMSSI GGECTSSRDN GASTAEGQGG  900
VPRVCEMESA EASPKREGAD LSTQGVEALI EAARQVPIIM NEGRQQKQGR GDKSAYVETH  960
GECLASVDAC KRGDGGKVEE RQELFTNCSQ GLVLGRVAGE RAVEFRSGAT QGRLCAAVDD  1020
MGAGRAIDGR EGSVIAGAAG MDANRRMMEG GGPEAVNWPA GEDIDANDMI PTGLDRCFGG  1080
HIGGGCDVSG GNCKVQSGSE GIRGCSNGSC GSKTFCGKRC RSRVTCCAER MLTNGRLSCG  1140
ISTCLAVDNV SSRSLGRRVL RSWSDELHRR RLTLDLAQKH PNESMRKKRL FRERAGGGIR  1200
DQAVMDCRLQ MGMESRTLLC QGGFSWRERK VSSA
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
111691189RRRLTLDLAQKHPNESMRKKR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G51190.15e-16AP2 family protein