PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GBG82161.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family ERF
Protein Properties Length: 5021 aa    MW: 529249 Da    pI: 5.7186
Description ERF family protein
Gene Model
Gene Model ID    Type      Source    Coding Sequence
GBG82161.1       genome    NCBI      View CDS
Signature Domain
No.    Domain    Score    E-value    Start    End     HMM Start    HMM End
1      AP2       34.1     6.6e-11    2565     2610    3            52
         AP2    3 ykGVrwdkkrgrWvAeIrdpsengkr.krfslgkfgtaeeAakaaiaarkk 52  
                  y GV + + +g+W+A+ rd   n k+ + + lg f+t+eeAa+a+++a+++
  GBG82161.1 2565 YIGVKRLP-SGKWLAQFRD---N-KNpACIDLGVFDTEEEAARAYDEAARR 2610
                  88***999.7******888...3.379*********************986 PP
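The coordinates in the domain table can be cross-checked against the alignment rows: the number of ungapped residues in each row should equal End - Start + 1 for that row. The following is a minimal sketch of that check. It assumes HMMER-style gap conventions ('-' and '.' both mark gap columns); the function names are illustrative and not part of PlantTFDB.

```python
# Sketch: check that the alignment rows agree with the reported coordinates.
# Assumption: '-' and '.' are gap characters (HMMER-style output).
GAP_CHARS = "-."


def ungapped_length(row: str) -> int:
    """Count the residues in an alignment row, ignoring gap characters."""
    return sum(1 for ch in row if ch not in GAP_CHARS)


def span(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range."""
    return end - start + 1


# Rows copied from the AP2 alignment in this record.
hmm_row = "ykGVrwdkkrgrWvAeIrdpsengkr.krfslgkfgtaeeAakaaiaarkk"
query_row = "YIGVKRLP-SGKWLAQFRD---N-KNpACIDLGVFDTEEEAARAYDEAARR"

# Both rows must have the same number of alignment columns.
assert len(hmm_row) == len(query_row)

# HMM Start 3 to HMM End 52 covers 50 model positions.
assert ungapped_length(hmm_row) == span(3, 52)

# Start 2565 to End 2610 covers 46 residues of GBG82161.1.
assert ungapped_length(query_row) == span(2565, 2610)
```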

Sequence
Protein Sequence    Length: 5021 aa
MGKLNWKRKG GGKGAELTTN KKSRLVVKGI DDGNNNNNGA NDEKVVDSVR EARGRGRQES  60
ADVKRAGGVG VVGVRADGRE TDEEEGPAVD SGKKHHVRRE DVSEANTQRV TRQSTRLEAT  120
GDGVRGHGKG MKKRGTDDDA GRNCGVEAVE GAGAAGGNER VTRQTVRLRN QGIASSHSWQ  180
APNALAHVAA DKRASSGNGG GRCRGSIRGR EDDRGGLMDQ SDVGSRAGRA RGASDGCAVA  240
VTKRAGKKRR EAFPARLGDA GQRRRLLLVE ESDEEAKDED EDEEDEEEAE EEEAEAKQTE  300
EGEREGEEVE EEAEANQGEA GKEEVGDKET GAVGKEEREG EEEEIKRRHD IEKSPRRRAG  360
QASSRRTADA KVVRGGLRAA SPLEVNRDAS PSEVNRTAKR KKGGKGDCMS PSLTSEKDCR  420
LRKRSPARRL EMSCDGLGTI KHRSSPRKSI DGRSDGDDGA CRDDDDDAEE RALIEDEEGE  480
EAHGRDYHSE AREEEETPHS DAAFACSPVL SSPSAGMRAS KGPPAGLARE RKKRVFGQGS  540
RLPAAAGRDE LSNDGQSPDF VESAKLRTAS ARDDTDAGES SEASLSGGRM ADARVKKVQG  600
EDDHPWKRKE KASQGVEGSD HGDEEKSGGT AGAQRDRDVE PGSRGRDAAS MDGNVINNVV  660
GSEVKIGAGG IRKEHVSRTE RGREVGVLAG TAPLAVLAGV KGASLSPTDV AGRRARLRPT  720
GRRISSRSGH STVVEESPRP EVEAAAELPV ARLEVVRRRL RPRRADGSAG IVETEQIATS  780
PPLASASPPL GGTQTEAGGA CVPRQQASLP CDDGSKIPSR NSKAGAPPEE IDFREPPTTS  840
VRAEVGNGGN GGGGGGGSHH DCSPADAEDV AAGGGDNASE AARGRSHANQ QHQHPPFLTP  900
PVTYRRRGTA GSSPPLASPG GTCGSTAAGS GKPAKTTALS SVATGGPEHP ASSLPRPPTT  960
QGVQVEAKLE GKEGERQQPR TEEEKEKGGD GRRPGKVSKE VKEVGKEDER EEEDKEEERE  1020
EEDKEEERQE EDKEEGREEG KKGQERKAEE DEEKEVEERK EAKEEERKVD EENEKEVEGK  1080
EEETKEAQEE ERKEEEEGKE KEREQEKGEK TGKDGEEILK EDEDEIQEFG GVEEKGIASH  1140
EKPEEGKEEE EEEEKNKDDD DVFTGGDDKQ RNWRSSATPR NLSKDDFSPE ADAHGPIGGG  1200
PGESSVFTDV AIAKSLPCLD SEEALGIMVD ELTCAVIKNT ATATAAATVD IKDHVGATCS  1260
PHSRPQKILR GNSLKKSKST SSSSTPAKKA LQALAADDNT RITRSGGGGG RKSGSGSAPC  1320
RREVEDSVAA NGHPDSSALS DSAKRRLDLE TENIKLPKDG NMLGGSRQQH SRKTRELPER  1380
AARCKRPLVD RVDDGSADED DGCGRYKRRR GQRDEDNFAD DKNANDDERV DDDNDDDKYR  1440
GRDDGGGEGP SEGSGRRQER DRKGAGGGAT FLSAAGIGER AVFAKGSNAT TTRAGGSSGS  1500
GRSVTASRAH GGVKGSKGPS AQVGTVSASA SKWCHRNEEK KGVSGQALGA AAGKKGGMKV  1560
ARTRAAALRD SGGGNETGNE RTSAVIAPPP PPLPPIARCE SKKVLCMSTP FRTFKLCFRH  1620
ILEDPSAPYR QCDYVFPATG ARCTTPVLIN PPEDARFCSA HARMMMMMGS SSRAASSADG  1680
KSTTRPGGAA TPVPPPAPIV SPSLSSGRLT SLASSTVNVS TTVRPDRCSP NPSSPPPLLK  1740
KVKQESVENG NEEGDRMTMR PPPPPPPPPP ASPRPPPLSI SQSGKKRYAA VGIRADDRGS  1800
GVAHHSDHEI AKTEHLPLAS GALAASTTAG GHVGGEAKVV GGDGDDGAED GPSGGQARPN  1860
SRGSAGNAGK VGPRGGDDGK RPMARRRDAV EERVETASGG GAAAGSGDDA CCEKSKKKKW  1920
KDSIVTCSAE RRGAASVARE PMPAKDGTRA GEALVERRVD NGSTRRGKSL GNGADLDEDE  1980
EERGAVDERA GATAAGAERP CLREDDHGND AGDGRGRVEQ EGRAKGIGGL SGKTVVKMEE  2040
GWEAGGGRVT PAGISRVTTH TERMSPAHSA RGRGFSPTFG NNERKADVSG SGRILNNDHS  2100
RVGEHPGLVV SNARESERRR NKVTRVHVKV EDDVGGQPVG RAGSDRGCVE GRKTGGEIPV  2160
KDVKEEERRR HKATQMGMAR REEDGARKTE TGRGTNDRGG LEDESGGQAV RRGLQEGGRR  2220
GAGVEGSVMA QSGKRGGSDE SALGGQGQDG DAAGRRDGCS RQRGAMFAPV GRNYYGVTPR  2280
PWGNFIAEIK DPNGGLCTLL GSFAKEEDAA RAYDREVLRW GRNLPLNFPK ESSRGLENST  2340
KSGSVSRKGG RGAAGGGGGG GDEGVGVGRR GGVGTDTRKE EGGGRGGARD GAGAIGKGGG  2400
GGGVREGGVR EGGGRGGVRE GGGRVVVRED VREGGGRVRE AGREGGGAVL KKVVFVLKGE  2460
TMAEGEWDGE VDPNCKDGEE EPCWSDEEEG EIVGKSSVGS RRRGSARELK CVPPPHSESE  2520
GSRKGERIDR PVTPLTAAGA AERDVKTAKV ASPSASGRVQ YSVNYIGVKR LPSGKWLAQF  2580
RDNKNPACID LGVFDTEEEA ARAYDEAARR RPEENRLLNF PIGKKEEGQG VRSSKSGKNG  2640
EGNWGGERKR RKGRKGRGCL SPARTSPHQC DEGAKSPWRL SPGRTLSLNQ SDEEEDDPDQ  2700
GRNMGVGESS TEGGGGNDGR EVVRGGDPGT GSGSGTRLSE SESGKSRVVL HHEDRPPVVD  2760
GNGARLNKRE RDGEGMKDAG SGCRGLKGSS PSEDRSGDED AVELVGITRG IKCENAKHSV  2820
LCSSGNALGS SRNEAGRRED QGDESERRKE DVEGESVARV GTTADVDSVT RSAESKQKGE  2880
KMVNDGVDEE VRVGCLAEAG EAVSARDAGS DPEGERGKPR KQQIMITSQR MHSPDSRKGS  2940
GQQVSPRPLS SPPPLPVPPD EGPRGKTPDS TRSDSRVEQN PLREDNSGNV CMSPKGTNPP  3000
ERIRELQPPS RLLSGPDAAS PLDVMIMPGS PSNRQCTVRQ IPKAASADSA SAAVGPNQEQ  3060
HPPRLRRRFV GPEMVDHHIG SKEVISPTKP PELLEQLERK NLQHASAEGT PIRRLEDDQG  3120
LVPRSQHDSL VKDQQGRVCH QCSASVVVPS GTAGEVSVEQ QQSLAQQRHT GRMEKGAVAA  3180
AIDGSSQSSG TVSPSSMTKQ SAHPVSRPSV PRYSSRSEEL IIIDDEDHCL PDDRKMAADQ  3240
GRPLAAQERI LTKDAELGSV AVNLQPSEAM STSSPAASME HAGLRSELAA AGCIAAAAQQ  3300
TPPVRLHHHH DQPQQQCMRA PFLSIPQSLA ALQQQQQQAR PAAEQLHHAA PGSVQWIASN  3360
AVAQQQRSGM LSSQRMPAQQ RATIPLSHSE QLAWQILELS SAINRQQQLQ QQQQQQQQQQ  3420
QQQQQQQQQQ QQQQQQQQLQ QQQQQQQLQQ QQQQQLQKHQ QQQLQKHQQQ QVQLQQQQQQ  3480
QLQKHQQQQV QLQQQQQQLQ QLRHHHHHHR HQLQQHVGMP SSVATQAGTI NIQQQRAAIV  3540
SNPQVAGSQR LSGISAAALR QQAGISGGSG GSLRLGHSLG VSTAQQLIAL HRSGGKAMPT  3600
VQVQAPHDTI GTYETRQDLG VGSSGVTTST VDIQNSAEKP TSLYDLAAAG DHLLFKAWSS  3660
QSSAAVVSAS PPPVAVPVES GTQSVMTPPQ LQQGQPDSAG RTTVRQRVTH PLMSPRLVAL  3720
QVPLQPSMQQ QQHPGATMSP ATQPALGGDV QQTAAMGSGL FVVPGNASSA SNPTKVSTVA  3780
AGGGVGIGLS WQALALLQQQ QQQGMTGRQQ QVSDGHKVNV HANMVQRASN QQQMQALSQV  3840
ATVQQILRGS QAEAAAVASA AKALCPNSHH SGHQALSSTQ AQMISPNPVA AAAATASQAA  3900
QAQLQMAAMV RQQQQAGSAT SHIGIATLAE CQMHVPSCAL NPSQCAGPAA SAGLIHQQKT  3960
SHVLQPSESQ THFRVTAASA QSEVHSTGGT SVWLSNLAAG ETPRMAHSSA GPLPSQPQFR  4020
QSRENQQPQP ISPIQCQSTS AAAPNRIVPP GANPQVGRMF ERSAAEGTLF AVPGGGARAL  4080
AGGSESTCRL PVEACGQTSA RVVRSFLSVH QTAALPSLMA RMQQGTPSAA TQLHYTKQGR  4140
SSLRSTGSSD SQKVDNLALF SKKLGLHSLV LGLRSHVCQS APVPGQLQSG DFTGDDDDDD  4200
DGDDDCQIIG ILLQNTYERR TGSMAVPLPP VESGLGRGLS VDDRCQPVAG TFRCATCSLR  4260
IPNLAGDSGL AQEVHEMSTA VPPGMIGLPG DPGGAKDGEQ RDCAAANHES LALLEPTAVG  4320
IGDESFSLLE PMTGAEPFLA SFGRSLDSVV GLAASPKKST ATDPRTGPTA GPSTGSGTGP  4380
STGGGAELSQ GSITGSAEQI LHNGCVESVS REVAAEANSG IVMPSGEGPM DSGVKEIENP  4440
AGRGTGCQLQ PSLDDGNGDW VESMTSVALE FACALEGGEN QPKSKAAADM NANVRHIEEL  4500
GTKSAASQPS GNVDGGAVGK EGSVDLEMEV TVESCATSTL PGEMDSAMCI DKGGAVLEMI  4560
SEMKDVSGIK QLEDCLLEEP ADGSPPENQT DEETGMVACK EVNGKKADEG AEVFADKSSD  4620
RGGEDGEGIE AFTCVARSTN DDKDVKLVES PSKEEVGVCL GEGLDQKGSH GKPNVSELES  4680
TPLEETSLQG QAAADNRECG ESPRESRKGF KEGNPQVDDL VSFLAQEKSG APCQVACAEV  4740
VACGQGSDKE SGTVATPSSV IERVNSEAAD DLAEWGVVSE VVEDPMWENE WGRRAASSPR  4800
TRGGWIEELE DEGAEDMNAS DNVIRQWKQG LVQEGILERH SFTSGAVLEA YKEKADKVER  4860
AVQTIMETAA VDATEQSDVG EAVANVGVKG RDGMALCGNA SVVVSQETEA LEVGSFDGGH  4920
PQVCSVNKAG GGGGPTSGPL SKEQVVPTMT SGLGNGMIRS DEVGAAEDAA ASADNGVTCS  4980
EGSGKVEAHN TCTPVLPFST LRVEVSDKEN REENDVCCND S
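The sequence above is laid out in blocks of 10 residues with a running position counter at the end of each full line. A minimal sketch of turning such lines back into a plain residue string is shown here. The helper name is hypothetical, and it assumes that any all-digit token is a position counter and not sequence.

```python
# Sketch: strip block spacing and trailing position counters from
# sequence lines formatted like the ones in this record.
def residues_from_line(line: str) -> str:
    """Join the residue blocks of one line, dropping all-digit tokens."""
    return "".join(tok for tok in line.split() if not tok.isdigit())


# First and last lines of the protein sequence, copied from this record.
first_line = "MGKLNWKRKG GGKGAELTTN KKSRLVVKGI DDGNNNNNGA NDEKVVDSVR EARGRGRQES  60"
last_line = "EGSGKVEAHN TCTPVLPFST LRVEVSDKEN REENDVCCND S"

first = residues_from_line(first_line)
last = residues_from_line(last_line)

# A full line holds 60 residues; the last line holds the remainder.
assert len(first) == 60
# The line before the last one ends at position 4980, so the total
# length is 4980 plus the residues on the last line.
assert 4980 + len(last) == 5021
```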
Nuclear Localization Signal
NLS
No.    Start    End     Sequence
1      1404     1412    GRYKRRRGQ
2      1737     1744    PLLKKVKQ
3      2349     2357    GGRGAAGGG
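The Start and End columns read as 1-based, inclusive positions in the protein sequence. As a minimal sketch of how such coordinates map onto a string slice, the example below recovers the first signal from the sequence line covering positions 1381 to 1440. The function name and the `offset` parameter are illustrative assumptions, not part of the database.

```python
# Sketch: extract a 1-based, inclusive range from a sequence fragment.
def extract(fragment: str, start: int, end: int, offset: int = 1) -> str:
    """Return residues start..end (1-based, inclusive).

    `offset` is the sequence position of the first residue in `fragment`.
    """
    return fragment[start - offset : end - offset + 1]


# Residues 1381-1440 of GBG82161.1, copied from the sequence in this record
# with the block spacing removed.
fragment_1381 = (
    "AARCKRPLVD" "RVDDGSADED" "DGCGRYKRRR"
    "GQRDEDNFAD" "DKNANDDERV" "DDDNDDDKYR"
)

nls_1 = extract(fragment_1381, 1404, 1412, offset=1381)
assert nls_1 == "GRYKRRRGQ"
```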
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT2G33710.2    8e-11      ERF family protein