PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GBG86218.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family C2H2
Protein Properties Length: 1545aa    MW: 164461 Da    PI: 5.0192
Description C2H2 family protein
Gene Model
Gene Model ID    Type      Source    Coding Sequence
GBG86218.1       genome    NCBI      View CDS
Signature Domain
No.  Domain   Score  E-value  Start  End   HMM Start  HMM End
1    zf-C2H2  13.7   0.00018  1448   1467  1          21
                  EEETTTTEEESSHHHHHHHHH CS
     zf-C2H2    1 ykCpdCgksFsrksnLkrHir 21  
                  ++CpdCg++F ++ +L+rH  
  GBG86218.1 1448 FPCPDCGRRF-NHASLNRHVL 1467
                  89********.8888888865 PP
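
The alignment above can be cross-checked against the domain table: removing the gap from the query row should leave exactly as many residues as the Start and End coordinates span. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, though the rows shown are consistent with it); the names ungapped and span_length are illustrative and not part of PlantTFDB.

```python
# Query row of the zf-C2H2 alignment, copied from the record above.
QUERY_ALIGNED = "FPCPDCGRRF-NHASLNRHVL"
# Start/End from the domain table; assumed 1-based and inclusive.
START, END = 1448, 1467


def ungapped(aligned: str) -> str:
    """Remove alignment gap characters ('-' and '.') from an aligned row."""
    return aligned.replace("-", "").replace(".", "")


def span_length(start: int, end: int) -> int:
    """Number of residues covered by a 1-based, inclusive coordinate pair."""
    return end - start + 1


query = ungapped(QUERY_ALIGNED)
coords_consistent = len(query) == span_length(START, END)
print(query, len(query), coords_consistent)
```

The aligned row has 21 columns (matching HMM positions 1 to 21), while the ungapped query has 20 residues, which agrees with the span 1448 to 1467.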

Sequence
Protein Sequence    Length: 1545 aa
MLCRSQAHVE CMRSFQREVG RPHKVVDLSE YAAKKKEALK RAEALREEKK RESEGKGPRT  60
RKYSWGGLPA EGLPIDYTWT KPDEDPRTLQ TESSQVSVFS SSAPRTPKQR RKPPLDTNLA  120
EAVAGSIARL TRGNGDAGGG DSGGSTPSER STGALRSSCQ SLPDPAAISR SVARQFRDIS  180
GDPRSSPRSS ASAGVVSRSP RAASDRRGRA DNGGGQGAGG CRKDKEQVCS SSGNQPASGT  240
DEGGEQTLSS PVFCLSPSSS MRGGVNDTPT LEGAGEDLLL PQGKPAAAHD DESAECGQED  300
GFDGCCSVRS GSQGDLPQSP SSLMRETKAF AAKKKPPAIS VLKEFSAFGR SLDRRGDGSG  360
MWGSVRGCVK SGSVNDLRRS VGNLREIAGS SSGGKAPLCT TEMMARASKS PQMKFVPSSI  420
QVGGKVGGKE NGAEERKKAL GAAGSRLSSS VVFERNHFSG KGGPASSTRD GYGEKCAAVP  480
KWPCVGPLPT KTGKSRPGLA GGSGLVGTSA YKSSPRSTAE AGEPKSNKAA AAGEEKRHDG  540
GKGNDKGRGR EGPEVGMGSD GYRGNCGEST DVPRVSDAMG SAGPPSSVLP TAKALEEDVE  600
YRKDLDRTDE YRKELDRTDK YKRKDLDRTD DHAENDGAME LIGEEEATFG ALGRAMEPRI  660
GKEEGDKNGE EEVVLAEGRM LEAAAAAGNA LADRAVEDRK LSSHQLLRKG EMEEDETKDA  720
SGLRRKSVDT RREEEKAAAE KKEREVVATT KPHENDTELA PAQILAVQAT TRDPYAEGEE  780
DTVRAPSYAS QSQAAERLLG EVEAKKKSDA GQRRVSIDAG RETPRQYMIE IVSEGKDDGN  840
EARNEGGGLA MESSVEQTGV QEKSIRFADE LGTRGGGPYE GREVTEGRSH ASRAEEEGSD  900
VQCSRVAEEE EVGDGSTHPG RQACLIPGEE IEEERDDAVV MEEAGGSVEG ETADRCETER  960
QGRTSAQRGG RTSEEEHDGT ELFAGSQRED RSAGIEEGRL DGAAGSAKLV SAFQNAIVAE  1020
GATSANDSGD EDKTEDDTKS KSQRCCKKTS CTVGMAGEHG EEEEEEGGGR GVSKEKPRTE  1080
HDQYGQSDGG VEIKEQDSER REDARAYGQS TASEHNSNNN RNSWQEEVST EEERENGGNS  1140
GCDMSGESSL EFETPLSTLK SERESSLEFE TPSSTFKPER GRGGGVIDFR KSIELWEKSA  1200
ECGFVTKDGR RIDGNTKSLG KLTLGEAIRY TKRKMKEGGG GGGGRDRNGT GGGTPPLPVR  1260
EKDGSGTTTK HGPPSRYRLP LTPAAPLIRA PAPPAVRSLV PSTTCDTNAP VEEGVAGKDE  1320
GSGEGVADGR GNLSETNAGD RTSDAQEDVV EDRANGADRV KNGCGIEKSE MEEEREDSDA  1380
TDDDRTAIED ARGAAHRWSE SSPAEQEAEQ QRHKEGGEHN SVRGAHVVGG ATSAVGQMTG  1440
AATGEAMFPC PDCGRRFNHA SLNRHVLFCK SSGGRARIRP SFRVNGSSSS DHERSWGHDE  1500
NLVARVGGSE ATCKDSAGKH QQQQQQQEQQ EQRPHQPQQQ EQEHP
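
The sequence listing above uses blocks of 10 residues with a running position counter at the end of each full line. A minimal sketch of how such lines might be turned back into a plain sequence string is shown here; parse_blocked is a hypothetical helper, and only the last two lines of the listing are used to keep the example short.

```python
import re

# Last two lines of the listing above, copied verbatim.
TAIL_LINES = [
    "AATGEAMFPC PDCGRRFNHA SLNRHVLFCK SSGGRARIRP SFRVNGSSSS DHERSWGHDE  1500",
    "NLVARVGGSE ATCKDSAGKH QQQQQQQEQQ EQRPHQPQQQ EQEHP",
]


def parse_blocked(lines):
    """Join blocked sequence lines into one string.

    A trailing position counter (digits at the end of the line) is dropped
    and the spaces between 10-residue blocks are removed.
    """
    parts = []
    for line in lines:
        body = re.sub(r"\s+\d+\s*$", "", line)  # strip trailing counter, if any
        parts.append("".join(body.split()))     # remove block separators
    return "".join(parts)


tail = parse_blocked(TAIL_LINES)
# The line before TAIL_LINES ends at position 1440 according to its counter.
total_length = 1440 + len(tail)
print(len(tail), total_length)
```

With these two lines the sketch recovers 105 residues, so the total comes to 1545, in agreement with the length stated in the record.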
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    40     51   KRAEALREEKKR
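
The NLS coordinates can be checked directly against the protein sequence. The sketch below assumes Start and End are 1-based and inclusive; slice_1based is an illustrative helper, and only the first 60 residues of the sequence are needed.

```python
# First line of the protein sequence (residues 1-60), block spaces removed.
N_TERMINUS = "".join(
    "MLCRSQAHVE CMRSFQREVG RPHKVVDLSE YAAKKKEALK RAEALREEKK RESEGKGPRT".split()
)


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive."""
    return seq[start - 1:end]


nls = slice_1based(N_TERMINUS, 40, 51)
print(nls)  # KRAEALREEKKR
```

The extracted segment matches the Sequence column of the NLS table.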