PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_023913552.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fagales; Fagaceae; Quercus
Family C2H2
Protein Properties Length: 1632aa    MW: 180449 Da    PI: 5.6884
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_023913552.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H211.30.00115171542123
                      EEET..TTTEEESSHHHHHHHHHH.T CS
         zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt.H 23  
                      +kC    C +sF++k +L  H r+ +
  XP_023913552.1 1517 HKCDleGCRMSFKTKAELLLHKRNrC 1542
                      789999****************9876 PP

2zf-C2H2110.001316001626123
                      EEET..TTTEEESSHHHHHHHHHH..T CS
         zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                      ykC+   Cg sF+  s++ rH r+  H
  XP_023913552.1 1600 YKCKveGCGLSFRFVSDYSRHRRKtgH 1626
                      99*********************9777 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1632 aa     Download sequence    
MGNVEIPNWL KGLPLAPEFR PTDTEFADPI AYISKIEKQA SAFGICKIIP PLPKPSKKYV  60
FCNLNKSLSK SPELGSDVNC PGVCSSSKTG SGDGGNDGEA RAVFTTRHQE LGQSVKRTKG  120
AAQSPQLGVH KQVWQSGEIY SLEQFESKSK VFSRSILGMV KEVSPLVVEA MFWKATSEKP  180
IYVEYANDIP GSGFGEPEGQ SRYFHRRRRK RNFYQRSKEN SDCKNNEIDS VTDSLNDEIK  240
DSSAKNEPDI SLETPKPPTT SSTLLSDDTS RCSRRKSSSA INEMEGTAGW KLSNSPWNLQ  300
IIARSPGSLT RFMPDDIPGV TSPMVYIGML FSWFAWHVED HELHSMNFLH TGSAKTWYAV  360
PGDYAFAFEE VIRSEAYGVN IDHLAALTLL GEKTTLLSPE VVVASGIPCC RLVQNPGEFV  420
VTFPRAYHVG FSHGFNCGEA ANFGTPQWLT VAKEAAVRRA AMNYLPMLSH QQLLYLLTMS  480
FVSRVPRSLL PGVRSSRLRD RQKEERELLV KKAFIEDILK ENIKLSNLLG KVSTCHAVLW  540
NADLLPYPSR DSQLPSIVAT DSTSPRENAS HIHTENNNNI QNDLIHEMNL YIETLNDLSL  600
DGDDLSCDFQ VDSGTLACVA CGILGFPFMS VVQPSEEASL KLQPMDHRLV QGGPRVSGPE  660
NACSSPDCDG FIKSNIPENL PPVPDVSLPP KDLKMSLLTK FDTQWNTSSK FLRPRIFCLE  720
HAIEIVELLQ SKGGVNMLMI CHSDYQKMKA HAVAVSEEIG KPFDYNEVPL DTASQEDLNL  780
IDLAIDDEEH DECGEDWTSK LGINLRYCAK VRKNSPSMQV QHALTLGGLF SEKSATSECL  840
TVKWQSRRSR SRKSNHSSHG KVCDTAQSKK DEVLGGRSNG IIVNNEKKLI QYSRRNPKLK  900
LGGSTDASTV DGCPGKNLSK DVSAATYGDR DKQSGKATEN DLSNKGNSKS DAVFVFSNAS  960
GVSEMQHESS GFAATSGGSL NSAPSRIEDS PASDTLVVVE TQSENHTLDD FDIDGKACNV  1020
ATWDGSEMQP KIKSTDETKE EDKTTYAKKC GSPLIIATDE RSGMQAENQS KEKISITNES  1080
FHLVSEGQCN VSTEGDVLMN EVSDLSKPPT PHIADPVVRN FEAQMENVVL EESCINSKVL  1140
VCATLDNEVQ QKIHTTSRIE YDGPLSSNVA PINQLTLAST EGVSCNIAVT NHPTLASMQE  1200
SCEGPTEIRA AEDISIAMSS DAVEQELESE NGSTEDEPVS SYVIPTNEPT ASREIHSPDR  1260
NNEELVSSSV SPMEVSQPCV SLEQCPEVPR GCSAEEDLHD GVTLDTEVQQ DIQTSDGTDE  1320
GMPVPSFITQ VEKEPVTISG GECYKVPRGI NAEENLSGGV ALDNEVEQKK ELTNENDEEF  1380
SEYDTLINPP SPAPIQKRSR IRRDTHVENL LHKEVCSSQD DRELESIEST LLEPRSSTDK  1440
RRKRKREADH LTENKFGCSD FIRSPCEGLR PRAGKDATSR CGIDTSKIVE EKLVRKKVSK  1500
ASDVPLPPKN KNSMGSHKCD LEGCRMSFKT KAELLLHKRN RCPHEGCGKR FSNHKYAMLH  1560
KRVHDDDRPL KCPWKGCTMS FKWAWARTEH IRVHTGERPY KCKVEGCGLS FRFVSDYSRH  1620
RRKTGHYVNS PA
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
114401446KRRKRKR
214411446RRKRKR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G04240.10.0C2H2 family protein