PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_023917830.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fagales; Fagaceae; Quercus
Family C2H2
Protein Properties Length: 1471aa    MW: 164453 Da    PI: 9.0402
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_023917830.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H213.40.0002313801403223
                      EET..TTTEEESSHHHHHHHHHHT CS
         zf-C2H2    2 kCp..dCgksFsrksnLkrHirtH 23  
                      +Cp   Cgk F ++ +L++H r+H
  XP_023917830.1 1380 VCPvkGCGKKFFSHKYLVQHRRVH 1403
                      69999*****************99 PP

2zf-C2H212.50.0004314391465123
                      EEET..TTTEEESSHHHHHHHHHH..T CS
         zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                      y+C+   Cg++F+  s++ rH r+  H
  XP_023917830.1 1439 YVCGepGCGQTFRFVSDFSRHKRKtgH 1465
                      89********************99666 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1471 aa     Download sequence    
MAASEAAQEV VLSWLKTLPL APEYHPTLSE FQDPISYIFK IEKEASKYGI CKIVPPVPPS  60
PKKTAISNLN RSLLARNPDS DPKSAPTFTT RQQQIGFCPR KPRPVQRPVW QSGEWYTFQE  120
FEAKAKAFEK AYLKKCGGGS GGGNGNGKKT TPLSALEIET LYWKATVDKP FRVEYANDMP  180
GSAFVPLNAK KSKISSSSEG VSLGETAWNM RGVSRANGSL LKFMKEEIPG VTSPMVYVAM  240
MFSWFAWHVE DHDLHSLNYL HMGAGKTWYG VPREAAVAFE EVVRVHGYGG EINPLVTFAT  300
LGEKTTVMSP EVFISAGVPC CRLVQNAGEF VVTFPRAYHT GFSHGFNCGE AANIATPEWL  360
RVAKDAAIRR ASINYPPMVS HFQLLYDLAL ALHSRMPMGN NAGPRSSRLK DKKKGEGETV  420
VKELFAQNVV QNNELLHILG KGSSIVLLPR SSSDISVCSK LRVGSQLRVN PTLTPGLCSS  480
KEANPGIGQV KNFLPVKGKF GSFYERFDNI CSSNSKTLNT DSERGSTAQG DGLSDQRLFS  540
CVTCGILSFA CVAIVQPRDA ASRYLMTADC SFLNDWTVGS GITGNGFAIA NGDAITSDQN  600
THPGWKEKSI PDGLFDVPVQ SHDGQRQMED QRYEVVSNTE TQREPSALGL LAMTYGNSSD  660
SEDDQGEPDF PACAEQKKLT NSSSESIYQC DNSGLPSMQD CPQGATGVRS PSLSRHGVED  720
GSNQTSDCSA EFRTDDPASR RSDGLMDTFS DPITVSHVSS DCSLDVHDVD QTKFGKENVP  780
RENKNMPFAP RSDEDSSRMH VFCLEHAKEV EKQLRRIGGV HILLLCHPDY PKIVDDAKSL  840
AEEMGIDYPW NNITFRDATK EDEDRIRSAL DSQEAIPGNG DWAVKLGINL FYSANLSRSP  900
LYSKQMPYNS VIYNAFGRCS PASSPKKAKV YRRRSGRQRK VVAGKWCGKV WMSHQVHPFL  960
AKGDSDDEEE EDMSFQTWTM PDEKFEIKSE STHKSETTMV ARKYGRKRKM TVESGSAKKA  1020
KFIDRGDAFF DYSVEDNSHQ QGRTPRGKLA KSVERDEAVS DDSLDNYSHE LQRTSKSKEA  1080
FFDEREDAVS DDSLMDDSHQ QHRRIHRGKQ SKCFDKEDAV SDSSLGHNFH QHNRRIVKSK  1140
IGREAFSDDS LEGNSHQQIK RYNRRNQAKC IEREDAVLDD SLGVNSHKQH RRLPNIKQAT  1200
SIEREDAVSD VPLDDDSHQQ HRRILRNKPM RVEISQPMKK GPPRRVKKGT SWSTKQVTHR  1260
PIKKESPQLM KQQTPRLRNN QSERNSSHIG LLVEEDQEGG PSTRLRKRTK KTVKESEAKP  1320
KAKKQASRIK VRKNASAVKA PAGHNDPKIG DGEVEFLCDI EGCTMSFGSK HELVLHKRNV  1380
CPVKGCGKKF FSHKYLVQHR RVHLDERPLR CPWKGCKMTF KWAWARTEHI RVHTGARPYV  1440
CGEPGCGQTF RFVSDFSRHK RKTGHSAKKG R
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1925940PKKAKVYRRRSGRQRK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G48430.10.0C2H2 family protein