PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID TEY80708.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Lamiales; Lamiaceae; Nepetoideae; Mentheae; Salvia; Calosphace; core Calosphace
Family C3H
Protein Properties Length: 871aa    MW: 97373.8 Da    PI: 7.5077
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
TEY80708.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-CCCH20.86.6e-07277298426
                 ---SGGGGTS--TTTTT-SS-SS CS
     zf-CCCH   4 elCrffartGtCkyGdrCkFaHg 26 
                 ++Cr+f + G C+ Gd+C+F+H 
  TEY80708.1 277 VPCRNFVK-GSCRWGDNCRFSHH 298
                 79******.*************5 PP

Sequence ? help Back to Top
Protein Sequence    Length: 871 aa     Download sequence    
MGERRKRKSL WSEEAETKPF TGNHSGIGKD KHPSHSGEQN HEFPASGSHG LPKFRYHSGH  60
PSMESIHEDP VDWINGSYPR VRENAYQEKE IGGENSYYQD MSPKFKYNHL LEDDQSHSRR  120
YQGRGRSRSR SRSRSKGRCR SRSLSRGMER ERTRGESRSD SDARGWTRSR SPLGDYKHQA  180
SGWSDGRSLP EMSNEADFAP GRGRRDDFSK YFLPKNTNHR DGALGGRDTS ESRRSRADHG  240
NDSKLSYSRG ARIESRSDAS DPYHGENEQF LNNSRNVPCR NFVKGSCRWG DNCRFSHHIA  300
SDETCVEGTK RSSSDKDPEH KPYRNRNPLC KYFAAGKCDR DNCKFFHEDP NFNRLEGRLG  360
KVNNDHNSRD GSSRDNDSIS RDNDLSSRDK VNKRDGLRWE SCNEAAGISN DMNPSGGDSS  420
TVIVTESTVD NSNAGQNNTM SHSLGNEKVN WGLPQLTTNL VPVHHAGNGS YGGDAGTTES  480
AIEENMMAEQ DVLVFHGSLL QNPDGNTNKL QVDQRFPLNA WQQNVSLVSH IQHQNHGVVE  540
NNVNGKSVLL GSSSITTEAG SKDMPALNGA EIHHVANLNL SQIQKLQNAI QMAGISETNT  600
MLSFPKSVIS EQSAQYTNTL PSTTLQQVPP VPNQRFENQH STKIPIEEVS NSIEMVPSTA  660
NASGHVPLDC AYMQLNRVTV SHDPISSAEN GRNSNEHEEG TKMPPTDNTE IDNSERSQEQ  720
QVMAHKNAEC DEVKGGMGEQ SKGVQVSKHS ETLEDHGKAD ESIGNKDDKG TRLFKNSLIE  780
FVKDKLKPKW KEGRMSRDVH KTIVKKVVDK ITSSIQTDHV PKTQEKIEQY LSFSEPKITK  840
LVQVNCVIQS VLFKPTSLSK DFLDILRSGC T
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1431RRKRKSLWSEEAETKPFTGNHSGIGKDK
2532RRKRKSLWSEEAETKPFTGNHSGIGKDK
3124132RGRSRSRSR
4124134RGRSRSRSRSR
5124136RGRSRSRSRSRSK
6126134RGRSRSRSR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G33835.11e-08C3H family protein