PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID TEY68531.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Lamiales; Lamiaceae; Nepetoideae; Mentheae; Salvia; Calosphace; core Calosphace
Family C2H2
Protein Properties Length: 877 aa    MW: 97584.4 Da    pI: 8.2822
Description C2H2 family protein
Gene Model
Gene Model ID    Type      Source    Coding Sequence
TEY68531.1       genome    NCBI      View CDS
Signature Domain
No.   Domain    Score   E-value   Start   End   HMM Start   HMM End
1     zf-C2H2   14.8    8.2e-05   79      100   2           23
                 EETTTTEEESSHHHHHHHHHHT CS
     zf-C2H2   2 kCpdCgksFsrksnLkrHirtH 23 
                 kC++C + F ++ n++rHir+H
  TEY68531.1  79 KCEKCYREFCSPINYRRHIRVH 100
                 7*******************99 PP
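The aligned segment can also be checked against the classical C2H2 spacing of two cysteines and two histidines. The sketch below is illustrative only: it assumes a simplified textbook consensus, C-X(2,4)-C-X(12)-H-X(3,5)-H, written as a regular expression. It is not the Pfam zf-C2H2 profile HMM that produced the score and E-value above, and the names `C2H2_PATTERN` and `find_c2h2` are hypothetical.

```python
import re

# Assumption: a simplified classical C2H2 consensus, C-X(2,4)-C-X(12)-H-X(3,5)-H.
# This is a spacing check only, not the profile HMM used by the database.
C2H2_PATTERN = re.compile(r"C.{2,4}C.{12}H.{3,5}H")


def find_c2h2(segment):
    """Return (start, end, text) of the first match, or None.

    start and end are 1-based, inclusive, and relative to the segment.
    """
    match = C2H2_PATTERN.search(segment)
    if match is None:
        return None
    return match.start() + 1, match.end(), match.group()


# Residues 79-100 of TEY68531.1, copied from the alignment above.
hit = find_c2h2("KCEKCYREFCSPINYRRHIRVH")
print(hit)  # (2, 22, 'CEKCYREFCSPINYRRHIRVH')
```

Adding 78 to the segment-relative positions places the match at residues 80 to 100 of the full protein, inside the reported domain.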

Sequence
Protein Sequence    Length: 877 aa
MPAAKLYSSG TLNAMKSEEG NDSLGTFIKQ ATGREPLLPF PRTVDGPVQW IQLLNALDQP  60
VVDFSGLPLL TPVKVQMQKC EKCYREFCSP INYRRHIRVH RRSLNINKEW HKNRDLLAEF  120
WDELSLQHAK ELVSFNDIIL KDIPGSSVVK ALSSSLRKAG VWTLPQAYVK AGATLLDIIQ  180
AKPSRLPISS QELFSILDDA SERTFLCAGT ADSVQNYVFD GETAKNSLEL KNLVALTSFL  240
FEQLLVKAWV ADKDAEALRC QKLLVEEEEA EQKRQAVIVE RKKQKKRRQK EQKVKEQYCG  300
CSSNLNVYID AVDRPTSAEA SDMSSPSGSN SNSQAVPTTI DSVQPQNKES DEDIEVQFDA  360
SSEHKKQRDS PAVELQMLVA NGHRHAATNR LQALKSQRGS RYGLHADLNP QTLKPELMHK  420
LGPSKDRRIE NGSKVWTKKF KNNNDWENPR PPSLEEVESN RIEQNNGEVI IGSIPVTLKS  480
YYDQQEASPP NETQDTCSAE QLKKKNASEK PVKCNSLQSG TNRAPSKLWR PVSHGETKNV  540
LTVGRINEDL EDSAMLTKVH NHTPSSERSG QSESLDSDGC QNGKQFHVIS DDNAQGGLMP  600
FYSEAVKEFL ARRWKEAIAG DHIKLALSSE REPPGRPDVQ PASGPPRSNG ENQCGRIIQA  660
QQQFSLTNHV GRNRSIGDIT LPSEDHNHPE TANAALNPAG KPSGLFTFRQ LNALAIGVVL  720
SASGMPFVFG ESNKLLSLYV AAGAAIGLFL PIAYIFEGVF EGDKEGIKAA APHVFLLASQ  780
VFVEGLSFSG GFSLPIRVFV PVFYNSRRIF TIVDWIRGEI WKADGRYEGS ARRLYVGRAL  840
AVANMAFWCF NLFGFLLPVY LPKAFRIYYQ SSHIKPN
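The sequence is displayed in blocks of ten residues with a running position counter at the end of each line, so it needs cleaning before any coordinate lookup. The sketch below assumes the coordinates in this record are 1-based and inclusive, and uses only the first two sequence lines (residues 1-120) as an excerpt. The helper names `clean_sequence` and `slice_1based` are hypothetical.

```python
import re

# First two lines of the sequence block above (residues 1-120), copied verbatim.
RAW_EXCERPT = """\
MPAAKLYSSG TLNAMKSEEG NDSLGTFIKQ ATGREPLLPF PRTVDGPVQW IQLLNALDQP  60
VVDFSGLPLL TPVKVQMQKC EKCYREFCSP INYRRHIRVH RRSLNINKEW HKNRDLLAEF  120
"""


def clean_sequence(raw):
    """Drop whitespace and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def slice_1based(seq, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


excerpt = clean_sequence(RAW_EXCERPT)
print(len(excerpt))                    # 120
print(slice_1based(excerpt, 79, 100))  # KCEKCYREFCSPINYRRHIRVH
```

The slice for positions 79 to 100 reproduces the query line of the zf-C2H2 alignment, which supports the 1-based inclusive reading of the Start and End columns.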
Nuclear Localization Signal
NLS
No.   Start   End   Sequence
1     281     287   RKKQKKR
2     281     289   RKKQKKRRQ
3     282     290   RKKQKKRRQ
4     285     290   KKRRQK
Best hit in Arabidopsis thaliana
Hit ID        E-value   Description
AT4G25610.1   1e-111    C2H2 family protein