PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GAY38549.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Sapindales; Rutaceae; Aurantioideae; Citrus
Family C2H2
Protein Properties Length: 590 aa    MW: 64596.8 Da    pI: 9.419
Description C2H2 family protein
Gene Model
Gene Model ID  Type    Source
GAY38549.1     genome  NCBI
Signature Domain
No.  Domain   Score  E-value  Start  End  HMM Start  HMM End
1    zf-C2H2  16.9   1.7e-05  98     120  1          23
                 EEETTTTEEESSHHHHHHHHHHT CS
     zf-C2H2   1 ykCpdCgksFsrksnLkrHirtH 23 
                 ++C++C+k F r  nL+ H r H
  GAY38549.1  98 FVCEICNKGFQRDQNLQLHRRGH 120
                 89*******************88 PP

2    zf-C2H2  12.5   0.00045  174    196  1          23
                 EEETTTTEEESSHHHHHHHHHHT CS
     zf-C2H2   1 ykCpdCgksFsrksnLkrHirtH 23 
                 +kC +C+k++  +s+ k H +t+
  GAY38549.1 174 WKCDKCSKRYAVQSDWKAHSKTC 196
                 58*****************9998 PP
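
The two alignments above can be compared position by position. The sketch below assumes that uppercase letters in the zf-C2H2 consensus line mark the strongly conserved HMM positions (the usual HMMER display convention) and lists where each hit differs at those positions. The function name `conserved_mismatches` is hypothetical; the strings are copied from the alignments in this record.

```python
# Consensus and hit strings copied from the two alignment blocks in this record.
CONSENSUS = "ykCpdCgksFsrksnLkrHirtH"
HITS = {
    1: "FVCEICNKGFQRDQNLQLHRRGH",  # residues 98-120
    2: "WKCDKCSKRYAVQSDWKAHSKTC",  # residues 174-196
}


def conserved_mismatches(consensus: str, hit: str) -> list[tuple[int, str, str]]:
    """Return (HMM position, consensus residue, hit residue) for every
    uppercase consensus position where the hit carries a different residue.

    Assumption: uppercase in the consensus line means "highly conserved".
    Positions are 1-based, matching the HMM Start/End columns.
    """
    if len(consensus) != len(hit):
        raise ValueError("consensus and hit must be the same length (ungapped)")
    return [
        (pos, cons, res)
        for pos, (cons, res) in enumerate(zip(consensus, hit), start=1)
        if cons.isupper() and cons != res
    ]


for number, hit in HITS.items():
    print(number, conserved_mismatches(CONSENSUS, hit))
```

Under that assumption, domain 1 matches every conserved position, while domain 2 differs at HMM positions 10, 16 and 23, the last of which carries a cysteine where the consensus has the final histidine. This agrees with the lower score reported for domain 2.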

Sequence
Protein Sequence    Length: 590 aa
MMKGLIFHQQ QQQQQQHHHH QVLEENMSNL TSQSGTEASV SSGNIRGAET TNHQQYFATP  60
PTQAQPPAKK KRNLPGNPDP DAEVIALSPK TLMATNRFVC EICNKGFQRD QNLQLHRRGH  120
NLPWKLKQRT SKEIRKKVYV CPEPNCVHHD PSRALGDLTG IKKHFCRKHG EKKWKCDKCS  180
KRYAVQSDWK AHSKTCGTRE YRCDCGTLFS RRDSFITHRA FCDALAEEST RAITGTNPIL  240
SSSSHHQPGI VAGASSHVNL QIPQFNPQDF SVFSLKKEQQ SYSLRQEMPP WLGSQQPSIL  300
GSAVPGLGQP PSSSHTVDHL SSPSSSIFNT RLHQDHQFTQ TTHQDLTRND HPANPNPSLG  360
PTLSVPHTNY HQAMASAFPH MSATALLQKA AQMGATMSSS KASTATGNSS SSSSPAHHAA  420
LTRPHQQPPP PQQAHVSATP EHPAGNNKTK TTTGFGLNLS SREGVVHGLT PFGTKTSGGG  480
SSGAFIQEMI MNTSFSSGYA AASPFDDALT FGGVFNSKKE PHLNHSFNES SSLSRTSGIN  540
DHGEEMTRDF LGLRALSQTD ILNIAGLGNC IDTRSSHEQQ LNHSQKPWQG
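
The coordinates reported in this record are 1-based and inclusive, so they can be checked directly against the sequence block above. The following is a minimal sketch, not part of the database itself: it strips the spacing and running position counters from the blocked layout and slices out the two zf-C2H2 regions and the listed localization signal. The helper names `parse_blocked` and `region` are hypothetical.

```python
import re

# Sequence block as printed in this record: 10-residue groups with a running
# position counter at the end of each full line.
BLOCKED = """
MMKGLIFHQQ QQQQQQHHHH QVLEENMSNL TSQSGTEASV SSGNIRGAET TNHQQYFATP  60
PTQAQPPAKK KRNLPGNPDP DAEVIALSPK TLMATNRFVC EICNKGFQRD QNLQLHRRGH  120
NLPWKLKQRT SKEIRKKVYV CPEPNCVHHD PSRALGDLTG IKKHFCRKHG EKKWKCDKCS  180
KRYAVQSDWK AHSKTCGTRE YRCDCGTLFS RRDSFITHRA FCDALAEEST RAITGTNPIL  240
SSSSHHQPGI VAGASSHVNL QIPQFNPQDF SVFSLKKEQQ SYSLRQEMPP WLGSQQPSIL  300
GSAVPGLGQP PSSSHTVDHL SSPSSSIFNT RLHQDHQFTQ TTHQDLTRND HPANPNPSLG  360
PTLSVPHTNY HQAMASAFPH MSATALLQKA AQMGATMSSS KASTATGNSS SSSSPAHHAA  420
LTRPHQQPPP PQQAHVSATP EHPAGNNKTK TTTGFGLNLS SREGVVHGLT PFGTKTSGGG  480
SSGAFIQEMI MNTSFSSGYA AASPFDDALT FGGVFNSKKE PHLNHSFNES SSLSRTSGIN  540
DHGEEMTRDF LGLRALSQTD ILNIAGLGNC IDTRSSHEQQ LNHSQKPWQG
"""


def parse_blocked(text: str) -> str:
    """Keep residue letters only, dropping whitespace and position counters."""
    return re.sub(r"[^A-Za-z]", "", text).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return the residues for 1-based, inclusive coordinates."""
    return seq[start - 1:end]


seq = parse_blocked(BLOCKED)
print(len(seq))              # total length in residues
print(region(seq, 98, 120))  # first zf-C2H2 hit
print(region(seq, 174, 196)) # second zf-C2H2 hit
print(region(seq, 125, 137)) # listed localization signal
```

The slices should reproduce the hit strings shown in the signature domain alignments and the signal sequence listed in the record, which is a quick consistency check on the start and end columns.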
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    125    137  KLKQRTSKEIRKK
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G55110.1  1e-118   C2H2 family protein