PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID KXZ49867.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Volvocaceae; Gonium
Family C2H2
Protein Properties Length: 1584 aa    MW: 162681 Da    pI: 6.7039
Description C2H2 family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
KXZ49867.1     genome  GPGRP   View CDS
Signature Domain
No.  Domain   Score  E-value  Start  End  HMM Start  HMM End
1    zf-C2H2  14.6   9.3e-05  39     61   2          23
                EETTTTEEESSHHHHHHHHHH.T CS
     zf-C2H2  2 kCpdCgksFsrksnLkrHirt.H 23
                kCp+C+k++ ++  Lk Hi + H
  KXZ49867.1 39 KCPYCDKTYQQSGRLKDHIAKqH 61
                7******************9988 PP

Protein Features
Database         Entry ID           E-value   Start  End   InterPro ID  Description
PROSITE profile  PS50157            8.912     38     66    IPR007087    Zinc finger, C2H2
SMART            SM00355            0.0069    38     61    IPR015880    Zinc finger, C2H2-like
PROSITE pattern  PS00028            0         40     61    IPR007087    Zinc finger, C2H2
Gene3D           G3DSA:3.10.110.10  1.6E-6    490    612   IPR016135    Ubiquitin-conjugating enzyme/RWD-like
SuperFamily      SSF54495           1.3E-5    493    606   IPR016135    Ubiquitin-conjugating enzyme/RWD-like
PROSITE profile  PS50908            10.878    497    613   IPR006575    RWD domain
SMART            SM00487            1.6E-9    714    955   IPR014001    Helicase superfamily 1/2, ATP-binding domain
Gene3D           G3DSA:3.40.50.300  2.8E-35   741    937   IPR027417    P-loop containing nucleoside triphosphate hydrolase
PROSITE profile  PS51192            11.886    751    936   IPR014001    Helicase superfamily 1/2, ATP-binding domain
SuperFamily      SSF52540           6.2E-24   754    938   IPR027417    P-loop containing nucleoside triphosphate hydrolase
CDD              cd00046            3.50E-9   759    934   No hit       No description
SuperFamily      SSF52540           6.2E-24   1053   1156  IPR027417    P-loop containing nucleoside triphosphate hydrolase
SuperFamily      SSF52540           7.82E-18  1122   1290  IPR027417    P-loop containing nucleoside triphosphate hydrolase
SMART            SM00847            1.0E-20   1228   1323  IPR007502    Helicase-associated domain
Pfam             PF04408            2.0E-17   1229   1322  IPR007502    Helicase-associated domain
Pfam             PF07717            3.9E-16   1409   1517  IPR011709    Domain of unknown function DUF1605
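
The PROSITE pattern hit (PS00028, residues 40 to 61) can be checked by hand against the N-terminus of the protein. The sketch below assumes the PS00028 consensus is C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-x(3,5)-H and rewrites it as a Python regular expression. The pattern string and the helper name find_c2h2 are illustrative assumptions, not part of this record, and should be verified against PROSITE before being relied on.

```python
import re

# Assumed PS00028 consensus, written as a regular expression:
# C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-x(3,5)-H
C2H2_PATTERN = re.compile(r"C.{2,4}C.{3}[LIVMFYWC].{8}H.{3,5}H")

# First 70 residues of KXZ49867.1, copied from the Sequence section.
N_TERMINUS = (
    "MGGTKNRDAKMKANKVFSSGGGGGTKGPAVVDKRGQEVKC"
    "PYCDKTYQQSGRLKDHIAKQHADQLQPQDA"
)


def find_c2h2(sequence):
    """Return (start, end, motif) in 1-based inclusive coordinates, or None."""
    match = C2H2_PATTERN.search(sequence)
    if match is None:
        return None
    # match.start() is 0-based; match.end() is exclusive, so it already
    # equals the 1-based inclusive end position.
    return match.start() + 1, match.end(), match.group()


print(find_c2h2(N_TERMINUS))
```

On this fragment the match spans residues 40 to 61 (CPYCDKTYQQSGRLKDHIAKQH), which agrees with the start and end reported for the PROSITE pattern hit.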
Gene Ontology
GO Term     GO Category         GO Description
GO:0004386  Molecular Function  helicase activity
GO:0005515  Molecular Function  protein binding
GO:0005524  Molecular Function  ATP binding
GO:0046872  Molecular Function  metal ion binding
Sequence
Protein Sequence    Length: 1584 aa
MGGTKNRDAK MKANKVFSSG GGGGTKGPAV VDKRGQEVKC PYCDKTYQQS GRLKDHIAKQ  60
HADQLQPQDA PAAEAGAGPA AAAAAPAAGK PAAAAAKATA AAAPAAASAA AANLAAAAAA  120
AARSAASKGP GSGSGGAGGP SGSGAGAGGA ASAAGGAGGG MMTLNSRAGY YTCKSPTMHL  180
HEWTQREKRP RPRVVPKQLD SGLWTCKVVM PDPKRQEDDV VVFLDEEHAA PDEAEARERG  240
AVAALNRVQG DRALERILPK DYVPLWNALG EERKAREQAA AARRAAEDRR KQQDEARRKR  300
SNTRTPQVVV MSAAKQRLVQ SLLRDRGAAP GGAGATRGSS AAESAEADLS GVVEQLSDMG  360
FSPQHVQLAY ERGAGAAAGG GGGGVSVEAL LDWLLINLPQ DQLPSQFTKG STTNMVAVVR  420
RGHAGAGGGG ASLQSVARLM AYGFTSAECE SALAAAGGDE LAAHVRLYGE LTGVQLPNPG  480
PSSEAAAADG AADAWAEELM VLESIYDSQF AVLSADAVRL TLELPADCAV ATRADAQLVL  540
EFVRLGGGGG GGGGEGAPAY PDAPPLVSVS CAGLAAGALR HLTRVMAQES QQLLGQPALH  600
DLATCALEAA AAIDEEAAAE AEAGGGASRG AGSGARRGDA DADDGDGGDL ERQGSSLVEG  660
FEGLMVEDEE EDDEAEAKEG RRKAGGPADR RARQEGPAGA GGGGRRLPID LAAESARLSR  720
LQADLDALPR HADMRRARAA LPAAAKRPEL LELLRCHDVV VVSGATGCGK STQASGRAGG  780
REGRGGKAAE EDGAVPQYIL EDAIASGQGA RCNIVVTQPR RISALGLASR VAAERGESAG  840
ETVGYSVRLE HRASAATRLT FVTTGILLRR LLSDPDLEGA THIVLDEVHE RSIEIDLLLL  900
LLRDVLARRR RQAAAAGSGV PEPPPLKLVL MSATADAQLF AGYMNADGGD GALIGSGAAG  960
GGGGGAKGKA GGKGAAGAGG GGGGGGGASV GMITIPGFTY PVREFYLEDV FEMTGHAVGR  1020
DNRCAKRGGA GGKDKDKAFE PKLSHALKSY SDQTMRSLSL VDEDQINYEL LVDLVAEIVG  1080
RHRRDGAAAF LSDWPQAFSS GRGDTSGGGA LLVFLPGAPE ISRLQEGWVS RAAAQQRRGR  1140
AGRVRPGICF RVFSSEQWDR MPNHTEPEML RSPLESVCLL VKGMTSAKAE AGGGAAAAAV  1200
AGAGSASGGV ADFLSRCLSP PSGRSVSAAV GLLRNIGAFD ASEQLTSLGR HLNRMPMDPR  1260
VGKALVYGCM LGCLDPVLTV TAAMAHGRPV FLNLQSAGEG VAAARAQLLK AAVASKSDHI  1320
ALVAAYNAWC KAVDKGGRQA GSHLCSDCGL SESSLEAIQS GRAEYARVLE ELGFLGDQDG  1380
CARGGSAASA CRAAASSLPG SEWLAAPANR NAGNARFLKA ALCAGFYPSV LRVDHPRTKY  1440
KEVYGGAEEA DSDPKDIKFF DREKGRTFIH PGSVAFTVGK FESGWLVYTQ MTETSKLFVR  1500
EVSMVPVYAM LLFGGEISVE HSAGLLRLDG WAEFKAAPTV GVMVREMRSE LDRLLGAKIA  1560
DPGLPLASNR IVAAVEELLS TDGF
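
The sequence is displayed in groups of 10 residues with a running residue count at the end of each full line, so it needs to be flattened before it can be passed to other tools. A minimal sketch of that clean-up follows. The helper name parse_sequence_block is hypothetical, and only the first two lines of the block are used to keep the example short.

```python
def parse_sequence_block(block):
    """Join a numbered, space-grouped sequence block into one string."""
    # Keep letters only: this drops the spaces between 10-residue groups,
    # the line breaks, and the running residue counts at the line ends.
    return "".join(ch for ch in block if ch.isalpha())


# First two lines of the protein sequence block, used as a short example.
EXAMPLE_BLOCK = """\
MGGTKNRDAK MKANKVFSSG GGGGTKGPAV VDKRGQEVKC PYCDKTYQQS GRLKDHIAKQ  60
HADQLQPQDA PAAEAGAGPA AAAAAPAAGK PAAAAAKATA AAAPAAASAA AANLAAAAAA  120
"""

sequence = parse_sequence_block(EXAMPLE_BLOCK)
print(len(sequence))  # 120
```

Applied to the full block, the same function should return a string whose length equals the stated protein length of 1584 aa, which is a quick check that no residues were lost in copying.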
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
5aor_A  4e-76   738          1562       381        1126     DOSAGE COMPENSATION REGULATOR
5aor_B  4e-76   738          1562       381        1126     DOSAGE COMPENSATION REGULATOR
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    187    191  KRPRP
Annotation -- Protein
Source  Hit ID      E-value  Description
TrEMBL  A0A150GJ80  0.0      A0A150GJ80_GONPE; Uncharacterized protein