PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GAY58471.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Sapindales; Rutaceae; Aurantioideae; Citrus
Family C3H
Protein Properties Length: 887 aa    MW: 98158.3 Da    pI: 9.0504
Description C3H family protein
Gene Model
Gene Model ID    Type      Source    Coding Sequence
GAY58471.1       genome    NCBI      View CDS
Signature Domain
No.    Domain     Score    E-value    Start    End    HMM Start    HMM End
1      zf-CCCH    19.6     1.7e-06    188      208    5            26
                 --SGGGGTS--TTTTT-SS-SS CS
     zf-CCCH   5 lCrffartGtCkyGdrCkFaHg 26 
                 lC++fa+ G C++G+ C+F+H 
  GAY58471.1 188 LCKDFAA-GKCRRGSHCHFSHH 208
                 9******.*************5 PP
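
The Start and End values of the domain hit behave as 1-based, inclusive residue positions, so the matched region can be recovered from the protein sequence by slicing. The following is a minimal Python sketch of that check. The helper name `slice_1based` is hypothetical (it is not part of PlantTFDB), and only residues 181-240 of the sequence listed on this page are used.

```python
# Residues 181-240 of GAY58471.1, copied from the protein sequence on this page.
FRAGMENT_START = 181
FRAGMENT = (
    "RPGVSAQLCK" "DFAAGKCRRG" "SHCHFSHHSS"
    "QSYEDNWDSR" "HKQAGAPRFS" "TPHESREYPI"
)


def slice_1based(fragment, fragment_start, start, end):
    """Return residues start..end (1-based, inclusive) from a fragment
    whose first residue sits at position fragment_start.

    Hypothetical helper written for illustration only.
    """
    lo = start - fragment_start          # convert to a 0-based offset
    hi = end - fragment_start + 1        # inclusive end -> exclusive end
    if lo < 0 or hi > len(fragment):
        raise ValueError("requested range lies outside the fragment")
    return fragment[lo:hi]


# Coordinates taken from the Signature Domain table (Start 188, End 208).
domain = slice_1based(FRAGMENT, FRAGMENT_START, 188, 208)

# Query row of the alignment with its single gap character removed.
aligned_query = "LCKDFAA-GKCRRGSHCHFSHH".replace("-", "")

print(domain, domain == aligned_query)
```

The slice and the ungapped alignment row should agree, which confirms that the table coordinates refer to the full protein sequence rather than to the HMM.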

Sequence
Protein Sequence    Length: 887 aa
MSGTRRKHNS KWDLKEESQL SHEKVRDSAR PGKAGISFYE RESRSGRFSP RAAGYNSGHN  60
WSAREADDIQ SSRHDMQFSS REPLPGSRSS RKDDRIDDYR ENFKATATWD ADGNYDMKMS  120
PGLDDWRQQI RRRSPRKDWN GHRRSRSRSR SRSRSRSWNR SRSPVRELRR ESGVYGRNRG  180
RPGVSAQLCK DFAAGKCRRG SHCHFSHHSS QSYEDNWDSR HKQAGAPRFS TPHESREYPI  240
RSGRNREGSL EIVDIPCKFF AAGNCRNGKY CKFFHSSQAL ASPVRRSRDD SLVRGQNSDE  300
REKLWHGSKW TDATTISDAA RLSEDKNERM GAKKSRDDGL VRNHNSDDVE KLWNGSTWNG  360
TDISTDAAKL SENQNVGMGA PGPRFSGWST DDRLPHTLDE NATHSKITAV TLGGDEINKM  420
EASQGSIKIA GAVMGAPESG GTENWLGDME MSPEWNYPVK PCSRVMNEDH GQITRSSQSL  480
PICDTSVLHE QGIIQETSGL LCDEAATMEP MMDKSYLKRD INQRDVGGVR LPGADKVAIG  540
ETAIPHIDLN FSANVLPTQG LEQNGQSSSA LPFLNLNSIG QSQGAINSES SRGGNINNPQ  600
NHAVFQVEKS INKPGTGDGS ALQFSSAIQP TQNMVSSEQL TQLTNLSASL VQILGNGQQL  660
PQLYAALNSH NVMQVPSSVK SEGPIAPDSA VASQTSEAIR SQNQNQSQYD PLSDSIDPKQ  720
LELVSPPGFS VNPSGQKSNA DGKPNGGLEN HKVSEINGEV EAEEGRKAQA ENKVPQENGE  780
VQKTDGDDKD DKADEGKKSK DTKGLRAFKF ALAEFVKELL KPTWKEGQIN KDAYKNIVKK  840
VVDKVIGTMQ GAANIPQTQE KIDQYLSFSK SKLTKLVQAY VEKSRKG
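
The listing above groups residues in blocks of ten and ends most lines with a running residue count, so it has to be cleaned before it can be used as a plain sequence string. The following is a minimal Python sketch of one way to do that. The function name `parse_sequence_block` is hypothetical, and only the first two lines of the listing are used as input to keep the example short.

```python
import re


def parse_sequence_block(text):
    """Join a grouped sequence listing into one string.

    Everything that is not a letter (spaces, newlines, and the running
    residue counts at the end of each line) is dropped.
    Hypothetical helper written for illustration only.
    """
    return re.sub(r"[^A-Za-z]", "", text).upper()


# First two lines of the listing, copied verbatim from this page.
block = """
MSGTRRKHNS KWDLKEESQL SHEKVRDSAR PGKAGISFYE RESRSGRFSP RAAGYNSGHN  60
WSAREADDIQ SSRHDMQFSS REPLPGSRSS RKDDRIDDYR ENFKATATWD ADGNYDMKMS  120
"""

seq = parse_sequence_block(block)
print(len(seq), seq[:10])
```

Applied to the full listing, the length of the result can be compared against the stated length of 887 aa as a quick check that no residues were lost during copying.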
Nuclear Localization Signal
NLS
No.    Start    End    Sequence
1      144      152    RSRSRSRSR
2      144      154    RSRSRSRSRSR
3      146      152    RSRSRSRSR
4      146      156    RSRSRSRSRSR
5      148      156    RSRSRSRSR
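
The five predicted signals overlap one another within the same arginine/serine repeat. The following is a minimal Python sketch that collapses the overlapping hits into distinct regions. The helper name `merge_intervals` is hypothetical (it is not part of PlantTFDB), and the coordinates are treated as 1-based and inclusive.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent (start, end) ranges.

    Coordinates are 1-based and inclusive.
    Hypothetical helper written for illustration only.
    """
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous region: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Start/End pairs copied from the NLS table.
nls_hits = [(144, 152), (144, 154), (146, 154), (146, 156), (148, 156)]

print(merge_intervals(nls_hits))
```

For this entry all five hits fall into a single region spanning residues 144-156.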
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT2G33835.1    2e-13      C3H family protein